-
Notifications
You must be signed in to change notification settings - Fork 371
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[SKILL.md Chore] make .agents/ the cannonical agent-skills location
#1362
opened Apr 28, 2026 by
shljessie
Loading…
[6034518] Remove return statement preventing remote auto tuning
#1361
opened Apr 28, 2026 by
dthienan-nv
Contributor
Loading…
Ensure removal of temp files on error in ONNX INT4 quantization
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1359
opened Apr 28, 2026 by
vishalpandya1990
Contributor
Loading…
Add pre-built evaluation recipes for common benchmarks
#1357
opened Apr 27, 2026 by
kaix-nv
Contributor
Loading…
[6106576] Restore llm_export_utils as deprecated shim for edgellm 0.6.1 compat
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1356
opened Apr 27, 2026 by
ajrasane
Contributor
Loading…
2 tasks done
[6110209] Patch zero FP16 scales in INT4_AWQ ONNX export
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1353
opened Apr 27, 2026 by
ajrasane
Contributor
Loading…
[OMNIML-4021]: align local JSONL loading with HF datasets path + keep original behaviour
#1345
opened Apr 24, 2026 by
shengliangxu
Collaborator
Loading…
3 tasks done
[OMNIML-3934] Guidelines and precommit hook for pydantic backward compatbility
#1333
opened Apr 23, 2026 by
jenchen13
Contributor
Loading…
[Refactor] speculative decoding: use mto config subsystem
#1328
opened Apr 23, 2026 by
h-guo18
Contributor
Loading…
Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe
#1327
opened Apr 22, 2026 by
ajrasane
Contributor
Loading…
3 of 5 tasks
Add Nemotron-Nano-9B-v2 → Pruned 7B e2e tutorial: Prune + Distill + Eval + Quantize + vLLM deployment
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1325
opened Apr 22, 2026 by
kevalmorabia97
Collaborator
Loading…
Fix NVFP4 quantization for Qwen3.x MoE models (4 silent-failure bugs)
#1323
opened Apr 22, 2026 by
erictinkeredapps
Loading…
Add demo (Puzzletron and Minitron guide) in Model-Optimizer/examples/pruning/ with README and notebooks
documentation
Improvements or additions to documentation
#1320
opened Apr 22, 2026 by
achidiac-nv
Loading…
fix: layerwise calibration backward-compat, recipe split, batch-size guard
#1310
opened Apr 21, 2026 by
realAsma
Contributor
Loading…
2 tasks done
Previous Next
ProTip!
Follow long discussions with comments:>50.