Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

k25 dflash hardcode support
#1367 opened Apr 29, 2026 by h-guo18 Contributor Draft
[Fix]: $HOME in launcher eagle example
#1365 opened Apr 28, 2026 by h-guo18 Contributor Loading…
Experiment: MXFP4 -> NVFP4 conversion MSE study (scratch)
#1364 opened Apr 28, 2026 by cjluo-nv Collaborator Draft
3 tasks
Add Nemotron Super v3 NVFP4 PTQ recipe
#1363 opened Apr 28, 2026 by jenchen13 Contributor Loading…
[6034518] Remove return statement preventing remote auto tuning
#1361 opened Apr 28, 2026 by dthienan-nv Contributor Loading…
Ensure removal of temp files on error in ONNX INT4 quantization cherry-pick-0.44.0 After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1359 opened Apr 28, 2026 by vishalpandya1990 Contributor Loading…
Enable runtime optimization
#1358 opened Apr 28, 2026 by grzegorz-k-karch Contributor Draft
Add pre-built evaluation recipes for common benchmarks
#1357 opened Apr 27, 2026 by kaix-nv Contributor Loading…
[6106576] Restore llm_export_utils as deprecated shim for edgellm 0.6.1 compat cherry-pick-0.44.0 After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1356 opened Apr 27, 2026 by ajrasane Contributor Loading…
2 tasks done
[6110209] Patch zero FP16 scales in INT4_AWQ ONNX export cherry-pick-0.44.0 After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1353 opened Apr 27, 2026 by ajrasane Contributor Loading…
[minor] fixes for layerwise calib + MSE
#1344 opened Apr 24, 2026 by Fridah-nv Contributor Loading…
DSV4 dequant on the fly
#1341 opened Apr 24, 2026 by mxinO Contributor Draft
Update
#1338 opened Apr 23, 2026 by jingyu-ml Contributor Draft
[Refactor] speculative decoding: use mto config subsystem
#1328 opened Apr 23, 2026 by h-guo18 Contributor Loading…
Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe
#1327 opened Apr 22, 2026 by ajrasane Contributor Loading…
3 of 5 tasks
Update the DMD2 at the first stage
#1326 opened Apr 22, 2026 by jingyu-ml Contributor Draft
Add Nemotron-Nano-9B-v2 → Pruned 7B e2e tutorial: Prune + Distill + Eval + Quantize + vLLM deployment cherry-pick-0.44.0 After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1325 opened Apr 22, 2026 by kevalmorabia97 Collaborator Loading…
VSA support for Wan 2.2 and LTX2
#1315 opened Apr 22, 2026 by jingyu-ml Contributor Loading…
Support NVFP4 W4A16 quantization
#1313 opened Apr 22, 2026 by hychiang-git Contributor Loading…
fix: layerwise calibration backward-compat, recipe split, batch-size guard
#1310 opened Apr 21, 2026 by realAsma Contributor Loading…
2 tasks done
ProTip! Follow long discussions with comments:>50.