Pull requests: NVIDIA/Fuser
check hardware constraints when setting threads per sm
enable-auto-merge
Auto-merge a PR when: 1) PR mergeable 2) Internal CI complete 3) No failures
#5814 opened Jan 13, 2026 by liqiangxl
print shared memory limited blocks per SM
enable-auto-merge
#5813 opened Jan 13, 2026 by liqiangxl
Adding pytest script for llama4 inference benchmark
#5805 opened Jan 13, 2026 by jjsjann123
pass lparams to lower pass to allow dynamic shapes in inner persistent warp specialized scheduler
#5785 opened Jan 9, 2026 by liqiangxl
Migrate from pybind11 to nanobind
Direct Bindings
Python extension with direct mapping to NvFuser CPP objects.
GroupedBlockQuantizeOp PR2: Adding python API and updating llama4 benchmark
#5777 opened Jan 8, 2026 by jjsjann123 • Draft
support dynamic shapes in warp specialized inner outer persistent scheduler
#5765 opened Jan 6, 2026 by liqiangxl