Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

fix: Revert NIXL and ETCD from the main image
#4190 opened May 9, 2025 by Shixiaowei02 Loading…
Cherry-pick feat/llama4's 1-17 commits to main
#4189 opened May 9, 2025 by chenfeiz0326 Loading…
[bug/5247505] fix: CP accuracy on Blackwell
#4188 opened May 9, 2025 by DylanChen-NV Loading…
test: Remove CNN Dailymail tasks in favor of GSM8K
#4187 opened May 9, 2025 by syuoni Loading…
infra: open source fmha v2 kernels
#4185 opened May 9, 2025 by qsang-nv Loading…
feat: Support for Mistral Small 3.1 24B VLM
#4183 opened May 9, 2025 by brb-nv Loading…
^gdr_copy
#4181 opened May 9, 2025 by chuangz0 Loading…
add changes for fp8, nemotron-nas, API
#4180 opened May 9, 2025 by shaharmor98 Loading…
chore: Deprecate evaltool
#4173 opened May 9, 2025 by Tracin Loading…
chore: Remove deprecated Python runtime benchmark
#4171 opened May 9, 2025 by kaiyux Loading…
exp: pull/4114
#4170 opened May 9, 2025 by tongyuantongyu Draft
[feat] [AutoDeploy] Llama-4 Support
#4163 opened May 8, 2025 by lucaslie Loading…
2 of 5 tasks
fix: bump xgrammar
#4160 opened May 8, 2025 by milesial Draft
Add test case for kv memory estimation
#4158 opened May 8, 2025 by HuiGao-NV Loading…
remove cache_transceiver_prealloc_size
#4153 opened May 8, 2025 by chuangz0 Loading…
infra: Move SBSA build stage to Blossom
#4152 opened May 8, 2025 by ZhanruiSunCh Loading…
chore:update modelopt to 0.29
#4150 opened May 8, 2025 by nv-guomingz Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.