Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[https://nvbugs/6160629][fix] AutoDeploy: increase rtol for bf16 HF vs FI rope test
#14954 opened Jun 4, 2026 by galagam Collaborator Loading…
1 task done
[https://nvbugs/5859886][fix] Remove the waiver
#14948 opened Jun 4, 2026 by ziyixiong-nv Collaborator Loading…
1 task
[None][opt] attn kernel epilogue fuse RopeQuant
#14947 opened Jun 4, 2026 by yunruis Contributor Loading…
1 task done
[None][feat] Support beam search in KV cache manager v2
#14945 opened Jun 4, 2026 by yizhang-nv Member Loading…
1 task done
[None][feat] AutoDeploy: Fix hardcoded configs
#14943 opened Jun 4, 2026 by taylor-yb-lee Collaborator Loading…
1 task done
[https://nvbugs/6245317][test] set Harmony tiktoken env for GPT-OSS disagg
#14935 opened Jun 4, 2026 by dongfengy Collaborator Loading…
1 task done
[TRTLLM-13168][feat] test best ucx env for cache transceiver
#14933 opened Jun 4, 2026 by chuangz0 Collaborator Loading…
1 task done
[None][perf] Optimize r128 compressor prefill reduction deepseek-v4
#14927 opened Jun 4, 2026 by mingyangHao Collaborator Loading…
1 task done
Fix PyExecutor FPM iteration timing
#14922 opened Jun 3, 2026 by tedzhouhk Loading…
[TRTLLM-12648][test] implement disagg cancellation injector thread
#14920 opened Jun 3, 2026 by chienchunhung Collaborator Loading…
1 task done
[https://nvbugs/6250866][fix] fix deep-ep illegal memory access for GPTOSS on GB200
#14919 opened Jun 3, 2026 by dongfengy Collaborator Loading…
1 task done
ProTip! Add no:assignee to see everything that’s not assigned.