Pull requests: NVIDIA/TensorRT-LLM
Pull requests list
[None][infra] Add nv-xtf, rahul-steiger-nv, tedzhouhk, tensorrt-cicd to blossom-ci allowlist
#14955
opened Jun 4, 2026 by
ZhanruiSunCh
Collaborator
1 task done
[https://nvbugs/6160629][fix] AutoDeploy: increase rtol for bf16 HF vs FI rope test
#14954
opened Jun 4, 2026 by
galagam
Collaborator
1 task done
[None][feat] Sparse-attention behavior-layer framework + V2-migrated RocketKV with chunked prefill
#14953
opened Jun 4, 2026 by
Hudayday
Collaborator
[None][fix] Optimize cache-aware router backfill and validate worker handshake
deepseek-v4
#14950
opened Jun 4, 2026 by
Shixiaowei02
Collaborator
1 task done
[https://nvbugs/5859886][fix] Remove the waiver
#14948
opened Jun 4, 2026 by
ziyixiong-nv
Collaborator
1 task
[None][opt] attn kernel epilogue fuse RopeQuant
#14947
opened Jun 4, 2026 by
yunruis
Contributor
1 task done
[None][feat] Support beam search in KV cache manager v2
#14945
opened Jun 4, 2026 by
yizhang-nv
Member
1 task done
[TRTLLM-13052][feat] Enable TRTLLM moe backend for nemotron-h BF16 ckpt
#14944
opened Jun 4, 2026 by
Wanli-Jiang
Collaborator
Draft
1 task done
[None][feat] AutoDeploy: Fix hardcoded configs
#14943
opened Jun 4, 2026 by
taylor-yb-lee
Collaborator
1 task done
[TRTLLM-10184][chore] Remove legacy XQA precompiled path
#14941
opened Jun 4, 2026 by
pengbowang-nv
Collaborator
Draft
1 task
[None][fix] Update DeepGEMM to fix paged MQA metadata OOB
deepseek-v4
#14940
opened Jun 4, 2026 by
Barry-Delaney
Collaborator
[https://nvbugs/6211441][fix] Resolve yaml_extra paths from the configs dir via a class-level YAML_EXTRA…
#14938
opened Jun 4, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[https://nvbugs/6217533][fix] Prevent eviction-queue heap corruption from duplicate releaseBlock
#14937
opened Jun 4, 2026 by
Shixiaowei02
Collaborator
1 task done
[https://nvbugs/6261164][fix] When spec_config is None, allocate a 1-slot placeholder for…
#14936
opened Jun 4, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[https://nvbugs/6245317][test] set Harmony tiktoken env for GPT-OSS disagg
#14935
opened Jun 4, 2026 by
dongfengy
Collaborator
1 task done
[TRTLLM-13168][feat] test best ucx env for cache transceiver
#14933
opened Jun 4, 2026 by
chuangz0
Collaborator
1 task done
[https://nvbugs/6248827][fix] Wrap both accesses in getattr(self.config, 'num_nextn_predict_layers', 0)…
#14932
opened Jun 4, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[None][bug] DSA: read index_topk from checkpoint instead of the V4 default
deepseek-v4
#14931
opened Jun 4, 2026 by
longcheng-nv
Collaborator
[None][perf] Optimize r128 compressor prefill reduction
deepseek-v4
#14927
opened Jun 4, 2026 by
mingyangHao
Collaborator
1 task done
[None][feat] Enable MTP for Step-3.7 NVFP4 and port Step-3.7VL vision tower to TRT-LLM modules
#14926
opened Jun 4, 2026 by
kaiyux
Member
1 task
[TRTLLM-12507][feat] Cudagraph support for routed-expert MoE LoRA with Cutlass backend - Part 1
#14923
opened Jun 4, 2026 by
brb-nv
Collaborator
1 task done
[TRTLLMINF-69][infra] Migrate A100X-FMHA-Post-Merge-1 and A100X-Triton-Post-Merge-[1,2] to SLURM
#14921
opened Jun 3, 2026 by
mlefeb01
Collaborator
1 task done
[TRTLLM-12648][test] implement disagg cancellation injector thread
#14920
opened Jun 3, 2026 by
chienchunhung
Collaborator
1 task done
[https://nvbugs/6250866][fix] fix deep-ep illegal memory access for GPTOSS on GB200
#14919
opened Jun 3, 2026 by
dongfengy
Collaborator
1 task done