NVIDIA / TensorRT-LLM Public

Pull requests: NVIDIA/TensorRT-LLM

739 Open 10,207 Closed

Pull requests list

[None][infra] Add nv-xtf, rahul-steiger-nv, tedzhouhk, tensorrt-cicd to blossom-ci allowlist

#14955 opened Jun 4, 2026 by ZhanruiSunCh Collaborator

1 task done

[https://nvbugs/6160629][fix] AutoDeploy: increase rtol for bf16 HF vs FI rope test

#14954 opened Jun 4, 2026 by galagam Collaborator

1 task done

[None][feat] Sparse-attention behavior-layer framework + V2-migrated RocketKV with chunked prefill

#14953 opened Jun 4, 2026 by Hudayday Collaborator

[None][fix] Optimize cache-aware router backfill and validate worker handshake (label: deepseek-v4)

#14950 opened Jun 4, 2026 by Shixiaowei02 Collaborator

1 task done

[https://nvbugs/5859886][fix] Remove the waiver

#14948 opened Jun 4, 2026 by ziyixiong-nv Collaborator

1 task

[None][opt] attn kernel epilogue fuse RopeQuant

#14947 opened Jun 4, 2026 by yunruis Contributor

1 task done

[None][feat] Support beam search in KV cache manager v2

#14945 opened Jun 4, 2026 by yizhang-nv Member

1 task done

[TRTLLM-13052][feat] Enable TRTLLM moe backend for nemotron-h BF16 ckpt

#14944 opened Jun 4, 2026 by Wanli-Jiang Collaborator • Draft

1 task done

[None][feat] AutoDeploy: Fix hardcoded configs

#14943 opened Jun 4, 2026 by taylor-yb-lee Collaborator

1 task done

[TRTLLM-10184][chore] Remove legacy XQA precompiled path

#14941 opened Jun 4, 2026 by pengbowang-nv Collaborator • Draft

1 task

[None][fix] Update DeepGEMM to fix paged MQA metadata OOB (label: deepseek-v4)

#14940 opened Jun 4, 2026 by Barry-Delaney Collaborator

[https://nvbugs/6211441][fix] Resolve yaml_extra paths from the configs dir via a class-level YAML_EXTRA…

#14938 opened Jun 4, 2026 by tensorrt-cicd Collaborator

2 tasks done

[https://nvbugs/6217533][fix] Prevent eviction-queue heap corruption from duplicate releaseBlock

#14937 opened Jun 4, 2026 by Shixiaowei02 Collaborator

1 task done

[https://nvbugs/6261164][fix] When spec_config is None, allocate a 1-slot placeholder for…

#14936 opened Jun 4, 2026 by tensorrt-cicd Collaborator

2 tasks done

[https://nvbugs/6245317][test] set Harmony tiktoken env for GPT-OSS disagg

#14935 opened Jun 4, 2026 by dongfengy Collaborator

1 task done

[TRTLLM-13168][feat] test best ucx env for cache transceiver

#14933 opened Jun 4, 2026 by chuangz0 Collaborator

1 task done

[https://nvbugs/6248827][fix] Wrap both accesses in getattr(self.config, 'num_nextn_predict_layers', 0)…

#14932 opened Jun 4, 2026 by tensorrt-cicd Collaborator

2 tasks done

[None][bug] DSA: read index_topk from checkpoint instead of the V4 default (label: deepseek-v4)

#14931 opened Jun 4, 2026 by longcheng-nv Collaborator

[None][perf] Optimize r128 compressor prefill reduction (label: deepseek-v4)

#14927 opened Jun 4, 2026 by mingyangHao Collaborator

1 task done

[None][feat] Enable MTP for Step-3.7 NVFP4 and port Step-3.7VL vision tower to TRT-LLM modules

#14926 opened Jun 4, 2026 by kaiyux Member

1 task

[TRTLLM-12507][feat] Cudagraph support for routed-expert MoE LoRA with Cutlass backend - Part 1

#14923 opened Jun 4, 2026 by brb-nv Collaborator

1 task done

Fix PyExecutor FPM iteration timing

#14922 opened Jun 3, 2026 by tedzhouhk

[TRTLLMINF-69][infra] Migrate A100X-FMHA-Post-Merge-1 and A100X-Triton-Post-Merge-[1,2] to SLURM

#14921 opened Jun 3, 2026 by mlefeb01 Collaborator

1 task done

[TRTLLM-12648][test] implement disagg cancellation injector thread

#14920 opened Jun 3, 2026 by chienchunhung Collaborator

1 task done

[https://nvbugs/6250866][fix] fix deep-ep illegal memory access for GPTOSS on GB200

#14919 opened Jun 3, 2026 by dongfengy Collaborator

1 task done
