Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][fix] Fix regression from SageAttention kernel: Use static scheduler
#15047 opened Jun 6, 2026 by xrq-phys Collaborator Loading…
1 task done
[None][feat] Add KV cache block reuse policy
#15046 opened Jun 6, 2026 by jiaganc Collaborator Loading…
1 task done
[None][fix] Enforce Responses conversation history capacity
#15043 opened Jun 6, 2026 by fallintoplace Loading…
1 task done
[None][fix] Absolute-block KV-cache-aware routing cost
#15040 opened Jun 6, 2026 by Shixiaowei02 Collaborator Draft
1 task
[TRTLLM-13262][ci] Move non-default-feature tests to post merge
#15038 opened Jun 6, 2026 by QiJune Collaborator Loading…
1 task done
[None][test] Waive 5 failed cases for main in QA CI
#15037 opened Jun 6, 2026 by tensorrt-cicd Collaborator Loading…
[None][test] Waive 7 failed cases for main in QA CI
#15036 opened Jun 6, 2026 by tensorrt-cicd Collaborator Loading…
[TRTLLM-13259][ci] Merge DGX_H100 DeepSeek and GptOss stages
#15035 opened Jun 6, 2026 by QiJune Collaborator Loading…
1 task done
[None][test] Waive 18 failed cases for main in QA CI
#15034 opened Jun 6, 2026 by tensorrt-cicd Collaborator Loading…
[None][feat] AutoDeploy: Add DeepSeekV4 Support
#15019 opened Jun 5, 2026 by bmarimuthu-nv Collaborator Draft
1 task
[#12933][fix] release orphaned KV-cache blocks in storeContextBlocks
#15018 opened Jun 5, 2026 by fbxai Loading…
1 task done
ProTip! Add no:assignee to see everything that’s not assigned.