Pull requests: NVIDIA/TensorRT-LLM
[None][test] Waive 9 failed cases for main in QA CI
#15051
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][test] Waive 5 failed cases for main in QA CI
#15050
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][test] Waive 3 failed cases for main in QA CI
#15049
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][test] Waive 3 failed cases for main in QA CI
#15048
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][fix] Fix regression from SageAttention kernel: Use static scheduler
#15047
opened Jun 6, 2026 by
xrq-phys
Collaborator
1 task done
[None][feat] Add KV cache block reuse policy
#15046
opened Jun 6, 2026 by
jiaganc
Collaborator
1 task done
[None][test] Waive 5 failed cases for main in QA CI
#15045
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][fix] Enforce Responses conversation history capacity
#15043
opened Jun 6, 2026 by
fallintoplace
1 task done
[None][fix] Absolute-block KV-cache-aware routing cost
#15040
opened Jun 6, 2026 by
Shixiaowei02
Collaborator
•
Draft
1 task
[None][test] Waive 1 failed cases for main in QA CI
#15039
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[TRTLLM-13262][ci] Move non-default-feature tests to post merge
#15038
opened Jun 6, 2026 by
QiJune
Collaborator
1 task done
[None][test] Waive 5 failed cases for main in QA CI
#15037
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
[None][test] Waive 7 failed cases for main in QA CI
#15036
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
[TRTLLM-13259][ci] Merge DGX_H100 DeepSeek and GptOss stages
#15035
opened Jun 6, 2026 by
QiJune
Collaborator
1 task done
[None][test] Waive 18 failed cases for main in QA CI
#15034
opened Jun 6, 2026 by
tensorrt-cicd
Collaborator
fix: Allow explicit device mapping configurations in quantization CLI
#15029
opened Jun 5, 2026 by
Priyanshu31102003
[https://nvbugs/6272666][fix] Added a GPU-count-aware post-processing block in pytorch_model_config.py that…
#15028
opened Jun 5, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
fix: Align bitmask batch size with logits tensor dimension inside GuidedDecoder
#15026
opened Jun 5, 2026 by
Priyanshu31102003
[None][test] Waive 7 failed cases for main in QA CI
#15024
opened Jun 5, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[#15022][fix] Guided decoding (xgrammar) + EAGLE-3 + draft_len_schedule reaching 0 crashes during CUDA graph capture, "bitmask must have the same batch size as logits"
#15023
opened Jun 5, 2026 by
chungen04
1 task done
[https://nvbugs/6261164][fix] In the kvcache insert transform (_InsertCachedOperator._apply), when…
#15020
opened Jun 5, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[None][feat] AutoDeploy: Add DeepSeekV4 Support
#15019
opened Jun 5, 2026 by
bmarimuthu-nv
Collaborator
•
Draft
1 task
[#12933][fix] release orphaned KV-cache blocks in storeContextBlocks
#15018
opened Jun 5, 2026 by
fbxai
1 task done