NVIDIA / TensorRT-LLM Public

Notifications You must be signed in to change notification settings
Fork 2.4k
Star 13.8k

Code
Issues 598
Pull requests 761
Discussions
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security and quality
Insights

Pull requests: NVIDIA/TensorRT-LLM

Labels 65 Milestones 1

New pull request New

761 Open 10,273 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[None][test] Waive 9 failed cases for main in QA CI

#15051 opened Jun 6, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 5 failed cases for main in QA CI

#15050 opened Jun 6, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 3 failed cases for main in QA CI

#15049 opened Jun 6, 2026 by tensorrt-cicd Collaborator • Draft

[None][test] Waive 3 failed cases for main in QA CI

#15048 opened Jun 6, 2026 by tensorrt-cicd Collaborator • Draft

[None][fix] Fix regression from SageAttention kernel: Use static scheduler

#15047 opened Jun 6, 2026 by xrq-phys Collaborator

Loading…

1 task done

[None][feat] Add KV cache block reuse policy

#15046 opened Jun 6, 2026 by jiaganc Collaborator

Loading…

1 task done

[None][test] Waive 5 failed cases for main in QA CI

#15045 opened Jun 6, 2026 by tensorrt-cicd Collaborator • Draft

[None][fix] Enforce Responses conversation history capacity

#15043 opened Jun 6, 2026 by fallintoplace

Loading…

1 task done

[None][perf] disagg: skip per-chunk JSON parse on streaming usage rewrite

#15042 opened Jun 6, 2026 by lancelly Collaborator • Draft

[None][perf] disagg: single-pass request serialization on orchestrator forward path

#15041 opened Jun 6, 2026 by lancelly Collaborator • Draft

[None][fix] Absolute-block KV-cache-aware routing cost

#15040 opened Jun 6, 2026 by Shixiaowei02 Collaborator • Draft

1 task

[None][test] Waive 1 failed cases for main in QA CI

#15039 opened Jun 6, 2026 by tensorrt-cicd Collaborator • Draft

[TRTLLM-13262][ci] Move non-default-feature tests to post merge

#15038 opened Jun 6, 2026 by QiJune Collaborator

Loading…

1 task done

[None][test] Waive 5 failed cases for main in QA CI

#15037 opened Jun 6, 2026 by tensorrt-cicd Collaborator

Loading…

[None][test] Waive 7 failed cases for main in QA CI

#15036 opened Jun 6, 2026 by tensorrt-cicd Collaborator

Loading…

[TRTLLM-13259][ci] Merge DGX_H100 DeepSeek and GptOss stages

#15035 opened Jun 6, 2026 by QiJune Collaborator

Loading…

1 task done

[None][test] Waive 18 failed cases for main in QA CI

#15034 opened Jun 6, 2026 by tensorrt-cicd Collaborator

Loading…

fix: Allow explicit device mapping configurations in quantization CLI

#15029 opened Jun 5, 2026 by Priyanshu31102003

Loading…

[https://nvbugs/6272666][fix] Added a GPU-count-aware post-processing block in pytorch_model_config.py that…

#15028 opened Jun 5, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

fix: Align bitmask batch size with logits tensor dimension inside GuidedDecoder

#15026 opened Jun 5, 2026 by Priyanshu31102003

Loading…

[None][test] Waive 7 failed cases for main in QA CI

#15024 opened Jun 5, 2026 by tensorrt-cicd Collaborator • Draft

[#15022][fix] Guided decoding (xgrammar) + EAGLE-3 + draft_len_schedule reaching 0 crashes during CUDA graph capture, "bitmask must have the same batch size as logits"

#15023 opened Jun 5, 2026 by chungen04

Loading…

1 task done

[https://nvbugs/6261164][fix] In the kvcache insert transform (_InsertCachedOperator._apply), when…

#15020 opened Jun 5, 2026 by tensorrt-cicd Collaborator

Loading…

2 tasks done

[None][feat] AutoDeploy: Add DeepSeekV4 Support

#15019 opened Jun 5, 2026 by bmarimuthu-nv Collaborator • Draft

1 task

[#12933][fix] release orphaned KV-cache blocks in storeContextBlocks

#15018 opened Jun 5, 2026 by fbxai

Loading…

1 task done

Previous 1 2 3 4 5 … 30 31 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!