Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP][CI] Speed up sequence parallel tests ci/build ready ONLY add when PR is ready to merge/full CI is needed
#30568 opened Dec 12, 2025 by LucasWilkinson Loading…
[Bug] Fix AttributeError: 'Qwen3VLMoeConfig' object has no attribute 'intermediate_size' qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#30567 opened Dec 12, 2025 by yewentao256 Loading…
update to transformers v5 ci/build ready ONLY add when PR is ready to merge/full CI is needed
#30566 opened Dec 12, 2025 by hmellor Loading…
[Docs] Remove references to VLLM_ATTENTION_BACKEND documentation Improvements or additions to documentation
#30564 opened Dec 12, 2025 by MatthewBonanni Loading…
2 of 5 tasks
[Attention] Update tests to remove VLLM_ATTENTION_BACKEND ci/build kv-connector multi-modality Related to multi-modality (#4194) nvidia rocm Related to AMD ROCm speculative-decoding v1
#30563 opened Dec 12, 2025 by MatthewBonanni Loading…
2 of 5 tasks
[Refactor] Small refactor for group topk ready ONLY add when PR is ready to merge/full CI is needed v1
#30562 opened Dec 12, 2025 by yewentao256 Loading…
feat(serve): add warmup support for consistent first-request performance documentation Improvements or additions to documentation frontend
#30561 opened Dec 12, 2025 by TheCodeWrangler Loading…
5 tasks done
[Feat] Enable eplb with default all2all backend ready ONLY add when PR is ready to merge/full CI is needed
#30559 opened Dec 12, 2025 by yewentao256 Loading…
[Core] Support multi prompt for AsyncLLM.generate() and encode() documentation Improvements or additions to documentation v1
#30558 opened Dec 12, 2025 by buaazp Loading…
3 of 5 tasks
[GPT OSS] Fix tool_choice required frontend gpt-oss Related to GPT-OSS models
#30557 opened Dec 12, 2025 by southfreebird Draft
feat: batched shared encoder for whisper beam search ci/build deepseek Related to DeepSeek models documentation Improvements or additions to documentation frontend gpt-oss Related to GPT-OSS models kv-connector llama Related to Llama models multi-modality Related to multi-modality (#4194) nvidia performance Performance-related issues qwen Related to Qwen models rocm Related to AMD ROCm speculative-decoding structured-output tool-calling v1
#30556 opened Dec 12, 2025 by TheCodeWrangler Draft
3 of 5 tasks
typing: Add type hints to TurnMetrics class in context.py frontend gpt-oss Related to GPT-OSS models
#30552 opened Dec 12, 2025 by yurekami Loading…
3 tasks done
docs: Clarify block_quant_to_tensor_quant docstring (fixes #30098)
#30551 opened Dec 12, 2025 by yurekami Loading…
2 tasks done
[Frontend] Support passing custom score template as a CLI argument to vllm serve documentation Improvements or additions to documentation frontend
#30550 opened Dec 12, 2025 by jzakrzew Loading…
[CustomOp] Support object-level enable for CustomOp
#30547 opened Dec 12, 2025 by shen-shanshan Loading…
1 of 5 tasks
[KVEvent] User request.block_hash for parent block_hash v1
#30544 opened Dec 12, 2025 by heheda12345 Loading…
5 tasks
[Bugfix] Revert Qwen2-VL part of change in #28271 qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#30542 opened Dec 12, 2025 by zifeitong Loading…
[Doc]: fixing typos in various files documentation Improvements or additions to documentation frontend needs-rebase nvidia ready ONLY add when PR is ready to merge/full CI is needed
#30540 opened Dec 12, 2025 by didier-durand Loading…
2 tasks done
Add AudioFlamingo3 model support documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) new-model Requests to new models
#30539 opened Dec 12, 2025 by lashahub Loading…
3 of 5 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.