-
-
Notifications
You must be signed in to change notification settings - Fork 11.9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP][CI] Speed up sequence parallel tests
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#30568
opened Dec 12, 2025 by
LucasWilkinson
Loading…
[Bug] Fix AttributeError: 'Qwen3VLMoeConfig' object has no attribute 'intermediate_size'
qwen
Related to Qwen models
ready
ONLY add when PR is ready to merge/full CI is needed
#30567
opened Dec 12, 2025 by
yewentao256
Loading…
update to transformers v5
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#30566
opened Dec 12, 2025 by
hmellor
Loading…
[Bug] add sm100f check for flashinfer attention and moe
nvidia
v1
#30565
opened Dec 12, 2025 by
IwakuraRein
•
Draft
5 tasks
[Docs] Remove references to Improvements or additions to documentation
VLLM_ATTENTION_BACKEND
documentation
#30564
opened Dec 12, 2025 by
MatthewBonanni
Loading…
2 of 5 tasks
[Attention] Update tests to remove Related to multi-modality (#4194)
nvidia
rocm
Related to AMD ROCm
speculative-decoding
v1
VLLM_ATTENTION_BACKEND
ci/build
kv-connector
multi-modality
#30563
opened Dec 12, 2025 by
MatthewBonanni
Loading…
2 of 5 tasks
[Refactor] Small refactor for group topk
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#30562
opened Dec 12, 2025 by
yewentao256
Loading…
feat(serve): add warmup support for consistent first-request performance
documentation
Improvements or additions to documentation
frontend
#30561
opened Dec 12, 2025 by
TheCodeWrangler
Loading…
5 tasks done
[Feat] Enable eplb with default all2all backend
ready
ONLY add when PR is ready to merge/full CI is needed
#30559
opened Dec 12, 2025 by
yewentao256
Loading…
[Core] Support multi prompt for AsyncLLM.generate() and encode()
documentation
Improvements or additions to documentation
v1
#30558
opened Dec 12, 2025 by
buaazp
Loading…
3 of 5 tasks
[GPT OSS] Fix tool_choice required
frontend
gpt-oss
Related to GPT-OSS models
#30557
opened Dec 12, 2025 by
southfreebird
•
Draft
feat: batched shared encoder for whisper beam search
ci/build
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
frontend
gpt-oss
Related to GPT-OSS models
kv-connector
llama
Related to Llama models
multi-modality
Related to multi-modality (#4194)
nvidia
performance
Performance-related issues
qwen
Related to Qwen models
rocm
Related to AMD ROCm
speculative-decoding
structured-output
tool-calling
v1
#30556
opened Dec 12, 2025 by
TheCodeWrangler
•
Draft
3 of 5 tasks
[Bugfix][Frontend] Prevent IndexError in MiniMax M2 tool parser during streaming extraction
frontend
tool-calling
#30555
opened Dec 12, 2025 by
WangErXiao
Loading…
[Bug][CPU Backend]: Improve L2 cache size detection and usage on aarch64
#30553
opened Dec 12, 2025 by
Radu2k
Loading…
typing: Add type hints to TurnMetrics class in context.py
frontend
gpt-oss
Related to GPT-OSS models
#30552
opened Dec 12, 2025 by
yurekami
Loading…
3 tasks done
docs: Clarify block_quant_to_tensor_quant docstring (fixes #30098)
#30551
opened Dec 12, 2025 by
yurekami
Loading…
2 tasks done
[Frontend] Support passing custom score template as a CLI argument to vllm serve
documentation
Improvements or additions to documentation
frontend
#30550
opened Dec 12, 2025 by
jzakrzew
Loading…
[CustomOp] Support object-level enable for CustomOp
#30547
opened Dec 12, 2025 by
shen-shanshan
Loading…
1 of 5 tasks
[KVEvent] User request.block_hash for parent block_hash
v1
#30544
opened Dec 12, 2025 by
heheda12345
Loading…
5 tasks
[Doc]: fixing typos in various files
documentation
Improvements or additions to documentation
frontend
needs-rebase
nvidia
ready
ONLY add when PR is ready to merge/full CI is needed
#30540
opened Dec 12, 2025 by
didier-durand
Loading…
2 tasks done
Add AudioFlamingo3 model support
documentation
Improvements or additions to documentation
multi-modality
Related to multi-modality (#4194)
new-model
Requests to new models
#30539
opened Dec 12, 2025 by
lashahub
Loading…
3 of 5 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.