ggml-org / llama.cpp Public

Notifications You must be signed in to change notification settings
Fork 14.1k
Star 91.3k

Code
Issues 337
Pull requests 622
Discussions
Actions
Projects 10
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: ggml-org/llama.cpp

Labels 86 Milestones 0

New pull request New

622 Open 8,084 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

convert : refactor rope scaling handling python

python script changes

#18013 opened Dec 14, 2025 by CISC

Loading…

Async DirectIO model loading on Linux

#18012 opened Dec 13, 2025 by JTischbein

Loading…

webui: fix chat screen shadow width examples server

#18010 opened Dec 13, 2025 by polydecay

Loading…

model-conversion : cast logits to float32 examples python

python script changes

#18009 opened Dec 13, 2025 by ggerganov

Loading…

convert : fix gpt-oss python

python script changes

#18008 opened Dec 13, 2025 by ggerganov

Loading…

models : fix YaRN regression + consolidate logic

#18006 opened Dec 13, 2025 by ggerganov

Loading…

CLI: fixed adding cli and completion into docker containers, improved docs devops

improvements to build systems and github actions

documentation

Improvements or additions to documentation

#18003 opened Dec 13, 2025 by andrew-aladev

Loading…

Clarify that steps also apply to linux documentation

Improvements or additions to documentation

#18002 opened Dec 13, 2025 by alosslessdev

Loading…

server: add /v1/metrics endpoint examples server

#18001 opened Dec 13, 2025 by Kritavya

Loading…

mtmd: add GLM4V multimodal model with conversion support examples model

Model specific

python

python script changes

#17998 opened Dec 13, 2025 by eelbaz

Loading…

arg: clarify auto kvu/np being set on server examples server

#17997 opened Dec 13, 2025 by ngxson

Loading…

Optimization: Qwen3 next autoregressive pass model

Model specific

#17996 opened Dec 13, 2025 by pwilkin

Loading…

CLI: fixed dead links to tools/main for cli and completion, fixed code owners documentation

Improvements or additions to documentation

examples

#17993 opened Dec 13, 2025 by andrew-aladev

Loading…

HIP: Refactor mma for RDNA and CDNA ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#17990 opened Dec 13, 2025 by zhang-hui-yulo • Draft

1 task

sync : ggml ggml

changes relating to the ggml tensor library for machine learning

script

Script related

#17988 opened Dec 13, 2025 by ggerganov

Loading…

kv-cache: Fix state restore fragmented cache testing

Everything test related

#17982 opened Dec 13, 2025 by ssweens

Loading…

webui: fix chat header width when sidebar is closed examples server

#17981 opened Dec 13, 2025 by polydecay

Loading…

mtmd: refactor audio preprocessing examples

#17978 opened Dec 12, 2025 by ngxson

Loading…

ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations ggml

changes relating to the ggml tensor library for machine learning

#17977 opened Dec 12, 2025 by ngdxzy

Loading…

webui: Improve copy to clipboard with text attachments examples server

#17969 opened Dec 12, 2025 by allozaur

Loading…

mtmd: (WIP) gemma3n vision support examples python

python script changes

#17961 opened Dec 12, 2025 by ngxson • Draft

server: support global section of presets examples server

#17959 opened Dec 12, 2025 by ngxson

Loading…

server: add encoder-decoder model support (T5, BART, MADLAD) examples server

#17956 opened Dec 12, 2025 by Turee

Loading…

vulkan: Add perf logger mode with concurrency ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#17944 opened Dec 11, 2025 by jeffbolznv

Loading…

common : refactor common_sampler + grammar logic changes examples python

python script changes

server

#17937 opened Dec 11, 2025 by ggerganov

Loading…

Previous 1 2 3 4 5 … 24 25 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!