-
Notifications
You must be signed in to change notification settings - Fork 17.3k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ggml-cpu: cmake: append xsmtvdotii march for SpacemiT IME
#22317
opened Apr 24, 2026 by
qiurui144
Loading…
gitignore : add .pi + personal SYSTEM.md
devops
improvements to build systems and github actions
#22316
opened Apr 24, 2026 by
ggerganov
Member
Loading…
common : fix jinja warnings with clang 21
jinja parser
Issues related to the jinja parser
merge ready
A maintainer can use this label to indicate that they consider the changes final and ready to merge.
#22313
opened Apr 24, 2026 by
angt
Member
Loading…
hexagon: use DIRID 13 in libggml-htp.inf for modern InfVerif
ggml
changes relating to the ggml tensor library for machine learning
Hexagon
#22306
opened Apr 24, 2026 by
mengshengwu
Contributor
Loading…
3 tasks done
CI Snapdragon: Switch ubuntu-latest to ubuntu-slim runner
devops
improvements to build systems and github actions
#22303
opened Apr 23, 2026 by
shreyajn
Contributor
Loading…
Adreno optimization for MoE - MxFP4
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#22301
opened Apr 23, 2026 by
shawngu-quic
Contributor
Loading…
internal AllReduce kernel for CUDA provider
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22299
opened Apr 23, 2026 by
scutler-nv
•
Draft
CUDA: reduce MMQ stream-k overhead
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22298
opened Apr 23, 2026 by
JohannesGaessler
Contributor
Loading…
CUDA: add POOL_1D
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22297
opened Apr 23, 2026 by
LeoYangXY
Loading…
ggml : skip already registered backends and devices
ggml
changes relating to the ggml tensor library for machine learning
#22296
opened Apr 23, 2026 by
angt
Member
Loading…
ggml-cpu : disable tiled matmul on AIX to fix page boundary segfault
ggml
changes relating to the ggml tensor library for machine learning
#22293
opened Apr 23, 2026 by
shalinib-ibm
Contributor
Loading…
[SYCL] Optimize Q4_0 mul_mat for Arc770, add scripts
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#22291
opened Apr 23, 2026 by
arthw
Contributor
Loading…
common : only load backends when required
ggml
changes relating to the ggml tensor library for machine learning
#22290
opened Apr 23, 2026 by
angt
Member
Loading…
ggml-cuda: add flash-attn support for DKQ=320/DV=256 with ncols2=32 (…
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#22286
opened Apr 23, 2026 by
lnigam
Loading…
Update CMakeLists.txt
ggml
changes relating to the ggml tensor library for machine learning
need more info
The OP should provide more details about the issue
#22282
opened Apr 23, 2026 by
fsavanur
Loading…
llama : integer type consistency in llama.h #4574
#22277
opened Apr 23, 2026 by
Vladimirhoa
Loading…
common: Changed to leak logger singleton to prevent hanging on Windows
#22273
opened Apr 23, 2026 by
rillomas
Contributor
Loading…
opencl: add iq4_nl support
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
webui: add option for LLM title generation
examples
server/webui
server
#22265
opened Apr 22, 2026 by
smugman-dot
Loading…
common: fix macOS cache path segfault when HOME is unset
#22263
opened Apr 22, 2026 by
Geramy
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.