CARVIEW |
Select Language
HTTP/2 301
date: Tue, 14 Oct 2025 21:03:57 GMT
content-type: text/html; charset=utf-8
content-length: 0
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
location: https://github.com/ggml-org/llama.cpp/releases
cache-control: no-cache
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: origin-when-cross-origin, strict-origin-when-cross-origin
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com github.githubassets.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com wss://alive-staging.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com marketplace-screenshots.githubusercontent.com/ copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
set-cookie: _gh_sess=ZeqMmduUQCfDUiSpG3PxR0HLAdB5rnmRPtZbT3g7ItVDKkmhOe%2FARNTKAakvMqi0nEwwfbWzpFjs2SJ67bbX6U9n3ncZfkubQi6u%2F%2Bl7%2Fc0kY1l2XfsbaRKTkCa2bb4u0auhJk41XvBn55r%2BUPI9ie2QxfuW4ZJhZ3DZjlisSh70NpBHCbMY2ti12p505w16kr1UomuNb2uw5KVLxK4vyWFYc1oSp3paOzxw2rnOyOu1G8Pvs9%2FEfPNxIljPHDU7h7StcfAD7TYB%2FSbAn7ZdNw%3D%3D--9ND23wQtRIZcig7c--bN2RGNtaf35yDIHJlp6M2Q%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.432107850.1760475836; Path=/; Domain=github.com; Expires=Wed, 14 Oct 2026 21:03:56 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Wed, 14 Oct 2026 21:03:56 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: EC4A:7E5E8:10709D9:1362B7A:68EEBABC
HTTP/2 200
date: Tue, 14 Oct 2025 21:03:57 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"771ecfd6485ab4394ae4acbeb8543cdb"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com github.githubassets.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com wss://alive-staging.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com marketplace-screenshots.githubusercontent.com/ copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
x-github-request-id: EC4A:7E5E8:10709EC:1362B9D:68EEBABC
Releases · ggml-org/llama.cpp · GitHub
14 Oct 18:26
14 Oct 17:48
Loading
14 Oct 15:19
Loading
14 Oct 14:31
Loading
14 Oct 13:32
Loading
14 Oct 12:47
Loading
14 Oct 11:48
Loading
14 Oct 11:47
Loading
14 Oct 10:54
Loading
14 Oct 06:08
Loading
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 13.3k
Releases: ggml-org/llama.cpp
Releases · ggml-org/llama.cpp
b6765
fa882fd
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
metal : avoid using Metal's gpuAddress property (#16576) * metal : avoid using Metal's gpuAddress property * metal : fix rope kernels buffer check
Assets 15
- sha256:8c79a9b226de4b3cacfd1f83d24f962d0773be79f1e7b75c6af4ded7e32ae1d6373 MB
2025-10-14T18:26:38Z - sha256:fb9a212fc79ca8858e75f1f46b7c391e7dc73117a7c25dcb076ad0285ac20ce410.4 MB
2025-10-14T18:26:48Z - sha256:2f1465fbeb121f5f6dc7ae4b1110252987f6d94ccb7df17b09478e8083cbdbc527 MB
2025-10-14T18:26:49Z - sha256:1f4d352a858c6361e1a00ba7327054066bf140e6a20bd546d6a5f631838a099f25.8 MB
2025-10-14T18:26:51Z - sha256:70393145b5dc87b2a508e96f6916d133204ad83d678ac7bf0866136696a77a0f12.5 MB
2025-10-14T18:26:52Z - sha256:9d123d28505356095a29408f9df59b72a93d22710c782ea08916b446462e428610.6 MB
2025-10-14T18:26:53Z - sha256:4fea6d25bcf649a67e84483db1afa21a857aedbb391d0d3ca68f466d60645ff213.6 MB
2025-10-14T18:26:54Z - sha256:cf888c303769fa5b7706af78d94527200f1ab563a65c11268b105812a789ae0b169 MB
2025-10-14T18:26:55Z - sha256:4205e0d335293c9dfcaf22efbaaa5eb8ea9e6bef61c974140fee88d0c69ddecf321 MB
2025-10-14T18:27:02Z - sha256:b53f28e644728cfca58e3533986d56736d7bc6721eb90c49fa1566c78bc772de11 MB
2025-10-14T18:27:12Z -
2025-10-14T17:33:05Z -
2025-10-14T17:33:05Z - Loading
1 person reacted
b6764
ffa0590
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
vulkan: Add ACC_TYPE_VEC2 implementation (#16203) Signed-off-by: Stefan Savic <stefan.savic@huawei.com> Co-authored-by: Stefan Savic <stefan.savic@huawei.com>
Assets 15
b6763
120bf70
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
CUDA + openCL: fix bug in accessing rms_norm->src while doing fusion …
Assets 15
b6762
4258e0c
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
vulkan: Support FA with K/V in F32 (#16543)
Assets 15
2 people reacted
b6761
7ea15bb
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
vulkan: Improve build time for MSVC (#16545) Enable CMP0147 so custom build steps (invoking vulkan-shader-gen) are run in parallel. Enable /MP so source files are compiled in parallel.
Assets 15
2 people reacted
b6760
9c7185d
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
CUDA: enable FA for FP32 KV cache (#16546)
Assets 15
b6759
1ee9d0b
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557) * CUDA: use fastdiv + ggml_cuda_mad for mmvf * use bf16 directly + fix formatting * Add exception for HIP code
Assets 15
b6758
48e2fa9
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
CUDA: add fp kernel for larger batch size MoE (#16512) * CUDA: kernel for larger batch sizes for MoE * WIP * WIP * WIP * WIP * WIP * WIP * fixup * tests * Move mmq_ids_helper to mmid * cleanup * Remove redundant checks
Assets 15
b6757
5b6913c
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
cuda : remove legacy copy-op pointer indirection code (#16485) * remove legacy copy-op pointer indirection code * further removal of copy-op indirection code * renamed check_node_graph_compatibility_and_refresh_copy_ops function
Assets 15
b6756
bc07349
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
server : dynamic token limit for prompt cache (#16560) * server : dynamic token limit for prompt cache * cont : print estimated token limit
Assets 15
2 people reacted
Previous Next
You can’t perform that action at this time.