Releases · ggml-org/llama.cpp · GitHub
19 Jul 17:16
19 Jul 16:39
19 Jul 16:24
18 Jul 18:10
18 Jul 17:54
18 Jul 15:05
18 Jul 12:19
18 Jul 12:16
18 Jul 09:22
18 Jul 07:31
b5942
9008328
This commit was created on GitHub.com and signed with GitHub’s verified signature.
imatrix : use GGUF to store importance matrices (#9400)

* imatrix : allow processing multiple chunks per batch
* perplexity : simplify filling the batch
* imatrix : fix segfault when using a single chunk per batch
* imatrix : use GGUF to store imatrix data
* imatrix : fix conversion problems
* imatrix : use FMA and sort tensor names
* py : add requirements for legacy imatrix convert script
* perplexity : revert changes
* py : include imatrix converter requirements in toplevel requirements
* imatrix : avoid using designated initializers in C++
* imatrix : remove unused n_entries
* imatrix : allow loading mis-ordered tensors
  Sums and counts tensors no longer need to be consecutive.
* imatrix : more sanity checks when loading multiple imatrix files
* imatrix : use ggml_format_name instead of std::string concatenation
  Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
* quantize : use unused imatrix chunk_size with LLAMA_TRACE
* common : use GGUF for imatrix output by default
* imatrix : two-way conversion between old format and GGUF
* convert : remove imatrix to gguf python script
* imatrix : use the function name in more error messages
* imatrix : don't use FMA explicitly
  This should make comparisons between the formats easier because this matches the behavior of the previous version.
* imatrix : avoid returning from void function save_imatrix
* imatrix : support 3d tensors with MUL_MAT
* quantize : fix dataset name loading from gguf imatrix
* common : move string_remove_suffix from quantize and imatrix
  Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* imatrix : add warning when legacy format is written
* imatrix : warn when writing partial data, to help guess dataset coverage
  Also make the legacy format store partial data by using neutral values for missing data. This matches what is done at read-time for the new format, and so should get the same quality in case the old format is still used.
* imatrix : avoid loading model to convert or combine imatrix
* imatrix : avoid using imatrix.dat in README

---------

Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
Assets 15
- sha256:8c79a9b226de4b3cacfd1f83d24f962d0773be79f1e7b75c6af4ded7e32ae1d6 2025-07-19T17:16:54Z
- sha256:ef5bbff672620df3bc686cb6659c03f7f797b84bfcc141709e1d28d4b6a2345e 2025-07-19T17:17:04Z
- sha256:604cadc63e79972c15c3cc6c26c6fd50986066540f4b4f8cb23b93f97db9eb11 2025-07-19T17:17:05Z
- sha256:378bb341b2a636c90986323f57e5301d011f5858cc0ef84b5406e700858ed988 2025-07-19T17:17:06Z
- sha256:57e2a135dcd7ee753bb1b043d0d594cc6688d0c8b8cb3b8a31488368460d0d4e 2025-07-19T17:17:07Z
- sha256:ea14f5dc88bd84461e198f1344e4249dbd1cb05aef37051328c19c02b37c2ed9 2025-07-19T17:17:08Z
- sha256:ecd795e21fe02f53c3c78da5df528f2bcd9affd1e3f1cbda4b298653dac1e95e 2025-07-19T17:17:09Z
- sha256:853614e41c9773036f7259094e0892528239a1a954600d9e6547ea4ab9a4142e 2025-07-19T17:17:10Z
- sha256:5b2b5dec8788a3f0d9cab9a297865cb562492d106db865849c642cadfbf9b690 2025-07-19T17:17:14Z
- sha256:a919e96a694f805ec651c8a98aebfce07d1fb8f4e1434a30bb948aaa13666e46 2025-07-19T17:17:21Z
- 2025-07-19T16:51:22Z
- 2025-07-19T16:51:22Z
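The `sha256:` values listed with the assets can be used to check that a downloaded file arrived intact. The following is a minimal sketch using only Python's standard library; the helper names `sha256_of_file` and `matches_listed_digest` are hypothetical, and the file path in the usage note is a placeholder, since the asset file names are not preserved in this listing.

```python
import hashlib
import hmac


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large archives do not need to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_digest(path, listed):
    """Compare a file against a digest written as 'sha256:<64 hex chars>'."""
    algo, _, expected = listed.partition(":")
    if algo != "sha256" or len(expected) != 64:
        raise ValueError(f"unsupported digest format: {listed!r}")
    # compare_digest is a constant-time comparison of the two hex strings.
    return hmac.compare_digest(sha256_of_file(path), expected.lower())
```

A call such as `matches_listed_digest("downloaded-asset.zip", "sha256:8c79a9b2...")` (with the full 64-character digest copied from the list) would return `True` only if the local file hashes to the listed value.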
b5941
d4b91ea
vulkan: Add logging for bf16 features to ggml_vk_print_gpu_info (#132…
Assets 15
b5940
83f5872
Vulkan: Fix fprintf format-security warning (#14770)
Assets 15
b5937
bf9087f
metal : fuse add, mul + add tests (#14596) ggml-ci
Assets 15
b5936
9fb1042
graph : fix graph reuse reset of params (#14760) ggml-ci
Assets 15
b5935
2adf8d8
parallel : add option for different RNG seeds (#14757) ggml-ci
Assets 15
b5934
021cc28
cuda : Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs (#14741)

* Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs
  Gemma3n uses Matrix-Matrix addition as part of their input processing, wrongly triggering CUDA_GRAPH disablement on NVGPUs even when batch-size of 1 is used.
* Exclude `project_per_layer_input` by matching node names
  This ensures that all other graphs which don't exhibit this pattern do not have their behavior changed.
* Revert unnecessary formatting changes
Assets 15
b5933
d498af3
graph : avoid huge warm-up graphs for MoE models (#14753)

* graph : avoid huge warm-up graphs for MoE models
  ggml-ci
* cont : bump max nodes to 8x model tensors
Assets 15
b5932
eacdeb5
model : fix build after merge conflict (#14754)
Assets 15
b5930
f9a31ee
CUDA: set_rows + cpy.cu refactor (#14712)
Assets 15