CARVIEW |
Select Language
HTTP/2 200
date: Fri, 18 Jul 2025 17:36:37 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"1152364f877ca919c0658b692cc5833b"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=Q02n4SLB6BAX3gAr0%2FMjsOZ8SyygOy1DXiKlwbCYN1nOJMrIyQ6r5SSMAkiKX5sDggnXOhX55JNWwjfezSAd76rAHsdIYNIJBwOyE%2FXqzJ%2BRRRHU%2BJg5mhQ7LUVkZOMKNK%2B8cA6DkprR%2BhwHew7mX4I7%2BRyFLrtM1EkRKZ%2FwqUnTsvmi%2FFAYaJt46vaByTKwt4%2FXhPPqlsF2ZrCc9m1u1CiMXfnOX9%2BwsfsMxF4UA0RI%2FDvuyCuKCqCqXLZD0dxj9Rxboe1PGgjvEAdemsC%2FrQ%3D%3D--5uqzGBbsGpc649rB--qeor9szqQgu8soJrQaR%2BNQ%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1077245603.1752860197; Path=/; Domain=github.com; Expires=Sat, 18 Jul 2026 17:36:37 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Sat, 18 Jul 2026 17:36:37 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: 8234:1101E6:313EC:3CA70:687A8625
Releases · ggml-org/llama.cpp · GitHub
18 Jul 15:05
18 Jul 12:19
Loading
18 Jul 12:16
Loading
18 Jul 09:22
Loading
18 Jul 07:31
Loading
18 Jul 05:56
Loading
18 Jul 04:55
Loading
18 Jul 02:46
Loading
17 Jul 21:32
Loading
17 Jul 19:07
Loading
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 12.4k
Releases: ggml-org/llama.cpp
Releases · ggml-org/llama.cpp
b5935
2adf8d8
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
Assets 15
- sha256:8c79a9b226de4b3cacfd1f83d24f962d0773be79f1e7b75c6af4ded7e32ae1d6
2025-07-18T15:05:51Z - sha256:4e9d470f21ca25aad3824cf15ca02d675556ba5a3c3bcc21aac084dd371c9723
2025-07-18T15:06:08Z - sha256:68f2d952861d4384b37fa46f908fcc66aa394cae0ea37c0578d8d6a75ead1561
2025-07-18T15:06:09Z - sha256:c58df9abbee553daa12eb6b017b6f8421362da3f03245b3f79ec9b89e9467dcd
2025-07-18T15:06:11Z - sha256:4fba4db38e697a15d226831539b13eba2ad0da51c52630c25ea607c8d317bd70
2025-07-18T15:06:13Z - sha256:907e4671160e82e6c202387356c3b57aa1fd8f45b16613320c24baae67d27fb6
2025-07-18T15:06:14Z - sha256:96b81dd52c47cd432e8eaeb17ef183a9ce45f53d0d913a829b69e55032462444
2025-07-18T15:06:16Z - sha256:543bfa620f5a4273f7d9078e4dfc0449744a752d56765d58baae6e174a4d7268
2025-07-18T15:06:17Z - sha256:5a5b401ca94ed68a6a0688252c81fc309648ed430bb685e6486b5f5346991932
2025-07-18T15:06:23Z - sha256:8301fbb4e51d348b1c4d645d8e8bd1391b017ccc4e7405dc853bfca8635a1484
2025-07-18T15:06:35Z -
2025-07-18T14:33:41Z -
2025-07-18T14:33:41Z - Loading
b5934
021cc28
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
cuda : Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs (#14741) * Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs Gemma3n uses Matrix-Matrix addition as part of their input processing, wrongly triggering CUDA_GRAPH disablement on NVGPUs even when batch-size of 1 is used. * Exclude `project_per_layer_input` by matching node names This ensures that all other graphs which don't exhibit this pattern do not have their behavior changed. * Revert unnecessary formatting changes
Assets 15
b5933
d498af3
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
graph : avoid huge warm-up graphs for MoE models (#14753) * graph : avoid huge warm-up graphs for MoE models ggml-ci * cont : bump max nodes to 8x model tensors
Assets 15
b5932
eacdeb5
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
model : fix build after merge conflict (#14754)
Assets 15
b5930
f9a31ee
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
CUDA: set_rows + cpy.cu refactor (#14712)
Assets 15
b5929
8f974bc
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
graph : refactor context to not pass gf explicitly (#14629) ggml-ci
Assets 15
b5928
09651d0
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
graph : Pass the graph placeholder message in debug mode (#14748) Without that condition, this debug log clutters the screen every batch treated in the prompt processing, or every token generated in Kobold.cpp.
Assets 15
b5927
349ea79
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
use max work group size for device to replace the magic number (#14732)
Assets 15
b5924
cb887f1
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
model: add Ernie 4.5 MoE support (#14658) * Add Ernie4.5 MoE * Fix Flake errors. * Properly encode/decode MoE layer step * Correct tensor mappings (.weight) * Pass and read n_ff_exp * n_ff_shexp calculation and further minor changes * Rope fixes. * .gitignore fix * Add unit32 cast for Linux builds * Apply suggestions from code review Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> * Further fixes from code review * Fix trailing whitespace * Reenable missing experts error * Code style from code review Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> * Fix non-MoE regression Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> --------- Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
Assets 15
2 people reacted
b5923
d6fb3f6
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
kv-cache : fix k-shift for multiple streams (#14742) ggml-ci
Assets 15
Previous Next
You can’t perform that action at this time.