CARVIEW |
Select Language
HTTP/2 200
date: Sun, 12 Oct 2025 23:23:10 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"b59b101be1c2c6924292f98a85599a77"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com github.githubassets.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com wss://alive-staging.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com marketplace-screenshots.githubusercontent.com/ copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=x%2FeGH8rp51Po9r8J1B5IIX907x%2Flh42Z5Qd2D2tjGlE7Y43j35tIBwsTcv93cLho%2BuB6f2AxVDLxpoRuN2NmwAV50ie5ps2fo5cY%2FzmLnsK%2FoCk2pDjCQ%2B4P5EInChaN3n1AoLrxoTanU0mt824scedT6G2CLo1yJEa91XXN1pWC7hqxOttj1QDt%2Ffk30UHI7qfyWdlkPCfiC23I2UHj3hAWUBPOqkLaVgjT6a4dLQtmizElwiXXa3IXpXcOt14xVhqR4QNYm3tfSMTE4J30mA%3D%3D--n98sWMBtPpM2vU%2Fe--TqslJ1A9OkEi7lRm7cwFig%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1012964100.1760311390; Path=/; Domain=github.com; Expires=Mon, 12 Oct 2026 23:23:10 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Mon, 12 Oct 2026 23:23:10 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: C836:3B926C:133F0F2:199BC59:68EC385E
Releases · ggml-org/llama.cpp · GitHub
12 Oct 19:12
12 Oct 14:21
Loading
12 Oct 13:37
Loading
12 Oct 06:36
Loading
12 Oct 06:19
Loading
12 Oct 06:16
Loading
11 Oct 19:50
Loading
11 Oct 15:48
Loading
11 Oct 15:14
Loading
11 Oct 11:19
Loading
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 13.3k
Releases: ggml-org/llama.cpp
Releases · ggml-org/llama.cpp
b6745
a31cf36
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
metal : add opt_step_adamw and op_sum (#16529) * scaffold to support opt step adamw on metal (not written so far) * add opt-step-adamw kernel for metal * pass op->src[4] as a separate buffer to the pipeline * add bounds check to opt-step-adamw kernel * complete scaffold for GGML_OP_SUM * naive GGML_OP_SUM kernel * remove unwanted comment * change OP_SUM capability gate * Add has_simdgroup_reduction to both ops to pass CI
Assets 15
- sha256:8c79a9b226de4b3cacfd1f83d24f962d0773be79f1e7b75c6af4ded7e32ae1d6373 MB
2025-10-12T19:12:51Z - sha256:a7da92223a5cf2e13cdbfc73665a472d121a5673a31697fc10fd21f63d196c8710.4 MB
2025-10-12T19:13:06Z - sha256:4bcca84a97c17ae1667c10faf08690f24f2d98e05f4b67c240b8064a2eb1f94527 MB
2025-10-12T19:13:07Z - sha256:8129a2c9090f4b2216fbd929df2d6786dd1d1258db3f8067441d774a9d8b87bb25.6 MB
2025-10-12T19:13:09Z - sha256:fd59357fffee32b02e003eabd4f3cb019da557969c5c2d79673491c9c239782512.5 MB
2025-10-12T19:13:11Z - sha256:2baa8309c5dfdceb23ca686e3db936c8904168e858b7b5b8745582a096ac5dda10.6 MB
2025-10-12T19:13:13Z - sha256:cc66900d0aa2305f50aa16fae1b3db7ec6fb178914baf82e4fdc86fea523254913.6 MB
2025-10-12T19:13:14Z - sha256:b1e334a89706ca2027a651d1b182d2dd0504f6c4973fcfe02da26e780f3549a8161 MB
2025-10-12T19:13:16Z - sha256:7b55d016a98698877d773b27b6f9a05df5a5d6e0ca3fe8c3459853c40be93cec321 MB
2025-10-12T19:13:24Z - sha256:8c92dc5a5f3db66594adaa02c746f601c9f93f2dd16ac35b246030472b16d5d611 MB
2025-10-12T19:13:37Z -
2025-10-12T18:43:14Z -
2025-10-12T18:43:14Z - Loading
b6743
c7be9fe
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
[SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) * fix/refactor OP argsort, pad * fix count-equal op * update SYCL OP list * fix format issue --------- Co-authored-by: Zhang Jianyu <zhang.jianyu@outlook.com>
Assets 15
3 people reacted
b6741
2c301e9
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
common : handle unicode during partial json parsing (#16526) * common : handle unicode during partial json parsing * common : set missing `ensure_ascii = true` during json dump
Assets 15
b6739
41aac5c
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
ggml : Fix FP16 ELU positive branch (#16519) Co-authored-by: Aaron <shelhamer.aaron@gmail.com>
Assets 15
b6738
a2fba89
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
hparams : add check for layer index in is_recurrent (#16511) * hparams : add check for layer index in is_recurrent This commit adds a check in the is_recurrent method to ensure that the provided layer index is within the valid range. The motivation for this change is to prevent potential out-of-bounds and also be consistent with other methods in the class that perform similar checks, like is_swa.
Assets 15
b6737
20cc625
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
ggml: Correct SVE implementation in ggml_vec_dot_f16_unroll (#16518) The previous SVE implementation for `ggml_vec_dot_f16_unroll` contained a bug due to a copy-paste error. The wrong variable was used in an FMA instruction, leading to incorrect results. This commit corrects the variable usage and improves the clarity of the code by renaming variables to avoid confusion. Co-authored-by: Aaron <shelhamer.aaron@gmail.com>
Assets 15
b6736
11f0af5
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
CUDA: faster tile FA, add oob checks, more HSs (#16492)
Assets 15
4 people reacted
b6735
a3cb047
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
metal : fix mul-mm condition + fix mul-mv permuted kernels (#16494)
Assets 15
b6733
31d0ff1
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
server / ranking : add sorting and management of top_n (#16403) * server / ranking : add sorting and management of top_n * Make the retro compatible if no top_n will return all results here is a script to make some test ```script URL=${1:-https://127.0.0.1:8181} curl "$URL/v1/rerank" -H "Content-Type: application/json" \ -d '{ "model": "M", "query": "What is the recipe to make bread ?", "return_text" : true, "texts" : true, "top_n": 6, "documents": [ "voici la recette pour faire du pain, il faut de la farine de l eau et du levain et du sel", "it is a bear", "bread recipe : floor, water, yest, salt", "The giant panda (Ailuropoda melanoleuca), sometimes called a panda bear or simply panda, is a bear species endemic to China.", "here is the ingedients to bake bread : 500g floor, 350g water, 120g fresh refresh yest, 15g salt", "recipe to make cookies : floor, eggs, water, chocolat", "here is the recipe to make bread : 500g floor, 350g water, 120g fresh refresh yest, 15g salt", "il fait tres beau aujourd hui", "je n ai pas faim, je ne veux pas manger", "je suis a paris" ] }' | jq ``` * use resize() instead for(...) * simplify top_n init since no need to return error result to test : ./tests.sh unit/test_rerank.py -v -x ==================================================== test session starts ===================================================== platform linux -- Python 3.12.3, pytest-8.3.5, pluggy-1.6.0 -- /home/yann/dev/yann/llama.cpp/tools/server/tests/test/bin/python3 cachedir: .pytest_cache rootdir: /home/yann/dev/yann/llama.cpp/tools/server/tests configfile: pytest.ini plugins: anyio-4.11.0 collected 8 items unit/test_rerank.py::test_rerank PASSED [ 12%] unit/test_rerank.py::test_rerank_tei_format PASSED [ 25%] unit/test_rerank.py::test_invalid_rerank_req[documents0] PASSED [ 37%] unit/test_rerank.py::test_invalid_rerank_req[None] PASSED [ 50%] unit/test_rerank.py::test_invalid_rerank_req[123] PASSED [ 62%] unit/test_rerank.py::test_invalid_rerank_req[documents3] PASSED [ 75%] unit/test_rerank.py::test_rerank_usage[Machine learning is-A machine-Learning is-19] PASSED [ 87%] unit/test_rerank.py::test_rerank_usage[Which city?-Machine learning is -Paris, capitale de la-26] PASSED [100%] ===================================================== 8 passed in 4.31s ====================================================== * add rerank top_n unit test here is the result : ./tests.sh unit/test_rerank.py -v -x =================================================================== test session starts =================================================================== platform linux -- Python 3.12.3, pytest-8.3.5, pluggy-1.6.0 -- /home/yann/dev/yann/llama.cpp/tools/server/tests/test/bin/python3 cachedir: .pytest_cache rootdir: /home/yann/dev/yann/llama.cpp/tools/server/tests configfile: pytest.ini plugins: anyio-4.11.0 collected 16 items unit/test_rerank.py::test_rerank PASSED [ 6%] unit/test_rerank.py::test_rerank_tei_format PASSED [ 12%] unit/test_rerank.py::test_invalid_rerank_req[documents0] PASSED [ 18%] unit/test_rerank.py::test_invalid_rerank_req[None] PASSED [ 25%] unit/test_rerank.py::test_invalid_rerank_req[123] PASSED [ 31%] unit/test_rerank.py::test_invalid_rerank_req[documents3] PASSED [ 37%] unit/test_rerank.py::test_rerank_usage[Machine learning is-A machine-Learning is-19] PASSED [ 43%] unit/test_rerank.py::test_rerank_usage[Which city?-Machine learning is -Paris, capitale de la-26] PASSED [ 50%] unit/test_rerank.py::test_rerank_top_n[None-4] PASSED [ 56%] unit/test_rerank.py::test_rerank_top_n[2-2] PASSED [ 62%] unit/test_rerank.py::test_rerank_top_n[4-4] PASSED [ 68%] unit/test_rerank.py::test_rerank_top_n[99-4] PASSED [ 75%] unit/test_rerank.py::test_rerank_tei_top_n[None-4] PASSED [ 81%] unit/test_rerank.py::test_rerank_tei_top_n[2-2] PASSED [ 87%] unit/test_rerank.py::test_rerank_tei_top_n[4-4] PASSED [ 93%] unit/test_rerank.py::test_rerank_tei_top_n[99-4] PASSED [100%] =================================================================== 16 passed in 8.84s =================================================================== * editor config check fix
Assets 15
b6732
97870e6
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Compare
cuda : avoid initializing unused devices (#16510)
Assets 15
Previous Next
You can’t perform that action at this time.