| CARVIEW |
Select Language
HTTP/2 200
date: Mon, 29 Dec 2025 17:56:25 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"4966b1f10f461a87e21fc7908ccc5cad"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com github.githubassets.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com wss://alive-staging.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com marketplace-screenshots.githubusercontent.com/ copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com github.githubassets.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=r%2BovaAcFf2ca3WUg2v5zVzPttqCDVrklrcxYZp5TF1qX76RSUWfDSOl33PnNtamnzZSSwFmLemkh0QcJTOoOe%2BO54HPqrHG7CDAr4OxaoFdCI3mq%2Bfdg67%2FayOiFRuVlde%2FLUP7NSDOUoB8h7%2FihmEDdeglkIEepWvnjy89x21P%2BwCwTcU091%2BOCc%2FoCy6WxFOwqqQiTbSvf9L09wtnFyUyPLKMNh093tt38aZ3KTyYC25yZ7IpKRDT22T73jWNEG73bOR8OeFdt1QkFsjM52A%3D%3D--JyL3TgsSCMvd4ALC--bM1IGSJ8E9iX347Of5C1zg%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.450968696.1767030984; Path=/; Domain=github.com; Expires=Tue, 29 Dec 2026 17:56:24 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Tue, 29 Dec 2026 17:56:24 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: D95A:1533AA:655FE2B:795457C:6952C0C8
Releases · microsoft/VPTQ · GitHub
06 Mar 13:01
Loading
18 Jan 12:20
Loading
18 Nov 14:32
Loading
01 Nov 03:20
Loading
08 Oct 10:03
Loading
05 Oct 09:54
Loading
26 Sep 05:24
Loading
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 49
Releases: microsoft/VPTQ
Releases · microsoft/VPTQ
v0.0.5post1
584bb75
This commit was created on GitHub.com and signed with GitHub’s verified signature.
What's Changed
- Update README.md by @YangWang92 in #167
- refactor (csrc): Restructure C++ code organization to facilitate adding new kernels by @lcy-seso in #169
- [pack save] fix save cpu offloda model by @wejoncy in #177
- fix bug on logger res_index by @YangWang92 in #178
- Update README.md by @YangWang92 in #179
- add index bitwidth=10 by @YangWang92 in #181
Full Changelog: v0.0.5...v0.0.5post1
Assets 7
v0.0.5
692a41c
This commit was created on GitHub.com and signed with GitHub’s verified signature.
What's Changed
- Update README.md by @YangWang92 in #127
- Update README.md by @wejoncy in #129
- Update vptq_example.ipynb by @YangWang92 in #138
- Update README.md by @OpenSourceRonin in #141
- Update README.md by @OpenSourceRonin in #142
- update huggingface transformers support by @YangWang92 in #143
- Update README.md by @YangWang92 in #144
- refactor: refactor and optimize python code implementations. by @lcy-seso in #145
- Update README.md by @YangWang92 in #146
- fix: a small bug fix for the initialization of the residual index tensor. by @lcy-seso in #147
- fix(build): build using cmake. by @lcy-seso in #149
- update setuptools version by @YangWang92 in #151
- fix: Fix the bug where the Torch library is not correctly linked. by @lcy-seso in #152
- update version by @YangWang92 in #153
- fix loading by @wejoncy in #154
- fix perm by @wejoncy in #157
- add tools by @wejoncy in #158
- quick fix by @wejoncy in #160
- Update README.md by @wejoncy in #161
- fix(build): fix the undefined symbols runtime error. by @lcy-seso in #162
- fix(cmake): building dynamic library for specified GPU architectures and support multi threads compile by @lcy-seso in #164
- bump to 0.0.5 by @wejoncy in #165
- fix(csrc): Remove strong dependency on specific Torch version. by @lcy-seso in #166
New Contributors
Full Changelog: v0.0.4...v0.0.5
Assets 7
v0.0.4
139a380
This commit was created on GitHub.com and signed with GitHub’s verified signature.
What's Changed
- Update README.md by @YangWang92 in #108
- Update README.md by @YangWang92 in #109
- Add CUDA_HOME instructions to README by @caronzh03 in #112
- Update README.md by @YangWang92 in #117
- fix config format for transformers by @wejoncy in #120
- Bump to 0.0.4 by @wejoncy in #121
- fix version__ by @wejoncy in #122
- fix setup version number by @wejoncy in #123
- Update model_base.py by @YangWang92 in #124
New Contributors
- @caronzh03 made their first contribution in #112
Full Changelog: v0.0.3...v0.0.4
Assets 7
v0.0.3
46ad33b
This commit was created on GitHub.com and signed with GitHub’s verified signature.
What's Changed
- Update README.md by @OpenSourceRonin in #40
- Update README.md by @YangWang92 in #43
- add sm_89 by @wejoncy in #45
- Update README.md by @OpenSourceRonin in #46
- add notebook by @YangWang92 in #48
- docs: update README.md by @eltociear in #47
- add catlog and index for readme by @wejoncy in #49
- support bf16 by @wejoncy in #51
- add gpu monitor at web app by @TITC in #50
- update installation by @wejoncy in #55
- Delete models directory by @YangWang92 in #58
- fix offload bug in accelerator by @wejoncy in #60
- Revert "fix offload bug in accelerator" by @wejoncy in #61
- support rocm by @wejoncy in #63
- rocm fix by @wejoncy in #65
- improve web app demo by @YangWang92 in #66
- Update setup.py by @YangWang92 in #68
- refine online demo by @YangWang92 in #69
- Update README.md by @OpenSourceRonin in #70
- Update setup.py by @YangWang92 in #72
- update device map by @YangWang92 in #74
- Update README.md by @OpenSourceRonin in #80
- Update README.md by @OpenSourceRonin in #81
- Update README.md by @OpenSourceRonin in #82
- Update README.md by @OpenSourceRonin in #83
- add math example by @YangWang92 in #84
- add acknowledgement and disclaimer by @YangWang92 in #85
- Update README.md by @YangWang92 in #94
- fix format by @YangWang92 in #95
- Use absolute imports by @bndos in #90
- Set version by @YangWang92 in #97
- fix compiling error by @YangWang92 in #98
- Update vqlinear.py by @laomao0 in #103
- add package info by @YangWang92 in #104
- update pyproject by @YangWang92 in #106
New Contributors
- @eltociear made their first contribution in #47
- @TITC made their first contribution in #50
- @bndos made their first contribution in #90
- @laomao0 made their first contribution in #103
Full Changelog: v0.0.2...v0.0.3
Assets 7
v0.0.2.post1
d515ec5
This commit was created on GitHub.com and signed with GitHub’s verified signature.
What's Changed
- Update README.md by @OpenSourceRonin in #40
- Update README.md by @YangWang92 in #43
- add sm_89 by @wejoncy in #45
- Update README.md by @OpenSourceRonin in #46
- add notebook by @YangWang92 in #48
- docs: update README.md by @eltociear in #47
- add catlog and index for readme by @wejoncy in #49
- support bf16 by @wejoncy in #51
- add gpu monitor at web app by @TITC in #50
New Contributors
- @eltociear made their first contribution in #47
- @TITC made their first contribution in #50
Full Changelog: v0.0.2...v0.0.2.post1
Assets 7
v0.0.2
f540ff3
This commit was created on GitHub.com and signed with GitHub’s verified signature.
What's Changed
- Update README.md by @YangWang92 in #22
- Update README.md by @YangWang92 in #25
- Patch 1 by @OpenSourceRonin in #27
- Update README.md by @OpenSourceRonin in #28
- Update README.md by @YangWang92 in #30
- Update README.md by @OpenSourceRonin in #31
- Update README.md by @YangWang92 in #32
- Update README.md by @YangWang92 in #33
- Update README.md by @OpenSourceRonin in #34
- Update README.md by @OpenSourceRonin in #35
- add prompt args and check cuda kernel by @YangWang92 in #36
- update readme and tech report by @YangWang92 in #37
- support cuda-arch_list by @wejoncy in #38
- bump to 0.0.2 by @wejoncy in #39
Full Changelog: v0.0.1...v0.0.2
Assets 7
2 people reacted
v0.0.1
03b4187
This commit was created on GitHub.com and signed with GitHub’s verified signature.
What's Changed
- add Acknowledgement by @YangWang92 in #14
- gradio app by @wejoncy in #16
- Update README.md by @YangWang92 in #18
- publish ci by @wejoncy in #17
- Bias by @wejoncy in #19
- fix out of bound error by @wejoncy in #20
- Add open source community by @OpenSourceRonin in #21
- memory and bf16 by @wejoncy in #23
New Contributors
- @OpenSourceRonin made their first contribution in #21
Full Changelog: https://github.com/microsoft/VPTQ/commits/v0.0.1
Assets 7
You can’t perform that action at this time.