CARVIEW |
Select Language
HTTP/2 200
date: Wed, 23 Jul 2025 22:56:05 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
x-repository-download: git clone https://github.com/intel/neural-compressor.git
etag: W/"6f365ae96eb0db439c7e7c35e10d6f02"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=7O5pYkhHEYzlXkCKnil%2Bb4qvzdOUiEnidUou108FzfWhGr0MSRL%2Bos%2B%2BDdfdXGtrBXiXHXr%2B5dbs4H32zZDadfkHQXZwHRFzAjzAcQTfPQCkRvawJsT4VfBgMezivPusT7vU9abIYzE%2BtyfKIjDoJ%2FBVvMrRTma9OKV86J8Rg51fSA0oCtC5YjTo%2BBPGCFe6rfiZfv101EqshtskQT5rvImlSaW253ZPYu7hcPHU4Ks56Rr27iJ8RxIFhUWnGtJMLDriVU8lJNx8dvn12TTWWg%3D%3D--g%2BFJ52YEY0NsIsx%2F--5lB6UzvurlXUI82156W%2BKA%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1081396599.1753311364; Path=/; Domain=github.com; Expires=Thu, 23 Jul 2026 22:56:04 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Thu, 23 Jul 2026 22:56:04 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: E650:0A76:1002F9:16D154:68816884
ONNXRT LLM examples support latest optimum version (#1578) · intel/neural-compressor@26b260e · GitHub
Copy file name to clipboardExpand all lines: examples/.config/model_params_onnxrt.json
Copy file name to clipboardExpand all lines: examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/README.md
Copy file name to clipboardExpand all lines: examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/main.py
Copy file name to clipboardExpand all lines: examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/prepare_model.py
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 277
Commit 26b260e
File tree
Expand file treeCollapse file tree
12 files changed
+316
-232
lines changedFilter options
- examples
- .config
- onnxrt/nlp/huggingface_model/text_generation/llama/quantization
- ptq_static
- weight_only
- neural_compressor/model
Expand file treeCollapse file tree
12 files changed
+316
-232
lines changedexamples/.config/model_params_onnxrt.json
Copy file name to clipboardExpand all lines: examples/.config/model_params_onnxrt.json+49-7Lines changed: 49 additions & 7 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
756 | 756 |
| |
757 | 757 |
| |
758 | 758 |
| |
759 |
| - | |
| 759 | + | |
760 | 760 |
| |
761 | 761 |
| |
762 |
| - | |
| 762 | + | |
| 763 | + | |
| 764 | + | |
| 765 | + | |
| 766 | + | |
| 767 | + | |
| 768 | + | |
| 769 | + | |
763 | 770 |
| |
764 | 771 |
| |
765 | 772 |
| |
766 | 773 |
| |
767 | 774 |
| |
768 | 775 |
| |
769 |
| - | |
| 776 | + | |
| 777 | + | |
| 778 | + | |
| 779 | + | |
| 780 | + | |
| 781 | + | |
| 782 | + | |
| 783 | + | |
770 | 784 |
| |
771 | 785 |
| |
772 | 786 |
| |
773 | 787 |
| |
774 | 788 |
| |
775 | 789 |
| |
776 |
| - | |
| 790 | + | |
| 791 | + | |
| 792 | + | |
| 793 | + | |
| 794 | + | |
| 795 | + | |
| 796 | + | |
| 797 | + | |
777 | 798 |
| |
778 | 799 |
| |
779 | 800 |
| |
780 | 801 |
| |
781 | 802 |
| |
782 | 803 |
| |
783 |
| - | |
| 804 | + | |
| 805 | + | |
| 806 | + | |
| 807 | + | |
| 808 | + | |
| 809 | + | |
| 810 | + | |
| 811 | + | |
784 | 812 |
| |
785 | 813 |
| |
786 | 814 |
| |
787 | 815 |
| |
788 | 816 |
| |
789 | 817 |
| |
790 |
| - | |
| 818 | + | |
| 819 | + | |
| 820 | + | |
| 821 | + | |
| 822 | + | |
| 823 | + | |
| 824 | + | |
| 825 | + | |
791 | 826 |
| |
792 | 827 |
| |
793 | 828 |
| |
794 | 829 |
| |
795 | 830 |
| |
796 | 831 |
| |
797 |
| - | |
| 832 | + | |
| 833 | + | |
| 834 | + | |
| 835 | + | |
| 836 | + | |
| 837 | + | |
| 838 | + | |
| 839 | + | |
798 | 840 |
| |
799 | 841 |
| |
800 | 842 |
| |
|
examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/README.md
Copy file name to clipboardExpand all lines: examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/README.md+4-2Lines changed: 4 additions & 2 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
27 | 27 |
| |
28 | 28 |
| |
29 | 29 |
| |
30 |
| - | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
31 | 33 |
| |
32 | 34 |
| |
33 | 35 |
| |
| |||
41 | 43 |
| |
42 | 44 |
| |
43 | 45 |
| |
44 |
| - | |
| 46 | + | |
45 | 47 |
| |
46 | 48 |
| |
47 | 49 |
| |
|
examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/main.py
Copy file name to clipboardExpand all lines: examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/main.py+60-68Lines changed: 60 additions & 68 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
84 | 84 |
| |
85 | 85 |
| |
86 | 86 |
| |
87 |
| - | |
| 87 | + | |
88 | 88 |
| |
89 | 89 |
| |
90 | 90 |
| |
| |||
124 | 124 |
| |
125 | 125 |
| |
126 | 126 |
| |
127 |
| - | |
| 127 | + | |
128 | 128 |
| |
| 129 | + | |
129 | 130 |
| |
130 | 131 |
| |
131 | 132 |
| |
| |||
134 | 135 |
| |
135 | 136 |
| |
136 | 137 |
| |
137 |
| - | |
138 | 138 |
| |
139 | 139 |
| |
140 |
| - | |
141 |
| - | |
142 |
| - | |
143 |
| - | |
144 |
| - | |
145 |
| - | |
146 |
| - | |
147 |
| - | |
148 |
| - | |
149 |
| - | |
150 |
| - | |
151 |
| - | |
152 |
| - | |
153 |
| - | |
154 |
| - | |
155 |
| - | |
156 |
| - | |
157 |
| - | |
158 |
| - | |
159 |
| - | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
160 | 152 |
| |
161 | 153 |
| |
162 | 154 |
| |
| |||
192 | 184 |
| |
193 | 185 |
| |
194 | 186 |
| |
195 |
| - | |
| 187 | + | |
196 | 188 |
| |
197 | 189 |
| |
198 | 190 |
| |
| |||
201 | 193 |
| |
202 | 194 |
| |
203 | 195 |
| |
204 |
| - | |
| 196 | + | |
205 | 197 |
| |
206 | 198 |
| |
207 | 199 |
| |
| |||
234 | 226 |
| |
235 | 227 |
| |
236 | 228 |
| |
| 229 | + | |
237 | 230 |
| |
238 | 231 |
| |
239 | 232 |
| |
| |||
247 | 240 |
| |
248 | 241 |
| |
249 | 242 |
| |
250 |
| - | |
251 |
| - | |
252 |
| - | |
253 |
| - | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
254 | 248 |
| |
255 | 249 |
| |
256 | 250 |
| |
| |||
269 | 263 |
| |
270 | 264 |
| |
271 | 265 |
| |
272 |
| - | |
273 | 266 |
| |
274 | 267 |
| |
275 | 268 |
| |
276 |
| - | |
277 |
| - | |
278 |
| - | |
279 |
| - | |
280 |
| - | |
281 |
| - | |
282 |
| - | |
283 |
| - | |
284 |
| - | |
285 |
| - | |
286 |
| - | |
287 |
| - | |
288 |
| - | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
| 279 | + | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
289 | 286 |
| |
290 | 287 |
| |
291 | 288 |
| |
| |||
294 | 291 |
| |
295 | 292 |
| |
296 | 293 |
| |
297 |
| - | |
| 294 | + | |
298 | 295 |
| |
299 | 296 |
| |
300 | 297 |
| |
301 | 298 |
| |
302 | 299 |
| |
303 | 300 |
| |
| 301 | + | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
304 | 305 |
| |
305 | 306 |
| |
306 |
| - | |
| 307 | + | |
307 | 308 |
| |
308 | 309 |
| |
309 |
| - | |
| 310 | + | |
| 311 | + | |
310 | 312 |
| |
311 |
| - | |
312 |
| - | |
313 |
| - | |
314 |
| - | |
315 |
| - | |
316 |
| - | |
317 |
| - | |
318 |
| - | |
319 |
| - | |
320 |
| - | |
321 | 313 |
| |
322 |
| - | |
| 314 | + | |
323 | 315 |
| |
324 | 316 |
| |
325 |
| - | |
326 |
| - | |
327 |
| - | |
| 317 | + | |
| 318 | + | |
| 319 | + | |
328 | 320 |
| |
329 |
| - | |
330 |
| - | |
331 |
| - | |
332 |
| - | |
333 |
| - | |
334 |
| - | |
335 |
| - | |
336 |
| - | |
| 321 | + | |
| 322 | + | |
| 323 | + | |
| 324 | + | |
| 325 | + | |
| 326 | + | |
| 327 | + | |
| 328 | + |
examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/prepare_model.py
Copy file name to clipboardExpand all lines: examples/onnxrt/nlp/huggingface_model/text_generation/llama/quantization/ptq_static/prepare_model.py+24-33Lines changed: 24 additions & 33 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
10 | 10 |
| |
11 | 11 |
| |
12 | 12 |
| |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
13 | 18 |
| |
14 | 19 |
| |
15 | 20 |
| |
16 |
| - | |
| 21 | + | |
17 | 22 |
| |
18 |
| - | |
19 |
| - | |
20 |
| - | |
21 |
| - | |
22 |
| - | |
23 |
| - | |
24 |
| - | |
25 |
| - | |
26 |
| - | |
27 |
| - | |
28 |
| - | |
29 |
| - | |
30 |
| - | |
31 |
| - | |
32 |
| - | |
33 |
| - | |
34 |
| - | |
35 |
| - | |
36 |
| - | |
37 |
| - | |
38 |
| - | |
39 |
| - | |
40 |
| - | |
41 |
| - | |
42 |
| - | |
43 |
| - | |
44 |
| - | |
45 |
| - | |
46 |
| - | |
47 |
| - | |
48 |
| - | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
49 | 40 |
| |
50 | 41 |
| |
51 | 42 |
| |
52 | 43 |
| |
53 | 44 |
| |
54 | 45 |
| |
55 |
| - | |
| 46 | + |
You can’t perform that action at this time.
0 commit comments