CARVIEW |
Select Language
HTTP/2 200
date: Tue, 29 Jul 2025 03:08:15 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
x-repository-download: git clone https://github.com/PaddlePaddle/Paddle.git
etag: W/"371665fa88149f2be5efd327512f6d52"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=kCu6Y2WVux28HIk5Hq8Dnl4RP8ti39S54Kr8layi4rynJElqojFaQS0d3MWLZyTaCtCDaFdwBI3dFvyt3V0RmN%2F4bRo6ZQQ%2B0uOuxpuIweLRvlPO4Wa2QVga3JStlP4uKk%2Ba0R4ZiEqvKxV5ZXKyBz%2FWePoXI78QwykWQZzQqgvJ76Mu0djEUAPi4%2FE6sC9U%2B12OHDNDm7Di8s%2BKdzif%2BdhHe8z7YATdmVUT9Hc5aQCK3yu4oDudRPVrTzwAI7hXe1S5zmLeqyNggKwswhR%2Flw%3D%3D--1gB9nMS3Il%2BHlhvE--bfOTI1rhhIdS7XcwQw%2BFaQ%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1900032583.1753758494; Path=/; Domain=github.com; Expires=Wed, 29 Jul 2026 03:08:14 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Wed, 29 Jul 2026 03:08:14 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: B176:E69F3:2068EA:2FC74B:68883B1E
[inference] Support wint4 groupwise with cutlass gemm (#60422) · PaddlePaddle/Paddle@193fea3 · GitHub
Copy file name to clipboardExpand all lines: paddle/phi/kernels/fusion/cutlass/cutlass_extensions/arch/mma.h
Copy file name to clipboardExpand all lines: paddle/phi/kernels/fusion/cutlass/cutlass_extensions/ft_gemm_configs.h
Copy file name to clipboardExpand all lines: paddle/phi/kernels/fusion/cutlass/cutlass_extensions/gemm/kernel/fpA_intB_gemm.h
Copy file name to clipboardExpand all lines: paddle/phi/kernels/fusion/cutlass/cutlass_extensions/gemm/kernel/fpA_intB_gemm_split_k.h
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 5.8k
Commit 193fea3
authored
[inference] Support wint4 groupwise with cutlass gemm (#60422)
* support gemv-groupwise func && weightQuanter-groupwise && weightDeQuanter-groupwise
* fix build bug
* add unit_test && fix bug
* delete useless code
* fix ci build bug
* fix ci && optimize
* fix merge conflict
* add op change info
* fix weight_only_linear_pass
* fix format
* solve ci unit_test
* init
* support cutlass gemm with groupwise
* add unit test
* fix strange bug
* delete random bug
* fix sm70 build bug
* try to fix ci build bug
* fix bug
* fix volta build bug
* skip sm70 in groupwise mode
* change cutlass branch1 parent aacdc4d commit 193fea3Copy full SHA for 193fea3
File tree
Expand file treeCollapse file tree
18 files changed
+2429
-769
lines changedFilter options
- paddle/phi/kernels
- fusion/cutlass
- cutlass_extensions
- arch
- gemm
- kernel
- threadblock
- warp
- transform/threadblock
- cutlass_kernels
- fpA_intB_gemm
- gpu
- test/quantization
Expand file treeCollapse file tree
18 files changed
+2429
-769
lines changedpaddle/phi/kernels/fusion/cutlass/cutlass_extensions/arch/mma.h
Copy file name to clipboardExpand all lines: paddle/phi/kernels/fusion/cutlass/cutlass_extensions/arch/mma.h+81-12Lines changed: 81 additions & 12 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 |
| - | |
2 |
| - | |
| 1 | + | |
| 2 | + | |
| 3 | + | |
3 | 4 |
| |
4 |
| - | |
5 |
| - | |
6 |
| - | |
| 5 | + | |
| 6 | + | |
7 | 7 |
| |
8 |
| - | |
| 8 | + | |
| 9 | + | |
9 | 10 |
| |
10 |
| - | |
11 |
| - | |
12 |
| - | |
13 |
| - | |
14 |
| - | |
15 |
| - | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
16 | 32 |
| |
17 | 33 |
| |
18 | 34 |
| |
| |||
42 | 58 |
| |
43 | 59 |
| |
44 | 60 |
| |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
45 | 114 |
| |
46 | 115 |
|
paddle/phi/kernels/fusion/cutlass/cutlass_extensions/ft_gemm_configs.h
Copy file name to clipboardExpand all lines: paddle/phi/kernels/fusion/cutlass/cutlass_extensions/ft_gemm_configs.h+3-1Lines changed: 3 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
61 | 61 |
| |
62 | 62 |
| |
63 | 63 |
| |
64 |
| - | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
65 | 67 |
| |
66 | 68 |
| |
67 | 69 |
| |
|
paddle/phi/kernels/fusion/cutlass/cutlass_extensions/gemm/kernel/fpA_intB_gemm.h
Copy file name to clipboardExpand all lines: paddle/phi/kernels/fusion/cutlass/cutlass_extensions/gemm/kernel/fpA_intB_gemm.h+74-8Lines changed: 74 additions & 8 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
56 | 56 |
| |
57 | 57 |
| |
58 | 58 |
| |
59 |
| - | |
| 59 | + | |
60 | 60 |
| |
| 61 | + | |
| 62 | + | |
61 | 63 |
| |
62 | 64 |
| |
63 | 65 |
| |
| |||
103 | 105 |
| |
104 | 106 |
| |
105 | 107 |
| |
| 108 | + | |
106 | 109 |
| |
107 | 110 |
| |
108 | 111 |
| |
| |||
125 | 128 |
| |
126 | 129 |
| |
127 | 130 |
| |
| 131 | + | |
128 | 132 |
| |
129 | 133 |
| |
130 | 134 |
| |
| |||
143 | 147 |
| |
144 | 148 |
| |
145 | 149 |
| |
| 150 | + | |
146 | 151 |
| |
147 | 152 |
| |
148 | 153 |
| |
| |||
181 | 186 |
| |
182 | 187 |
| |
183 | 188 |
| |
| 189 | + | |
184 | 190 |
| |
185 | 191 |
| |
186 | 192 |
| |
| |||
192 | 198 |
| |
193 | 199 |
| |
194 | 200 |
| |
| 201 | + | |
195 | 202 |
| |
196 | 203 |
| |
197 | 204 |
| |
| |||
276 | 283 |
| |
277 | 284 |
| |
278 | 285 |
| |
| 286 | + | |
| 287 | + | |
| 288 | + | |
| 289 | + | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
| 293 | + | |
| 294 | + | |
| 295 | + | |
| 296 | + | |
| 297 | + | |
| 298 | + | |
| 299 | + | |
| 300 | + | |
| 301 | + | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
| 307 | + | |
| 308 | + | |
| 309 | + | |
| 310 | + | |
| 311 | + | |
| 312 | + | |
| 313 | + | |
| 314 | + | |
| 315 | + | |
| 316 | + | |
| 317 | + | |
| 318 | + | |
| 319 | + | |
| 320 | + | |
| 321 | + | |
| 322 | + | |
| 323 | + | |
| 324 | + | |
| 325 | + | |
| 326 | + | |
| 327 | + | |
| 328 | + | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
279 | 332 |
| |
280 | 333 |
| |
281 | 334 |
| |
| |||
335 | 388 |
| |
336 | 389 |
| |
337 | 390 |
| |
| 391 | + | |
| 392 | + | |
| 393 | + | |
| 394 | + | |
338 | 395 |
| |
339 |
| - | |
| 396 | + | |
340 | 397 |
| |
341 | 398 |
| |
342 | 399 |
| |
| |||
368 | 425 |
| |
369 | 426 |
| |
370 | 427 |
| |
371 |
| - | |
372 |
| - | |
373 |
| - | |
374 |
| - | |
375 |
| - | |
| 428 | + | |
| 429 | + | |
| 430 | + | |
| 431 | + | |
| 432 | + | |
| 433 | + | |
| 434 | + | |
| 435 | + | |
| 436 | + | |
| 437 | + | |
376 | 438 |
| |
377 | 439 |
| |
378 | 440 |
| |
| |||
383 | 445 |
| |
384 | 446 |
| |
385 | 447 |
| |
386 |
| - | |
| 448 | + | |
| 449 | + | |
| 450 | + | |
| 451 | + | |
| 452 | + | |
387 | 453 |
| |
388 | 454 |
| |
389 | 455 |
| |
|
paddle/phi/kernels/fusion/cutlass/cutlass_extensions/gemm/kernel/fpA_intB_gemm_split_k.h
Copy file name to clipboardExpand all lines: paddle/phi/kernels/fusion/cutlass/cutlass_extensions/gemm/kernel/fpA_intB_gemm_split_k.h+1-1Lines changed: 1 addition & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
847 | 847 |
| |
848 | 848 |
| |
849 | 849 |
| |
850 |
| - | |
| 850 | + | |
851 | 851 |
| |
852 | 852 |
| |
853 | 853 |
| |
|
You can’t perform that action at this time.
0 commit comments