CARVIEW |
Select Language
HTTP/2 200
date: Wed, 23 Jul 2025 20:11:14 GMT
content-type: text/html; charset=utf-8
cache-control: no-cache
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
referrer-policy: no-referrer-when-downgrade
server-timing: pull_request_layout-fragment;desc="pull_request_layout fragment";dur=292.807022,conversation_content-fragment;desc="conversation_content fragment";dur=1162.837028,conversation_sidebar-fragment;desc="conversation_sidebar fragment";dur=411.247397,nginx;desc="NGINX";dur=1.404709,glb;desc="GLB";dur=100.536714
strict-transport-security: max-age=31536000; includeSubdomains; preload
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
x-content-type-options: nosniff
x-frame-options: deny
x-voltron-version: fd8fbbc
x-xss-protection: 0
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=rPPbtQnfai9kaM0xe6QJayWlHSYfhJnTF2CbcoESjLMAda4YKCb0w8Xo7YH7MvEDJHytYk7qI1Jat%2BfBv0SrLnLx6eUKAsDzOT4BRaYlBPMK3hsizl7NvedVA8c0z2WZVSl0N0ccBNDK5GcqIu0Twz33WlGpCY2D9Cs%2BG5zag3InqIsQXJnTMaIItfNH0e7Wi742MerpbyOLCNoYIfxSV2jfCef7okvyM2Ue6n%2BroCxyqpbXXGmtP8VAe9m1MlDIYy7vYeWgHYeKLt0i5NmGVg%3D%3D--HwUqhPU0wbiyD1va--LUwrq7xG9iGD%2BbuVE8rrDg%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1896912873.1753301473; Path=/; Domain=github.com; Expires=Thu, 23 Jul 2026 20:11:13 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Thu, 23 Jul 2026 20:11:13 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: E0D6:941DF:1082FF0:139C015:688141E1
Add fp8 blockwise quant & blockwise gemm op by lshpku · Pull Request #73364 · PaddlePaddle/Paddle · GitHub
lshpku
force-pushed
the
add-quantize-1x128-kernel
branch
from
June 17, 2025 08:04
lshpku
force-pushed
the
add-quantize-1x128-kernel
branch
6 times, most recently
from
June 18, 2025 01:43
lshpku
force-pushed
the
add-quantize-1x128-kernel
branch
2 times, most recently
from
June 18, 2025 05:06
lshpku
changed the title
Add quantize_1x128_kernel
Add fp8 blockwise quant & blockwise gemm op
Jun 18, 2025
lshpku
force-pushed
the
add-quantize-1x128-kernel
branch
4 times, most recently
from
June 19, 2025 09:25
lshpku
force-pushed
the
add-quantize-1x128-kernel
branch
from
June 20, 2025 02:45
lshpku
force-pushed
the
add-quantize-1x128-kernel
branch
from
June 20, 2025 04:38
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 5.8k
Add fp8 blockwise quant & blockwise gemm op #73364
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
lshpku
merged 20 commits into
PaddlePaddle:develop
from
lshpku:add-quantize-1x128-kernel
Jun 21, 2025
Merged
Add fp8 blockwise quant & blockwise gemm op #73364
lshpku
merged 20 commits into
PaddlePaddle:develop
from
lshpku:add-quantize-1x128-kernel
Jun 21, 2025
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
b06546b
to
d7c4530
Compare
443c3d7
to
7484f14
Compare
A-nnonymous
reviewed
Jun 18, 2025
A-nnonymous
reviewed
Jun 18, 2025
A-nnonymous
approved these changes
Jun 18, 2025
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM in all CUDA kernels
f934224
to
aa6d572
Compare
9014b93
to
4165839
Compare
SigureMo
reviewed
Jun 20, 2025
080dd76
to
e4160b6
Compare
zyfncg
reviewed
Jun 20, 2025
99af5d9
to
1d25c42
Compare
zhangbo9674
approved these changes
Jun 21, 2025
zyfncg
approved these changes
Jun 21, 2025
XiaoguangHu01
approved these changes
Jun 21, 2025
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
SigureMo
approved these changes
Jun 21, 2025
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
You can’t perform that action at this time.
PR Category
Operator Mechanism
PR Types
New features
Description
新增FP8量化相关的2个API:
fp8_gemm_blockwise
、fp8_quant_blockwise
Pcard-91067