fix stage2 main_grad acc bug (#59142) · PaddlePaddle/Paddle@929174f · GitHub


Commit 929174f
fix stage2 main_grad acc bug (#59142)
* fix stage2 main_grad acc bug
* update code according to suggestions
* scale in opt
* merge grad scale
* add note
* delete debug info
* keep offload unchanged
* Optimize the BF16 unittest of sharding stage2 and stage3.
* fix stage3 bug
* add fp16 judge
* add init
* add fp16
* fix grad clip
* add if data type is fp16
* change if location
* delete fault arg
* add enum.value
---------
Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
Co-authored-by: tianhaodongbd <tianhaodong@baidu.com>

1 parent 1266da1 commit 929174f
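
The commit title concerns accumulation into `main_grad`, and the message mentions moving the gradient scale into the optimizer. The diff bodies are not reproduced on this page, so the sketch below is not the commit's code. It is a generic, standard-library-only illustration, assuming `main_grad` denotes a higher-precision gradient buffer, of why low-precision gradients are commonly summed into an fp32 accumulator and scaled once at the end. All function names and the `world_size` parameter are hypothetical.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def to_fp32(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def accumulate_low_precision(grads):
    """Sum gradients while keeping the accumulator itself in fp16."""
    acc = 0.0
    for g in grads:
        acc = to_fp16(acc + to_fp16(g))
    return acc


def accumulate_main_grad(grads):
    """Sum fp16 gradients into an fp32 accumulator (hypothetical stand-in
    for a higher-precision main-gradient buffer)."""
    main_grad = 0.0
    for g in grads:
        main_grad = to_fp32(main_grad + to_fp16(g))
    return main_grad


def scale_once(total: float, world_size: int) -> float:
    """Apply the averaging scale a single time, after accumulation.
    `world_size` is an illustrative parameter, not taken from the commit."""
    return to_fp32(total / world_size)


# Many small gradients, as produced by repeated micro-batches.
grads = [1e-4] * 4096
low = accumulate_low_precision(grads)
main = accumulate_main_grad(grads)
print("fp16 accumulator:", low)
print("fp32 accumulator:", main)
print("scaled once     :", scale_once(main, 8))
```

In this toy setting the fp16 accumulator stops growing once its rounding step becomes large relative to each gradient, so it ends well below the true sum, while the fp32 accumulator stays close to 0.4096.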
File tree
5 files changed, +273 -283 lines changed
- paddle/phi/kernels/fusion/gpu
- python/paddle/distributed/fleet/meta_parallel/sharding
- test/collective/fleet

paddle/phi/kernels/fusion/gpu/fused_linear_param_grad_add_kernel.cu
7 additions & 0 deletions
@@ -134,6 +134,13 @@ added: new lines 137-143 (diff body not captured)

python/paddle/distributed/fleet/meta_parallel/sharding/group_sharded_optimizer_stage2.py
19 additions & 6 deletions
@@ -194,13 +194,26 @@ removed: old lines 197-199, 201-203; added: new lines 197-204, 206-216 (diff body not captured)

python/paddle/distributed/fleet/meta_parallel/sharding/group_sharded_stage2.py
32 additions & 15 deletions
@@ -155,6 +155,8 @@ added: new lines 158-159 (diff body not captured)
@@ -232,13 +234,20 @@ removed: old lines 235, 238-239, 241; added: new lines 237-239, 242-244, 246-250 (diff body not captured)
@@ -249,7 +258,6 @@ removed: old line 252 (diff body not captured)
@@ -258,11 +266,14 @@ removed: old lines 261, 263, 265; added: new lines 270, 272-276 (diff body not captured)
@@ -379,15 +390,21 @@ removed: old lines 382-385, 388-390; added: new lines 393-404, 407 (diff body not captured)