[XPU] Add rms_norm and rms_norm_grad op #63989
Conversation
Your PR has been submitted successfully. Thank you for contributing to this open-source project!
Sorry to inform you that 8aade47's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
Sorry to inform you that d7554df's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
paddle.autograd.backward([out], [out_g], True)
return out, (x.grad, scale.grad)

# dtypes = [paddle.float32, paddle.bfloat16, paddle.float16]
For the bf16 data type, the unit tests composed from small ops still have some minor issues on both kl2 and kl3; it will be added back after the API is updated.
#pragma once

#include "paddle/phi/core/dense_tensor.h"
#include "paddle/phi/core/selected_rows.h"
This header file doesn't appear to be used.
OK, I'll delete it together in the next PR.
@@ -13,6 +13,7 @@ See the License for the specific language governing permissions and
limitations under the License. */

#include "paddle/phi/infermeta/backward.h"
#include "glog/logging.h"
Is this header file actually used?
It was added while debugging; I'll remove it later.
LGTM
From an external user's perspective, what is the difference between adding this new op binding and going through the existing fast_paddle path?
On one hand, this op comes from PaddleNLP; from the GPU side it has already been moved into PaddleNLP's legacy folder, and migrating ops from PaddleNLP into Paddle is the general trend. On the other hand, from the XPU side, the role of fast_paddle needs to be gradually phased out so that users can run without modifying GPU code; whatever can be migrated should be migrated, and for external users it is friendlier to use the ops provided in Paddle itself.
For the model-side code changes in PaddleNLP, see PaddlePaddle/PaddleNLP#8746.
LGTM
PR Category
Custom Device
PR Types
New features
Description
Add the XPU rms_norm and rms_norm_grad fused operators, consistent with the GPU implementation.
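For context on what the fused op computes: rms_norm normalizes each row by its root mean square over the last axis and then applies an elementwise scale. A minimal NumPy reference sketch (the function name, argument order, and epsilon default here are illustrative assumptions, not Paddle's actual API):

```python
import numpy as np

def rms_norm_ref(x, scale, epsilon=1e-6):
    # Root mean square over the last axis, kept for broadcasting.
    mean_square = np.mean(np.square(x), axis=-1, keepdims=True)
    # Normalize, then apply the learnable elementwise scale.
    return x / np.sqrt(mean_square + epsilon) * scale

x = np.array([[1.0, 2.0, 3.0, 4.0]], dtype=np.float32)
scale = np.ones(4, dtype=np.float32)
out = rms_norm_ref(x, scale)  # same shape as x
```

A fused kernel computes this (and, for rms_norm_grad, its backward pass) in a single op instead of stitching together mean/sqrt/divide/multiply small ops, which is what the unit test above compares against.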