fuse vit attention for faster-rcnn on BML #54139

abenmao · 2023-05-26T09:22:57Z

PR types

Performance optimization

PR changes

Others

Description

For Vison Transformer in Faster-RCNN model, adds a fuse that replaces the scaled_dp_attention in the self-attention with a single op.
On Cascade Lake, this optimization leads to a 1.67X speedup in the VisionTransformer block and a 1.46X improvement (reducing the latency from 3.8s to 2.6s) in the end-to-end inference time for Faster-RCNN model (from BML).
On Sapphire Rapids, the optimization has the potential to achieve a speedup of bf16/int8 by reducing the amount of memory access.

paddle-bot · 2023-05-26T09:23:02Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

CLAassistant · 2023-05-26T09:23:04Z

All committers have signed the CLA.

paddle-bot · 2023-05-26T09:23:05Z

❌ The PR is not created using PR's template. You can refer to this Demo.
Please use PR's template, it helps save our maintainers' time so that more developers get helped.

yaomichael · 2023-05-31T06:24:28Z

@YuanRisheng

YuanRisheng

LGTM

abenmao · 2023-06-02T03:29:25Z

@YuanRisheng Rebased. Please help have a check again. Thanks!

weishengying · 2023-06-02T06:49:11Z

LGTM

qingqing01

考虑到算子库及OneDNN很多算子都已经迁移到Phi算子，融合算子也有已经迁移到Phi的，倾向于直接使用Phi实现。
PR描述里也给下性能提升数据吧

paddle-bot · 2023-06-07T06:29:12Z

很抱歉，经过我们的反复讨论，你的PR暂未达到合入标准，请阅读飞桨原生算子开发规范，你可以重新提交新的PR，我们先将此PR关闭，感谢你的贡献。
Sorry to inform you that through our discussion, your PR fails to meet the merging standard (Reference: Paddle Custom Operator Design Doc). You can also submit an new one. Thank you.

abenmao · 2023-06-07T06:48:31Z

@qingqing01 您好，1. 对于第一个问题，CI里提到要通过@risheng等人的review，和相关人员以及reviewer进行了讨论，会后@YuanRisheng 这边做了进一步的处理判断，最后确认目前加op的方法是可以的（LGTM）。 2. 性能提升相关的数据加到PR描述里了。

risemeup1 · 2023-06-10T11:21:02Z

@abenmao 你好，paddle/fluid/operators/fused/self_dp_attention_op.h和self_dp_attention_op.cc文件引入之后，在gcc12下会报错一个未初始化的错误，幸苦修复一下

abenmao · 2023-06-10T13:37:15Z

你好 @risemeup1 问题已收到，情况确认后会及时联系。

risemeup1 · 2023-06-10T13:38:14Z

你好 @risemeup1 问题已收到，情况确认后会及时联系。

ok

abenmao · 2023-06-11T08:39:30Z

@risemeup1 你好，在gcc12.1下进行了排查，发现是gcc12.1这个版本已知的一个issue，具体bug参见链接： https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105593 我们如果要workaround这个问题的话，可以在相关的地方加一个flag，用来把它进行隐藏，具体方法如下：

risemeup1 · 2023-06-11T08:42:29Z

@risemeup1 你好，在gcc12.1下进行了排查，发现是gcc12.1这个版本已知的一个issue，具体bug参见链接： https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105593 我们如果要workaround这个问题的话，可以在相关的地方加一个flag，用来把它进行隐藏，具体方法如下：

这边幸苦提一个PR绕过一下吧，我觉得这样改没问题的

risemeup1 · 2023-06-11T08:43:17Z

@risemeup1 你好，在gcc12.1下进行了排查，发现是gcc12.1这个版本已知的一个issue，具体bug参见链接： https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105593 我们如果要workaround这个问题的话，可以在相关的地方加一个flag，用来把它进行隐藏，具体方法如下：

这边幸苦提一个PR绕过一下吧，我觉得这样改没问题的

其实我理解这个bug是不是可以通过初始化赋值的方法也能解决啊

abenmao · 2023-06-12T01:37:30Z

这边幸苦提一个PR绕过一下吧，我觉得这样改没问题的

其实我理解这个bug是不是可以通过初始化赋值的方法也能解决啊

@risemeup1 这个未初始化的赋值是gnu自己里边的代码没有初始化：

我们目前只能workaround

abenmao · 2023-06-12T03:52:17Z

@risemeup1 刚要提pr，rebase发现你已经提交了~

risemeup1 · 2023-06-12T03:53:39Z

@risemeup1 刚要提pr，rebase发现你已经提交了~

对，已经合入了，比较急，所以就周末提了，幸苦

paddle-bot bot added contributor External developers status: proposed labels May 26, 2023

abenmao force-pushed the fused_vit_atten branch 3 times, most recently from 5c3c75b to 3cb4b1f Compare May 29, 2023 08:46

xinyu-intel added the Intel label May 30, 2023

paddle-bot bot removed the status: proposed label May 30, 2023

abenmao force-pushed the fused_vit_atten branch 3 times, most recently from 25cb7d8 to 2f4146d Compare May 30, 2023 06:30

onecatcn assigned weishengying May 31, 2023

YuanRisheng previously approved these changes May 31, 2023

View reviewed changes

abenmao dismissed YuanRisheng’s stale review via 30831ea June 1, 2023 06:42

abenmao force-pushed the fused_vit_atten branch from 2f4146d to 30831ea Compare June 1, 2023 06:42

fuse vit attention for faster-rcnn on BML

2d5ed47

abenmao force-pushed the fused_vit_atten branch from 30831ea to 2d5ed47 Compare June 2, 2023 01:23

YuanRisheng approved these changes Jun 2, 2023

View reviewed changes

qingqing01 reviewed Jun 7, 2023

View reviewed changes

abenmao closed this Jun 7, 2023

paddle-bot bot added the status: not progressed label Jun 7, 2023

abenmao reopened this Jun 7, 2023

abenmao requested a review from qingqing01 June 7, 2023 10:06

qingqing01 approved these changes Jun 8, 2023

View reviewed changes

qingqing01 merged commit fc88020 into PaddlePaddle:develop Jun 8, 2023

yuanlehome mentioned this pull request Oct 26, 2023

【Hackathon 5th No.104】move fusion_repeated_fc_relu/fusion_squared_mat_sub to phi #58300

Merged

fuse vit attention for faster-rcnn on BML #54139

fuse vit attention for faster-rcnn on BML #54139

Uh oh!

Conversation

abenmao commented May 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR types

PR changes

Description

Uh oh!

paddle-bot bot commented May 26, 2023

Uh oh!

CLAassistant commented May 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paddle-bot bot commented May 26, 2023

Uh oh!

yaomichael commented May 31, 2023

Uh oh!

YuanRisheng left a comment

Choose a reason for hiding this comment

Uh oh!

abenmao commented Jun 2, 2023

Uh oh!

weishengying commented Jun 2, 2023

Uh oh!

qingqing01 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

paddle-bot bot commented Jun 7, 2023

Uh oh!

abenmao commented Jun 7, 2023

Uh oh!

risemeup1 commented Jun 10, 2023

Uh oh!

abenmao commented Jun 10, 2023

Uh oh!

risemeup1 commented Jun 10, 2023

Uh oh!

abenmao commented Jun 11, 2023

Uh oh!

risemeup1 commented Jun 11, 2023

Uh oh!

risemeup1 commented Jun 11, 2023

Uh oh!

abenmao commented Jun 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

abenmao commented Jun 12, 2023

Uh oh!

risemeup1 commented Jun 12, 2023

Uh oh!

Uh oh!

abenmao commented May 26, 2023 •

edited

Loading

CLAassistant commented May 26, 2023 •

edited

Loading

qingqing01 left a comment •

edited

Loading

abenmao commented Jun 12, 2023 •

edited

Loading