【Hackathon 7th No.32】Enhance paddle.nn.functional.scaled_dot_product_attention #70166
Conversation
Your PR was submitted successfully. Thank you for your contribution to the open-source project!
        },
    )
    return out
elif sdp_func_name == "mem_efficient":
One question: the mem_efficient branch of nn.functional.flash_attention calls memory_efficient_attention directly, whereas this code uses variable_length_memory_efficient_attention plus some extra assembly logic. Is that because of the difference in attn_mask support?
Yes, memory_efficient_attention does not support a mask. This calls the variant that does support a mask and flattens seq_lens into the arguments.
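For context, a minimal sketch of the kind of assembly described here, assuming variable_length_memory_efficient_attention is exposed under paddle.incubate.nn.functional and takes per-batch seq_lens/kv_seq_lens plus a mask; the exact signature, tensor layout, and length construction are assumptions for illustration, not the PR's actual code:

```python
import paddle
from paddle.incubate.nn.functional import (
    variable_length_memory_efficient_attention,
)


def _mem_efficient_with_mask(query, key, value, attn_mask, scale):
    # Assumed layout: [batch, num_heads, seq_len, head_dim].
    batch_size = query.shape[0]
    q_seq_len = query.shape[2]
    kv_seq_len = key.shape[2]
    # "Flatten" the sequence lengths into explicit per-sample tensors,
    # since the variable-length kernel takes lengths as inputs instead
    # of inferring them from padding.
    seq_lens = paddle.full([batch_size], q_seq_len, dtype="int32")
    kv_seq_lens = paddle.full([batch_size], kv_seq_len, dtype="int32")
    return variable_length_memory_efficient_attention(
        query, key, value, seq_lens, kv_seq_lens,
        mask=attn_mask, scale=scale,
    )
```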
@@ -191,6 +306,54 @@ def _select_sdp(head_dim: int) -> str:
    return "mem_efficient"


def _select_sdp_for_sdpa(query, key, attn_mask, dropout, is_causal) -> str:
Is this compatible with the previous logic?
It is parallel to the previous logic; the flash path does not use this interface.
sdp_func_name = _select_sdp_for_sdpa(
    query, key, attn_mask, dropout_p, is_causal
)

if attn_mask is None:
    # downgraded to ordinary flash attention implementation
    out, _ = flash_attention(query, key, value, dropout_p, is_causal)
This directly calls the flash_attention API above, which re-runs sdp_func_name = _select_sdp(head_dim) and resets the backend. Does that conflict with the _select_sdp_for_sdpa backend-selection logic above?
My understanding of the logic here is:
- If the input has no mask, it goes through the flash path below, and _select_sdp ultimately decides which algorithm is used, so the mask-free case is fully aligned with the previous version.
In other words, once _select_sdp is called again, the earlier choice made by _select_sdp_for_sdpa is overridden. That is the expected behavior.
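For reference, a rough sketch of the dispatch being discussed (the selector call is left as a comment because _select_sdp_for_sdpa is defined in this PR; the masked branches are elided):

```python
from paddle.nn.functional.flash_attention import flash_attention


def sdpa_dispatch_sketch(query, key, value, attn_mask=None,
                         dropout_p=0.0, is_causal=False):
    # In the PR, a backend is first chosen via the new selector:
    #     sdp_func_name = _select_sdp_for_sdpa(
    #         query, key, attn_mask, dropout_p, is_causal
    #     )
    if attn_mask is None:
        # Mask-free input is downgraded to the ordinary flash_attention
        # API. flash_attention internally re-runs _select_sdp(head_dim),
        # so the earlier selection is overridden and the behavior matches
        # the pre-PR version exactly.
        out, _ = flash_attention(query, key, value, dropout_p, is_causal)
        return out
    # Masked input follows the backend chosen by _select_sdp_for_sdpa
    # ("flash" / "mem_efficient" / "math"), as shown in the diff above.
    raise NotImplementedError("masked branches elided in this sketch")
```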
LGTM
@yinfan98 My understanding is:
- The mask-free path is exactly the same as before.
- The masked path uses the new _select_sdp_for_sdpa; is that compatible with the previous behavior?
LGTM
PR Category
User Experience
PR Types
Improvements
Description
Modify the Paddle SDPA code to support choosing among the math, mem_efficient, and flash backends, aligning with the way PyTorch's selection code works.
P.S. Please help review which unit tests still need to be added.
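As a rough illustration of the PyTorch-style priority selection described above (the thresholds and dtype checks are assumptions for illustration; the PR's actual _select_sdp_for_sdpa may use different criteria):

```python
import paddle


def _select_sdp_for_sdpa_sketch(query, key, attn_mask, dropout, is_causal):
    # Try the backends in priority order: flash first, then
    # mem_efficient, and finally the math fallback.
    head_dim = query.shape[-1]
    dtype = query.dtype

    def can_use_flash():
        # Flash kernels typically require fp16/bf16 and a bounded
        # head dim; the exact limits here are illustrative only.
        return dtype in (paddle.float16, paddle.bfloat16) and head_dim <= 256

    def can_use_mem_efficient():
        # The memory-efficient kernel is more permissive on dtype but
        # still bounded in head dim (illustrative threshold).
        return head_dim <= 128

    if can_use_flash():
        return "flash"
    if can_use_mem_efficient():
        return "mem_efficient"
    return "math"
```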