support flash attention with sparse mask #62029
Conversation
Your PR has been submitted successfully. Thank you for contributing to this open-source project!
Sorry to inform you that the CIs for 17ed1c6 have been passing for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
17ed1c6 to f2ae287 (Compare)
Updated the documentation to conform to the style guidelines.
LGTM for docs
LGTM for YAML
@@ -859,6 +859,17 @@
    func : flash_attn_unpadded_grad
    data_type: q

- backward_op : flash_attn_with_sparse_mask_grad
  forward : flash_attn_with_sparse_mask (Tensor q, Tensor k, Tensor v, Tensor attn_mask_start_row_indices, Tensor fixed_seed_offset, float dropout = 0.0, bool causal = false, int attn_mask_start_row = 0, bool return_softmax = false, bool is_test = false, str rng_name = "") -> Tensor(out), Tensor(softmax), Tensor(softmax_lse), Tensor(seed_offset)
  args : (Tensor q, Tensor k, Tensor v, Tensor attn_mask_start_row_indices, Tensor out, Tensor softmax_lse, Tensor seed_offset, Tensor out_grad, float dropout = 0.0, bool causal = false, int attn_mask_start_row = 0)
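To make the YAML signature above concrete, here is a minimal sketch (not taken from this PR) of the dense mask that `attn_mask_start_row_indices` appears to encode: for each key column j, score-matrix rows at or below `start_row_indices[..., j]` are masked out. This reading of the semantics is an assumption inferred from the parameter names; the actual fused kernel never materializes such a dense tensor.

```python
import paddle

def dense_mask_from_start_rows(attn_mask_start_row_indices, seq_len_q):
    """Expand per-column start-row indices into a dense additive mask.

    attn_mask_start_row_indices: int32 tensor of shape
    [batch, num_heads, seq_len_k]; for key column j, query rows
    i >= attn_mask_start_row_indices[..., j] are assumed to be masked.
    Returns a float32 mask [batch, num_heads, seq_len_q, seq_len_k]
    holding 0 for visible positions and -inf for masked ones.
    """
    row_ids = paddle.arange(seq_len_q, dtype="int32").reshape(
        [1, 1, seq_len_q, 1]
    )
    starts = attn_mask_start_row_indices.unsqueeze(2)  # [b, h, 1, sk]
    blocked = row_ids >= starts  # broadcasts to [b, h, sq, sk]
    neg_inf = paddle.full(blocked.shape, float("-inf"), dtype="float32")
    return paddle.where(blocked, neg_inf, paddle.zeros(blocked.shape, "float32"))
```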
If this operator is to support auto parallel later, please add the corresponding sharding (spmd) inference rules.
OK, will do.
The API is fine; please attach the Chinese documentation PR link in the description above.
Added the corresponding Chinese documentation: PaddlePaddle/docs#6554
LGTM for API
* add flash attention with sparse mask
* fix doc
* Update python/paddle/nn/functional/flash_attention.py
* Update python/paddle/nn/functional/flash_attention.py
* Update python/paddle/nn/functional/flash_attention.py
* Update python/paddle/nn/functional/flash_attention.py
* fix docstring

---------

Co-authored-by: zachary sun <70642955+sunzhongkai588@users.noreply.github.com>
Co-authored-by: zachary sun <sunzhongkai@baidu.com>
This reverts commit e05764a.
PR types
New features
PR changes
APIs
Description
support flash attention with sparse mask
Pcard-73145
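For reference, a hedged usage sketch of the Python API this PR adds. The entry point lives in python/paddle/nn/functional/flash_attention.py, but treat the exact keyword names (`attn_mask_start_row_indices`, `attn_mask_start_row`, `dropout_p`, `is_causal`) as assumptions inferred from the YAML signature above and sibling flash-attention APIs; running it requires a GPU build of Paddle with flash-attention support.

```python
# Hedged sketch: argument names are assumptions inferred from the YAML
# signature; requires a GPU build of PaddlePaddle with flash-attention.
import paddle
from paddle.nn.functional.flash_attention import flash_attn_with_sparse_mask

batch, seq_len, num_heads, head_dim = 2, 128, 8, 64
q = paddle.randn([batch, seq_len, num_heads, head_dim], dtype="float16")
k = paddle.randn([batch, seq_len, num_heads, head_dim], dtype="float16")
v = paddle.randn([batch, seq_len, num_heads, head_dim], dtype="float16")

# Start every column's mask past the last row, i.e. mask nothing beyond
# what is_causal already removes.
start_rows = paddle.full([batch, num_heads, seq_len], seq_len, dtype="int32")

out = flash_attn_with_sparse_mask(
    q,
    k,
    v,
    attn_mask_start_row_indices=start_rows,
    attn_mask_start_row=0,
    dropout_p=0.0,
    is_causal=True,
)
print(out.shape)  # expected: [2, 128, 8, 64]
```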