[Intel GPU] Avoid atomic add for XPU device in satter_add by deterministic mode #137966

PenghuiCheng · 2024-10-15T06:22:48Z

The "scatter_add" op has no deterministic implementation on the XPU device. When deterministic mode is enabled, the UT reports that "scatter_add_kernel" does not have a deterministic implementation.

As in the CUDA implementation, the scatter_add op needs to check _deterministic_algorithms for the XPU device.
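To illustrate why an atomic-add accumulation is not deterministic, here is a minimal pure-Python sketch. It is not the code in this PR and does not use PyTorch; the helper names `scatter_add_in_order` and `deterministic_scatter_add` are hypothetical. It assumes that atomic adds may commit in any order, and shows that floating-point results can then differ from run to run, whereas fixing the accumulation order gives a stable result.

```python
def scatter_add_in_order(dest, index, src, order):
    """Add src[i] into dest[index[i]], visiting positions i in `order`."""
    out = list(dest)
    for i in order:
        out[index[i]] += src[i]
    return out


def deterministic_scatter_add(dest, index, src):
    """Fix the accumulation order: by target slot, then by source position."""
    order = sorted(range(len(src)), key=lambda i: (index[i], i))
    return scatter_add_in_order(dest, index, src, order)


dest = [0.0, 0.0]
index = [0, 0, 0, 1]
src = [1e17, 1.0, -1e17, 2.5]

# Two possible "arrival orders" for the same inputs, as atomics might produce.
print(scatter_add_in_order(dest, index, src, [0, 1, 2, 3]))  # [0.0, 2.5]
print(scatter_add_in_order(dest, index, src, [0, 2, 1, 3]))  # [1.0, 2.5]

# The fixed-order version gives the same answer on every call.
print(deterministic_scatter_add(dest, index, src))  # [0.0, 2.5]
```

The two orders disagree because floating-point addition is not associative: adding 1.0 to 1e17 is rounded away, while adding it after the large terms cancel is not.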

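The UT is in: https://github.com/intel/torch-xpu-ops/blob/main/test/xpu/test_scatter_gather_ops_xpu.py. It reuses the [PyTorch UT code](https://github.com/pytorch/pytorch/blob/96b30dcb25c80513769dae2a8688aec080b00117/test/test_scatter_gather_ops.py#L233).
The UT case is currently [skipped in the torch-xpu-ops tests](https://github.com/intel/torch-xpu-ops/blob/4fa7921f1e9a0bf300d25da9b8758524f2751092/test/xpu/skip_list_common.py#L731) and will be re-enabled once this PR is merged.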

pytorch-bot · 2024-10-15T06:22:53Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137966

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 1b26d16 with merge base 565a794:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2024-10-15T06:22:54Z

The committers listed above are authorized under a signed CLA.

✅ login: PenghuiCheng / name: Cheng, Penghui (30cd402, 1b26d16)

EikanWang · 2024-10-15T14:19:16Z

May I know what's the impact? Does it lead to any case failure?

pytorch-bot · 2024-10-15T14:19:39Z

Please seek CI approval before scheduling CIFlow labels

EikanWang · 2024-10-15T14:19:51Z

Please add test cases.

PenghuiCheng · 2024-10-18T01:36:12Z

> May I know what's the impact? Does it lead to any case failure?

Yes, it fails the following cases in the UT at https://github.com/intel/torch-xpu-ops/blob/main/test/xpu/test_scatter_gather_ops_xpu.py:
"test_scatter_reduce_mean_xpu_bfloat16",
"test_scatter_reduce_mean_xpu_float16",
"test_scatter_reduce_mean_xpu_float32",
"test_scatter_reduce_mean_xpu_float64",
"test_scatter_reduce_mean_xpu_int16",
"test_scatter_reduce_mean_xpu_int32",
"test_scatter_reduce_mean_xpu_int64",
"test_scatter_reduce_mean_xpu_int8",
"test_scatter_reduce_mean_xpu_uint8"

Signed-off-by: Cheng Penghui <penghui.cheng@intel.com>

ezyang · 2024-11-13T03:33:04Z

@pytorchbot merge

pytorchmergebot · 2024-11-13T03:34:53Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

…istic mode (pytorch#137966)

The "scatter_add" op with the deterministic mode in XPU device is not implemented, it will report that "scatter_add_kernel" does not have a deterministic implementation in UT.

Just like the implementation of CUDA, we need to check _deterministic_algorithms in scatter_add op for the XPU device.

The UT is in: https://github.com/intel/torch-xpu-ops/blob/main/test/xpu/test_scatter_gather_ops_xpu.py. We reused [PyTorch UT code](https://github.com/pytorch/pytorch/blob/96b30dcb25c80513769dae2a8688aec080b00117/test/test_scatter_gather_ops.py#L233). Now the UT case is [skipped in torch-xpu-ops test](https://github.com/intel/torch-xpu-ops/blob/4fa7921f1e9a0bf300d25da9b8758524f2751092/test/xpu/skip_list_common.py#L731). Will open it when this PR is merged.

Pull Request resolved: pytorch#137966
Approved by: https://github.com/EikanWang, https://github.com/guangyey, https://github.com/ezyang

pytorchbot added the open source label Oct 15, 2024

EikanWang marked this pull request as draft October 15, 2024 07:13

EikanWang changed the title ~~avoid gpuAtomicAdd for XPU device for satter_add by deterministic mode~~ [WIP] avoid gpuAtomicAdd for XPU device for satter_add by deterministic mode Oct 15, 2024

EikanWang added topic: not user facing topic category ciflow/xpu Run XPU CI tasks labels Oct 15, 2024

pytorch-bot bot removed the ciflow/xpu Run XPU CI tasks label Oct 15, 2024

PenghuiCheng changed the title ~~[WIP] avoid gpuAtomicAdd for XPU device for satter_add by deterministic mode~~ [WIP] avoid atomic add for XPU device in satter_add by deterministic mode Oct 16, 2024

PenghuiCheng mentioned this pull request Oct 16, 2024

[Release/2.5.0] UT failures intel/torch-xpu-ops#899

Closed

4 tasks

EikanWang added the ciflow/xpu Run XPU CI tasks label Oct 24, 2024

PenghuiCheng marked this pull request as ready for review November 5, 2024 07:08

PenghuiCheng force-pushed the penghuic/scatter_add_deterministic branch 2 times, most recently from 3d7a311 to fc26d80 Compare November 7, 2024 08:08

PenghuiCheng added 2 commits November 11, 2024 02:30

avoid gpuAtomicAdd for XPU device for satter_add by deterministic mode

30cd402

Signed-off-by: Cheng Penghui <penghui.cheng@intel.com>

Update TensorAdvancedIndexing.cpp

1b26d16

PenghuiCheng force-pushed the penghuic/scatter_add_deterministic branch from fc26d80 to 1b26d16 Compare November 11, 2024 02:31

EikanWang changed the title ~~[WIP] avoid atomic add for XPU device in satter_add by deterministic mode~~ [Intel GPU] Avoid atomic add for XPU device in satter_add by deterministic mode Nov 11, 2024

EikanWang requested a review from malfet November 11, 2024 07:08

EikanWang approved these changes Nov 11, 2024

EikanWang requested a review from atalman November 11, 2024 07:08

guangyey approved these changes Nov 12, 2024

guangyey requested a review from ezyang November 12, 2024 06:13

guangyey added this to the 2.6.0 milestone Nov 12, 2024

ezyang approved these changes Nov 13, 2024

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 13, 2024

pytorchmergebot added the merging label Nov 13, 2024

pytorchmergebot added the Merged label Nov 13, 2024

pytorchmergebot closed this in 5b1c67c Nov 13, 2024

pytorchmergebot removed the merging label Nov 13, 2024

atalman mentioned this pull request Jan 13, 2025

Release 2.6.0 validations checklist and cherry-picks #144503

Closed

73 tasks
