Add CI for Triton CPU backend #135342

int3 · 2024-09-06T09:36:57Z

Stack from ghstack (oldest at bottom):

Where possible, I have marked failing tests (which we intend to fix or triage) as @xfail_if_triton_cpu. This will help us track progress of the Triton CPU backend over time. Tests that I don't think we need to address, or that are flaky, have been marked as skips.
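To illustrate the difference between the two markings, here is a minimal sketch built only on the standard `unittest` module. It is not the helper added in this PR: the name `xfail_if_triton_cpu_sketch`, the `USING_TRITON_CPU` flag, and the example tests are all hypothetical stand-ins.

```python
import unittest

# Hypothetical flag for this sketch; the real test suite detects the
# backend in its own way.
USING_TRITON_CPU = True


def xfail_if_triton_cpu_sketch(test_fn):
    """Mark test_fn as an expected failure only when Triton CPU is active."""
    if USING_TRITON_CPU:
        return unittest.expectedFailure(test_fn)
    return test_fn


class ExampleTests(unittest.TestCase):
    @xfail_if_triton_cpu_sketch
    def test_known_failure(self):
        # Stands in for a test that currently fails on Triton CPU.
        self.assertEqual(1, 2)

    @unittest.skipIf(USING_TRITON_CPU, "flaky or out of scope on Triton CPU")
    def test_skipped(self):
        # Never executed while the flag is set.
        self.assertTrue(True)


def run_example():
    """Run the example tests and return the unittest.TestResult."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(ExampleTests)
    result = unittest.TestResult()
    suite.run(result)
    return result
```

An expected failure still runs the test body, so once the backend is fixed the test is reported as an unexpected success and the marker can be removed. A skipped test never runs, so it gives no such signal.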

Successful CI run: https://github.com/pytorch/pytorch/actions/runs/10822238062/job/30028284549

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang

[ghstack-poisoned]

pytorch-bot · 2024-09-06T09:37:00Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/135342

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 9b357a3 with merge base 6966811 ():

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

inductor-periodic / cuda12.1-py3.10-gcc9-sm80 / test (inductor_torchbench_smoketest_perf, 1, 1, linux.gcp.a100) (gh) (similar failure)
moco
pull / linux-focal-py3.12-clang10 / test (dynamo, 3, 3, lf.linux.2xlarge) (gh) (disabled by #134602)
test_transformers.py::TestSDPAPrivateUse1Only::test_scaled_dot_product_fused_attention_overrideable_backward

This comment was automatically generated by Dr. CI and updates every 15 minutes.

malfet · 2024-09-26T00:23:45Z

.ci/docker/build.sh

    TRITON=yes
    ;;
+  pytorch-linux-jammy-py3.12-triton-cpu)
+    CUDA_VERSION=12.4


You probably don't want CUDA here, do you?

Suggested change

CUDA_VERSION=12.4

int3 · 2024-09-27

Removing it seems to have caused an error due to libmpi_cxx not being found. I think libmpi_cxx got installed as part of the CUDA PyTorch install. I guess I could figure out how to install it independently, but just installing CUDA seems easier.

int3 · 2024-09-27T13:52:05Z

@pytorchbot merge

pytorchmergebot · 2024-09-27T13:54:01Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

pytorchmergebot · 2024-09-27T16:39:29Z

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
For more information, see the pytorch-bot wiki.

int3 · 2024-10-01T20:41:17Z

@pytorchbot merge

pytorchmergebot · 2024-10-01T20:42:58Z

Merge started

Where possible, I have marked failing tests (which we intend to fix or triage) as `@xfail_if_triton_cpu`. This will help us track progress of the Triton CPU backend over time. Tests that I don't think we need to address, or that are flaky, have been marked as skips.

Successful CI run: https://github.com/pytorch/pytorch/actions/runs/10822238062/job/30028284549

Pull Request resolved: pytorch#135342
Approved by: https://github.com/jansel, https://github.com/desertfire, https://github.com/malfet

Reducing block sizes when using the Triton CPU backend greatly reduces compile time; TorchBench models that were previously 50-100x slower (vs the cpp backend) are now ~20x slower. More work needs to be done on the Triton side, but smaller block sizes will still be helpful.

Pull Request resolved: #136612
Approved by: https://github.com/desertfire
ghstack dependencies: #135342
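As a rough illustration of the idea of using smaller block sizes on CPU, the sketch below clamps a set of block sizes to a cap when the CPU backend is the target. This is not the change made in #136612: the function name, the `is_triton_cpu` parameter, the block names, and the default cap of 64 are all assumptions chosen for the example.

```python
def cap_block_sizes(block_sizes, is_triton_cpu, cpu_cap=64):
    """Return a copy of block_sizes, clamped to cpu_cap for Triton CPU.

    Hypothetical helper: names and the default cap are illustrative only.
    """
    if not is_triton_cpu:
        # Other backends keep their block sizes unchanged.
        return dict(block_sizes)
    return {name: min(size, cpu_cap) for name, size in block_sizes.items()}
```

For example, `cap_block_sizes({"XBLOCK": 1024, "RBLOCK": 8}, is_triton_cpu=True)` leaves `RBLOCK` at 8 and reduces `XBLOCK` to the cap.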

Add CI for Triton CPU backend

8c93af9

[ghstack-poisoned]

int3 requested review from a team and jeffdaily as code owners September 6, 2024 09:36

int3 mentioned this pull request Sep 6, 2024

Add Triton CPU as an Inductor backend #133408

Closed

pytorch-bot bot added ciflow/inductor module: inductor release notes: releng release notes category labels Sep 6, 2024

Update on "Add CI for Triton CPU backend"

8707ab2

int3 marked this pull request as draft September 6, 2024 10:47

Update on "Add CI for Triton CPU backend"

da29589

Update on "Add CI for Triton CPU backend"

d4d0cfd

int3 added a commit that referenced this pull request Sep 6, 2024

Add CI for Triton CPU backend

9da15f7

ghstack-source-id: 2a1a5cf Pull Request resolved: #135342

Update on "Add CI for Triton CPU backend"

60e25a6

Update on "Add CI for Triton CPU backend"

e832649

int3 added a commit that referenced this pull request Sep 10, 2024

Add CI for Triton CPU backend

caf2418

ghstack-source-id: 05a8acf Pull Request resolved: #135342

Update on "Add CI for Triton CPU backend"

df80358

int3 added a commit that referenced this pull request Sep 10, 2024

Add CI for Triton CPU backend

9c3702d

ghstack-source-id: 6c47c98 Pull Request resolved: #135342

Update on "Add CI for Triton CPU backend"

267dcd2

Update on "Add CI for Triton CPU backend"

8b89596

Update on "Add CI for Triton CPU backend"

58cecc0

Update on "Add CI for Triton CPU backend"

a62d606

int3 mentioned this pull request Sep 25, 2024

[inductor] Reduce block sizes when using Triton CPU backend #136612

Closed

int3 added 2 commits September 25, 2024 01:56

malfet approved these changes Sep 26, 2024

pytorchbot mentioned this pull request Sep 26, 2024

Make test_skip_data_serialization regex more flexible #136710

Merged

int3 added a commit that referenced this pull request Sep 27, 2024

Add CI for Triton CPU backend

705e67a

ghstack-source-id: 1c7567a Pull Request resolved: #135342

pytorchmergebot added the merging label Sep 27, 2024

int3 added 3 commits September 30, 2024 20:16

pytorchmergebot closed this in 99eb47f Oct 1, 2024

pytorchmergebot removed the merging label Oct 1, 2024

int3 mentioned this pull request Oct 3, 2024

Have Triton CPU backend respect max_autotune setting #137276

Closed

github-actions bot deleted the gh/int3/101/head branch November 3, 2024 02:14

5 participants
