CARVIEW |
Navigation Menu
-
Notifications
You must be signed in to change notification settings - Fork 24.7k
[PyTorch] Port ExecuTorch bfdot improvement back to ATen BlasKernel, Try #2 #137377
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
β¦Try #2 ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/) [ghstack-poisoned]
π Helpful Linksπ§ͺ See artifacts and rendered test results at hud.pytorch.org/pr/137377
Note: Links to docs will display an error until the docs builds have been completed. β No FailuresAs of commit fcb5288 with merge base de4c2a3 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This pull request was exported from Phabricator. Differential Revision: D63923166 |
β¦Try #2 ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/) ghstack-source-id: 246411194 Pull Request resolved: #137377
@pytorchbot label "ciflow/linux-aarch64" |
β¦lasKernel, Try #2" ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D63923166 |
β¦lasKernel, Try #2" ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D63923166 |
β¦Try #2 Pull Request resolved: #137377 ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . ghstack-source-id: 246616406 Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/)
What's the difference with the previous attempt? |
aten/src/ATen/native/BlasKernel.cpp
Outdated
} | ||
|
||
// NOTE: The first attempt at landing BFDOT support with |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@albanD here is the difference
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ho thanks for the pointer!
Should we undef DOT_WITH_FP32_ARITH_TAIL_AFTER_MAIN_LOOP_BODY ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is a .cpp file, so the macro isn't going to leak anywhere and it's not particularly necessary, but sure I can do that.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
β¦lasKernel, Try #2" ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D63923166 |
β¦ back to ATen BlasKernel, Try #2" ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D63923166 |
β¦Ten BlasKernel, Try #2" ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D63923166 |
β¦Try #2 Pull Request resolved: #137377 ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . ghstack-source-id: 246956192 Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/)
CI is 100% green; please review |
β¦lasKernel, Try #2" ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 . Differential Revision: [D63923166](https://our.internmc.facebook.com/intern/diff/D63923166/) [ghstack-poisoned]
This pull request was exported from Phabricator. Differential Revision: D63923166 |
If CI is green, than sure, LGTM |
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Stack from ghstack (oldest at bottom):
defined(__aarch64__) && !defined(CPU_CAPABILITY_SVE256)
instead ofdefined(CPU_CAPABILITY_NEON)
Β #137722ExecuTorch's fork of BlasKernel.cpp grew bfdot support, complete with demonstration that it helps. Port it back to PyTorch. First attempt was #136331 .
Differential Revision: D63923166