CARVIEW |
Select Language
HTTP/2 200
date: Wed, 15 Oct 2025 06:15:55 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"db0a701188957191670157a1bdcb2134"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com github.githubassets.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com wss://alive-staging.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com marketplace-screenshots.githubusercontent.com/ copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=xkLZFNMIDEK%2FQ41LYOGT3gzsBsi%2BrxXgcQhWSn8j3yn9e8kPuVDSAQeUMppbDqPTMktqaRKq2mcMrVEaggrUbV4sw5SSC1Dk0rP74kRW9QL%2BPytMucC%2B43tuvvvXrYh%2BWir5FHHyv68GMsmdjXIHDfzZBoYKcqbxtdFxk7KwtzDCxhx0s%2F4asYOKb9%2FxJQga0bYmG1WFKM5aN8rn2FYsErNhSJlubJp69M%2BCPhDufB7dyDczOOKiJ1WnTZNVzuyAHq1ZI5HbyfqF75UYloq4%2BA%3D%3D--Tyl6Xqo7t3%2BSru04--zTtYqRwxxkukHBaZPr8a9A%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1044910783.1760508954; Path=/; Domain=github.com; Expires=Thu, 15 Oct 2026 06:15:54 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Thu, 15 Oct 2026 06:15:54 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: 9DBC:50661:140BD0D:1818936:68EF3C1A
Tags · pytorch/pytorch · GitHub
Toggle trunk/36871622f1061ff5b4e1458274659b9138835b19's commit message
Toggle trunk/3915898c22472cbde83ba437bd6580b504a92db2's commit message
Toggle trunk/3681312ce03e425e280a110df2153db107616a15's commit message
Toggle trunk/47524dcc4839548431e06dbe036faf752509001a's commit message
Toggle trunk/7778a58e7c3a9dfca8c4fa00d936581e7549d918's commit message
Toggle trunk/3044e1a460a2ae71a95e77d9ac0c33d3e8294e85's commit message
Toggle trunk/839f6facdba92f8fe90cbd50721ff9a025474969's commit message
Toggle trunk/132ae8e6dd5e1a206dfb330eb7c94555f6eaaf9e's commit message
Toggle trunk/59d30d1b75849f21fe86f0b3244b2306abef4cb9's commit message
Toggle trunk/9ffba8a2f98b10d2f33a414ec2c68bc8abb01106's commit message
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 25.6k
Tags: pytorch/pytorch
Tags
trunk/36871622f1061ff5b4e1458274659b9138835b19
[2/N] Mark unused parameters in C++ code (#165121) This is follow-up of #164912 to mark unused C++ parameters to improve code readability. Pull Request resolved: #165121 Approved by: https://github.com/Skylion007
trunk/3915898c22472cbde83ba437bd6580b504a92db2
[audio hash update] update the pinned audio hash (#165495) This PR is auto-generated nightly by [this action](https://github.com/pytorch/pytorch/blob/main/.github/workflows/nightly.yml). Update the pinned audio hash. Pull Request resolved: #165495 Approved by: https://github.com/pytorchbot
trunk/3681312ce03e425e280a110df2153db107616a15
varlen api (#164502) **Summary** Today, the only way to have variable sequence length support in PyTorch attention is through nested tensors [here](https://docs.pytorch.org/tutorials/intermediate/scaled_dot_product_attention_tutorial.html#nestedtensor-and-dense-tensor-support). We also want to add an explicit lower-level API that provides variable sequence length support without padding/masking in SDPA. This PR builds out `varlen_attn`, the public API that users can call for the forward method, and `_varlen_attn`, the private API that calls into the Flash Attention/cuDNN backend. **Benchmarking** To benchmark, we compare runtime and TFLOPs against the current SDPA approach with padding. Settings: - 1 H100 machine - `batch_size=8`, `max_seq_len=2048`, `embed_dim=1024`, `num_heads=16` - dtype `torch.bfloat16` - `is_causal=False` - for variable length, we set sequences to be random multiples of 64 up to `max_seq_len` - 100 runs | | Variable Length API | SDPA | |--------|--------------------|----------| | Runtime | 0.21750560760498047 ms | 0.43171775817871094 ms | | TFLOPs | 231.812 | 320.840 | The sparsity is 0.453 which we can see matches the speedup we get from Varlen (approx 50%). TFLOPs remains around the same, with SDPA slightly larger due to potential higher overhead and total flops scaling with sequence length. **Testing** Run `python test/test_varlen_attention.py` for unit tests where we verify basic functionality and confirm numerical match between varlen outputs vs SDPA. **Next steps** Next steps from this PR (higher in the stack) include registering the private API `_varlen_attn` as a custom op, implementing backward support, and enabling cuDNN with correct numerics. (This stack builds on top of #162326) Pull Request resolved: #164502 Approved by: https://github.com/v0i0, https://github.com/drisspg
trunk/47524dcc4839548431e06dbe036faf752509001a
[benchmark] Add more timm models (#165381) Added following models to timm_models - [convnextv2_nano.fcmae_ft_in22k_in1k](https://huggingface.co/timm/convnextv2_nano.fcmae_ft_in22k_in1k) - [vit_base_patch14_dinov2.lvd142m](https://huggingface.co/timm/vit_base_patch14_dinov2.lvd142m) - [ViT-B-16-SigLIP-i18n-256](https://huggingface.co/timm/ViT-B-16-SigLIP-i18n-256) - [deit_tiny_patch16_224.fb_in1k](https://huggingface.co/timm/deit_tiny_patch16_224.fb_in1k) Pull Request resolved: #165381 Approved by: https://github.com/BoyuanFeng
trunk/7778a58e7c3a9dfca8c4fa00d936581e7549d918
Revert "[export] Handle kwargs better in aot_export_joint_with_descri… …ptors (#165334)" This reverts commit bbb902c. Reverted #165334 on behalf of https://github.com/jeffdaily due to trunk CI passed here but failures on HUD after merge? test/functorch/test_aot_joint_with_descriptors.py::TestAOTJointWithDescriptors::test_module_with_kwargs [GH job link](https://github.com/pytorch/pytorch/actions/runs/18511729262/job/52755708742) [HUD commit link](https://hud.pytorch.org/pytorch/pytorch/commit/bbb902c8dd911e1587253f496c1e2fb178d4b6a1) ([comment](#165334 (comment)))
trunk/3044e1a460a2ae71a95e77d9ac0c33d3e8294e85
Revert "varlen api (#164502)" This reverts commit 3681312. Reverted #164502 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but the doctests failure is legit ([comment](#164502 (comment)))
trunk/839f6facdba92f8fe90cbd50721ff9a025474969
[precompile] Fix frame construction for wrapped model. (#165454) Summary: If a function is wrapped with functools, we should not look at the wrapped function signature but rather the wrapper, since we need to construct the frame for the top level function here. Test Plan: test_decorated_function_with_functools_wrap_aot Differential Revision: D84626752 Pull Request resolved: #165454 Approved by: https://github.com/yiming0416
trunk/132ae8e6dd5e1a206dfb330eb7c94555f6eaaf9e
Don't link with libnvToolsExt when building for 12.9 (#165465) This is to bring back this logic from https://github.com/pytorch/pytorch/pull/161916/files#diff-bf46b4a09ca67e50622bf84fefc0d11b584ffcc24ee6cc5019cf0fc7565d81a8L170. Building libtorch on 12.9 is failing otherwise https://github.com/pytorch/pytorch/actions/runs/18458531395/job/52610761895: ``` cp: cannot stat '/usr/local/cuda/lib64/libnvToolsExt.so.1': No such file or directory ``` Pull Request resolved: #165465 Approved by: https://github.com/atalman, https://github.com/malfet
trunk/59d30d1b75849f21fe86f0b3244b2306abef4cb9
[vision hash update] update the pinned vision hash (#165496) This PR is auto-generated nightly by [this action](https://github.com/pytorch/pytorch/blob/main/.github/workflows/nightly.yml). Update the pinned vision hash. Pull Request resolved: #165496 Approved by: https://github.com/pytorchbot
trunk/9ffba8a2f98b10d2f33a414ec2c68bc8abb01106
fixing stress test failure (#164353) Summary: This diff fixes a stress test failure by adding a new binary echo4.py and modifying the existing echo1.py binary. The changes are made in both fbcode and xplat directories. The api_test.py file is updated to use the new echo4.py binary, and the BUCK file is updated to include the new binary. Test Plan: ``` buck test -j 18 'fbcode//mode/opt' fbcode//caffe2/test/distributed/elastic/multiprocessing:api_test -- --exact 'caffe2/test/distributed/elastic/multiprocessing:api_test - test_binary_redirect_and_tee (api_test.StartProcessesListAsBinaryTest)' --run-disabled --stress-runs 20 --record-results ``` ``` buck test -j 18 'fbcode//mode/opt' fbcode//caffe2/test/distributed/elastic/multiprocessing:api_test -- --exact 'caffe2/test/distributed/elastic/multiprocessing:api_test - test_binary (api_test.StartProcessesListAsBinaryTest)' --run-disabled --stress-runs 20 --record-results ``` https://www.internalfb.com/intern/testinfra/testrun/17732923648474906 https://www.internalfb.com/intern/testinfra/testrun/15481123834815653 Differential Revision: D83623694 Pull Request resolved: #164353 Approved by: https://github.com/d4l3k
PreviousNext
You can’t perform that action at this time.