| CARVIEW |
Select Language
HTTP/2 301
server: GitHub.com
content-type: text/html
location: https://estija.github.io/publications/
access-control-allow-origin: *
strict-transport-security: max-age=31556952
expires: Tue, 30 Dec 2025 17:36:01 GMT
cache-control: max-age=600
x-proxy-cache: MISS
x-github-request-id: F58E:3FD64F:A5E19D:BA34B3:69540B29
accept-ranges: bytes
age: 0
date: Tue, 30 Dec 2025 17:26:01 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210087-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767115562.587879,VS0,VE226
vary: Accept-Encoding
x-fastly-request-id: 48721c67dd843ca8190c2fa047c51f67709f14b5
content-length: 162
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Wed, 29 Oct 2025 18:29:13 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"69025cf9-2825"
expires: Tue, 30 Dec 2025 17:36:01 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 2B4C:272D88:A612A6:BA616D:69540B29
accept-ranges: bytes
age: 0
date: Tue, 30 Dec 2025 17:26:02 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210087-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767115562.827669,VS0,VE221
vary: Accept-Encoding
x-fastly-request-id: 89a337df8eca430a1300c481e2fda3d3e921f40a
content-length: 3201
Publications - Bhavya Vasudeva
Publications
Complete list of papers on Google Scholar. * denotes equal contribution.
How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data
Bhavya Vasudeva, Puneesh Deora, Yize Zhao, Vatsal Sharan, Christos Thrampoulidis
Submitted
Latent Concept Disentanglement in Transformer-based Language Models
Guanzhe Hong*, Bhavya Vasudeva*, Vatsal Sharan, Cyrus Rashtchian, Prabhakar Raghavan, Rina Panigrahy
Submitted
The Rich and the Simple: On the Implicit Bias of Adam and SGD
Bhavya Vasudeva, Jung Whan Lee, Vatsal Sharan, Mahdi Soltanolkotabi
NeurIPS 2025
In-Context Occam’s Razor: How Transformers Prefer Simpler Hypotheses on the Fly
Puneesh Deora, Bhavya Vasudeva, Tina Behnia, Christos Thrampoulidis
COLM 2025; MOSS Workshop at ICML 2025 (Oral)
Transformers Learn Low-Sensitivity Functions: Investigations and Implications
Bhavya Vasudeva*, Deqing Fu*, Tianyi Zhou, Elliott Kau, Youqi Huang, Vatsal Sharan
ICLR 2025
Implicit Bias and Fast Convergence Rates for Self-attention
Bhavya Vasudeva*, Puneesh Deora*, Christos Thrampoulidis
TMLR 2025
Mitigating Simplicity Bias in Deep Learning for Improved OOD Generalization and Robustness
Bhavya Vasudeva, Kameron Shahabi, Vatsal Sharan
TMLR 2024