| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
date: Sun, 28 Dec 2025 14:52:38 GMT
content-type: text/html; charset=utf-8
last-modified: Tue, 16 Jul 2024 06:51:13 GMT
vary: Accept-Encoding
access-control-allow-origin: *
etag: W/"66961861-c6a"
expires: Sun, 28 Dec 2025 15:02:38 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: AD86:1DE9E5:347A192:38ABCBE:69514436
Vaishaal Shankar
About
I am a research scientist with the Machine Learning Research group at Apple. I work on dataset design. I have been fortunate to be a part of the DataComp, ImageNetV2, and OpenCLIP projects.
Previously, I spent 9 wonderful years at UC Berkeley and had the pleasure of working with Ben Recht, Ludwig Schmidt, Eric Jonas, Shivaram Venkataraman, and many others.
Contact me at vs at vaishaal dot com.
Selected Publications
- DataComp-LM: In search of the next generation of training sets for language models
- Data Filtering Networks - ICLR 2024
- DataComp: In search of the next generation of multimodal datasets - Neurips 2023
- Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP) - ICML 2021
- Do ImageNet classifiers generalize to ImageNet? - ICML 2019