| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Thu, 19 Jun 2025 15:30:03 GMT
access-control-allow-origin: *
etag: W/"68542cfb-2550"
expires: Mon, 29 Dec 2025 01:08:56 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 1AB1:36A0B4:81EDA1:91F86C:6951D24F
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 00:58:56 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210077-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766969936.227630,VS0,VE204
vary: Accept-Encoding
x-fastly-request-id: 42bab872fbe6ece2e5fe25c821e6f16a1fad566d
content-length: 3127
Etash Guha
Hi, I'm Etash Guha
I'm a Ph.D. student at the University of Washington Computer Science and Engineering Department.
I research how to design and improve training data curation protocols for training large text and image models. This includes synthetic data generation, data filtering, and online data sampling. I am extremely fortunate to be advised by the amazing Professors Ludwig Schmidt and Yejin Choi. I'm graciously supported by the NSF Graduate Research Fellowship.
I was a researcher at SambaNova Systems working on the reliability of Large Language Models. Most recently, I was a research intern under Dr. Emtiyaz Khan on the Approximate Bayesian Inference Team at RIKEN AIP in Tokyo, Japan. I was both an undergraduate student and research assistant at Georgia Tech where I worked with Vidya Muthukumar, Ashwin Pananjady, Jacob Abernethy, and Xiaoming Huo.
I have worked with researchers, traders, and software engineers while working at SambaNova Systems, FORT LP, and SAS.