| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Tue, 23 Dec 2025 03:26:08 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"694a0bd0-139f"
expires: Mon, 29 Dec 2025 13:40:35 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 7932:2DDCFF:8DBFC8:9F199D:6952827B
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 13:30:35 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210050-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767015035.423050,VS0,VE220
vary: Accept-Encoding
x-fastly-request-id: 94e4057161819fc90c730dcb16b30b6eaa6d463c
content-length: 2046
Julian Quevedo's Homepage
Hello, I'm Julian!
I'm an undergrad at Stanford interested in large scale models of the human experience.
Most recently, I was a researcher at World Labs. Before that, I co-created Oasis, the first realtime world model with a playable demo. And prior to that, I optimized inference for the first era of large language models at Cohere and MosaicML.
Highlighted Work
![]() |
WorldGym: World Model as An Environment for Policy Evaluation Julian Quevedo, Ansh Kumar Sharma, Yixiang Sun, Varad Suryavanshi, Percy Liang, Sherry Yang [arxiv] [website] [code] |
![]() |
Real-Time Frame Model World Labs [blog] |
![]() |
Oasis: A Universe in a Transformer Decart & Quevedo, et al. [blog] [demo] [code] Press: TechCrunch |
![]() |
Serving Quantized LLMs on NVIDIA H100 Tensor Core GPUs Nikhil Sardana, Julian Quevedo, Daya Khudia [blog] |
![]() |
LLM inference performance engineering: Best practices Megha Agarwal, Asfandyar Qureshi, Nikhil Sardana, Linden Li, Julian Quevedo, Daya Khudia [blog] |
github | twitter | google scholar | linkedin
I would love to hear from you: julianq@stanford.edu





