| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Wed, 26 Nov 2025 06:19:29 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"69269bf1-1eda"
expires: Sat, 27 Dec 2025 08:51:39 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 92C5:2C10E1:625310:6E196D:694F9BC1
accept-ranges: bytes
age: 0
date: Sat, 27 Dec 2025 08:41:40 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210082-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766824900.870526,VS0,VE207
vary: Accept-Encoding
x-fastly-request-id: 78a6aa9389f2ab1fee325a9037a88cea8d1e2de9
content-length: 3278
Home - Yueming Yuan
Welcome!
š About me
Iām Yueming Yuan (č¢ę¦č), a second year CS PhD student at UIUC.
I have a broad interest in accurate and efficient techniques for LLM/VLM training, inference and reasoning.
⨠Research
I mainly worked on algorithm-system co-design for efficient LLM training/inference, for example:
- (1) Large-scale MoE pretraining, parallelism (~1k GPUs scale) [SC 2025, Best Student Paper]
- (2) MoE quantization, inference efficiency [MLSys 2025]
- (3) Efficient computation & code-generation for sparse attention. [OOPSLA 2025]
ā ā
About
Yueming Yuan
"Pinaster"
Contact
Email
WeChat ID: _Pinaster
Coordinates
University of Illinois Urbana-Champaign
Urbana, IL 61801
© 2025 Yueming Yuan
