HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Fri, 06 Sep 2024 03:20:36 GMT
access-control-allow-origin: *
etag: W/"66da7504-3a47"
expires: Mon, 29 Dec 2025 01:08:00 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 17DA:1F53DD:820B58:9215C7:6951D215
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 00:58:00 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210035-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766969881.719705,VS0,VE202
vary: Accept-Encoding
x-fastly-request-id: a68ac68bd140d42c8794bea3f254a18874879b6d
content-length: 3770
Xin Jin
Xin Jin
Associate Professor
School of Computer Science
Peking University
Email: xinjinpku (at) pku (dot) edu (dot) cn
I am an Associate Professor in the
School of Computer Science at
Peking University .
I work on computer systems and networking. My research has received
USENIX NSDI Best Paper Award (2018) and USENIX FAST Best Paper Award
(2019).
Research
I am broadly interested in computer systems and networking. My research
currently focuses on designing and building systems for cloud computing and large language models.
Current Projects
Serverless Computing
Training and Serving Large Language Models
Network Testing and Verification
Disaggregated Storage with RDMA and DPUs
Recent Publications (All Publications )
[SOSP 24] LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism
[OSDI 24] DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving
[OSDI 24] dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving
[OSDI 24] Burstable Cloud Block Storage with Data Processing Units
[NSDI 24] Jolteon: Unleashing the Promise of Serverless for Serverless Workflows
[NSDI 24] Fast Vector Query Processing for Large Datasets Beyond GPU Memory with Reordered Pipelining
[NSDI 24] MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
[SOSP 23] Halfmoon: Log-Optimal Fault-Tolerant Stateful Serverless Computing
[SOSP 23] Automated Verification of an In-Production DNS Authoritative Engine
[SOSP 23] Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates
[SIGCOMM 23] Ditto: Efficient Serverless Analytics with Elastic Parallelism
[SIGCOMM 23] Klotski: Efficient and Safe Network Migration of Large Production Datacenters
[SIGCOMM 23] Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina
[SIGCOMM 23] XRON: A Hybrid Elastic Cloud Overlay Network for Video Conferencing at Planetary Scale
[OSDI 23] AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
[NSDI 23] Transparent GPU Sharing in Container Clouds for Deep Learning Workloads
[NSDI 23] Fast, Approximate Vector Queries on Very Large Unstructured Datasets