| Jan 31, 2026 |
📜EuroSys 2026: Suika, a cluster training system that supports efficient and high-quality rescheduling for 3D-parallelized LLM training jobs, is accepted to EuroSys 2026! See our paper “Suika: Efficient and High-quality Rescheduling of 3D-parallelized LLM Training Jobs in Shared Clusters” for details.
|
| Nov 21, 2025 |
♻️AI for Good Global Summit: I presented “China Telecom drives ubiquitous intelligence through AI Flow”. Positioned at the intersection of AI and communications infrastructure, AI Flow aims to enable ubiquitous intelligence by bridging the gap between devices, edge computing, and the cloud.
|
| Aug 23, 2025 |
📜EuroSys 2026: GRouter, a GPU-centric data plane system designed for serverless inference workflows, is accepted to EuroSys 2026! See our paper “Efficient Data Passing for Serverless Inference Workflows: A GPU-Centric Approach” for details.
|
| Jun 15, 2025 |
♻️Invited Keynote Speaker at AI for Good Global Summit: I will be delivering a keynote speech on AI solutions at China Telecom at the AI for Good Global Summit, held 8-11 July in Geneva and hosted by the ITU of the United Nations. Join us as we discuss how AI can shape a sustainable future!
|
| Apr 1, 2025 |
📜USENIX ATC 2025: Toppings, an efficient multi-tenant system that serves many LoRA adapters with a common base LLM, is accepted to USENIX ATC 2025! See our paper “Toppings: CPU-Assisted, Rank-Aware Adapter Serving for LLM Inference” for details.
|
| Apr 1, 2025 |
📜USENIX NSDI 2025: Prism, a production DLRM serving system that eliminates GPU fragmentation by means of resource disaggregation, is accepted to USENIX NSDI 2025! See our paper “GPU-Disaggregated Serving for Deep Learning Recommendation Models at Scale” for details.
|
| Sep 1, 2024 |
💡Openings: Calling highly motivated students for internships in Shanghai lasting 3+ months. Passionate about large AI models (e.g., LLMs, VLMs, DiTs)? Please email me your CV. Experience with DL frameworks, distributed systems, or CUDA programming is a plus but not required.
|