Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

Future Blog Post

less than 1 minute read

Published:

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

portfolio

publications

Reproducibility: Performance Evaluation of MemXCT on Azure CycleCloud Platform

Published in IEEE Transactions on Parallel and Distributed Systems, 2021

Yuchen Liu*, Yixuan Meng*, Kaiyuan Xu*, Zijun Xu*, Tianyuan Wu*, Yiwei Yang*, Shu Yin* (* All authors contributed equally).

Recommended citation: Yuchen Liu, Yixuan Meng, Kaiyuan Xu, Zijun Xu, Tianyuan Wu, Yiwei Yang, and Shu Yin. "Reproducibility: Performance Evaluation of MemXCT on Azure CycleCloud Platform." IEEE Transactions on Parallel and Distributed Systems 33, no. 9 (2021): 2047-2049.
Download Paper

Portus: Efficient DNN Checkpointing to Persistent Memory with Zero-Copy

Published in 44th IEEE International Conference on Distributed Computing Systems (ICDCS 24), 2024

Yuanhao Li*, Tianyuan Wu*, Guancheng Li, Yanjie Song, Shu Yin (* Equal contribution).

Recommended citation: Li, Yuanhao, Tianyuan Wu, Guancheng Li, Yanjie Song, and Shu Yin. "Portus: Efficient DNN Checkpointing to Persistent Memory with Zero-Copy." In 2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS), pp. 59-70. IEEE, 2024.
Download Paper

A Data Optimizer for Region-Aware Self-describing Files in Scientific Computing

Published in 2024 ACM Symposium on Cloud Computing (SoCC 24), 2024

Yanjie Song*, Tianyuan Wu*, Yuanhao Li, Guancheng Li, Yuchen Liu, Shu Yin, Wei Xue, Junchao Wang (* Equal contribution).

Recommended citation: Yanjie Song, Tianyuan Wu, Yuanhao Li, Guancheng Li, Yuchen Liu, Shu Yin, Wei Xue, and Junchao Wang. "A Data Optimizer for Region-Aware Self-describing Files in Scientific Computing." In Proceedings of the 15th ACM Symposium on Cloud Computing, pp. 431-446. 2024.
Download Paper

Greyhound: Hunting Fail-Slows in Hybrid-Parallel Training at Scale

Published in 2025 USENIX Annual Technical Conference (USENIX ATC 25), 2025

Tianyuan Wu, Wei Wang, Yinghao Yu, Siran Yang, Wenchao Wu, Qinkai Duan, Guodong Yang, Jiamang Wang, Lin Qu, Liping Zhang.

Recommended citation: Tianyuan Wu, Wei Wang, Yinghao Yu, Siran Yang, Wenchao Wu, Qinkai Duan, Guodong Yang, Jiamang Wang, Lin Qu, and Liping Zhang, ‘‘Greyhound: Hunting Fail-Slows in Hybrid-Parallel Training at Scale,’’ in the Proceedings of USENIX Annual Technical Conference (ATC ’25), Boston, MA, USA, July 2025.
Download Paper

Toppings: CPU-Assisted, Rank-Aware Adapter Serving for LLM Inference

Published in 2025 USENIX Annual Technical Conference (USENIX ATC 25), 2025

Suyi Li*, Hanfeng Lu*, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, Wei Wang (* Equal contribution).

Recommended citation: Suyi Li*, Hanfeng Lu*, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, and Wei Wang, ‘‘Toppings: CPU-Assisted, Rank-Aware Adapter Serving for LLM Inference,’’ in the Proceedings of USENIX Annual Technical Conference (ATC ’25), Boston, MA, USA, July 2025. (*Equal contribution)
Download Paper

Adaptra: Straggler-Resilient Hybrid-Parallel Training with Pipeline Adaptation

Published in Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI ’26), 2025

Tianyuan Wu*, Lunxi Cao*, Hangeng Lu, Xiaoxiao Jiang, Yinghao Yu, Siran Yang, Guodong Yang, Jiamang Wang, Lin Qu, Liping Zhang, Wei Wang.

Recommended citation: Tianyuan Wu*, Lunxi Cao*, Hanfeng Lu, Xiaoxiao Jiang, Yinghao Yu, Siran Yang, Guodong Yang, Jiamang Wang, Lin Qu, Liping Zhang, and Wei Wang, "Attack of the Bubbles: Straggler-Resilient Pipeline Parallelism for Large Model Training," in the Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI ’26), Renton, WA, USA, May 2026. (*Equal contribution)
Download Paper

talks

teaching