Publications

(2024). Compute Or Load KV Cache? Why Not Both?. Preprint.

PDF

(2024). Eagle: Efficient Training-Free Router for Multi-LLM Inference. ML4Sys@NeurIPS24.

PDF

(2024). AutoSpec: Automated Generation of Neural Network Specifications. Preprint.

PDF

(2024). Adaptive Skeleton Graph Decoding. Preprint.

PDF

(2024). OASIS: Collaborative Neural-Enhanced Mobile Video Streaming. MMSys 2024 Best Paper Award.

PDF

(2024). QUIC is not Quick Enough over Fast Internet. WWW 2024.

(2024). The Case for Boosting Mobile Application QoE via Smart Band Switching in 5G/xG Networks. HotMobile 2024.

(2023). On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures. USENIX Security 2024.

(2023). Poster: QUIC is not Quick Enough over Fast Internet. IMC 2023.

(2022). Vivisecting Mobility Management in 5G Cellular Networks. SIGCOMM 2022.

PDF Code Dataset

(2021). ResTune: Resource Oriented Tuning Boosted by Meta-Learning for Cloud Databases. SIGMOD 2021.

PDF

(2021). A Variegated Look at 5G in the Wild: Performance, Power, and QoE Implications. SIGCOMM 2021.

PDF Code Video DOI