Shuowei Jin
Compute Or Load KV Cache? Why Not Both?
Large Language Models (LLMs) are increasingly deployed in large-scale online services, enabling sophisticated applications. However, …
Shuowei Jin, Xueshen Liu, Qingzhao Zhang, Z. Morley Mao
Plato: Plan to Efficiently Decode for Large Language Model Inference
Large Language Models (LLMs) are increasingly deployed in large-scale online services, enabling sophisticated applications. However, …
Shuowei Jin, Xueshen Liu, Yongji Wu, Haizhong Zheng, Qingzhao Zhang, Atul Prakash, Matthew Lentz, Danyang Zhuo, Feng Qian, Z. Morley Mao
Eagle: Efficient Training-Free Router for Multi-LLM Inference
The proliferation of Large Language Models (LLMs) with varying capabilities and costs has created a need for efficient model selection …
Zesen Zhao, Shuowei Jin, Z. Morley Mao
AutoSpec: Automated Generation of Neural Network Specifications
The increasing adoption of neural networks in learning-augmented systems highlights the importance of model safety and robustness, …
Shuowei Jin, Francis Y. Yan, Cheng Tan, Anuj Kalia, Xenofon Foukas, Z. Morley Mao
On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures
Collaborative perception, which greatly enhances the sensing capability of connected and autonomous vehicles (CAVs) by incorporating …
Qingzhao Zhang, Shuowei Jin, Ruiyang Zhu, Jiachen Sun, Xumiao Zhang, Qi Alfred Chen, Z. Morley Mao
OASIS: Collaborative Neural-Enhanced Mobile Video Streaming
Neural-enhanced video streaming (e.g. super-resolution) is an ongoing revolution which can provide extremely high-quality video …
Shuowei Jin, Ruiyang Zhu, Ahmad Hassan, Xiao Zhu, Xumiao Zhang, Z. Morley Mao, Feng Qian, Zhi-Li Zhang
QUIC is not Quick Enough over Fast Internet
QUIC is expected to be a game-changer in improving web application performance. In this paper, we conduct a systematic examination of …
Xumiao Zhang, Shuowei Jin, Yi He, Ahmad Hassan, Z. Morley Mao, Feng Qian, Zhi-Li Zhang
The Case for Boosting Mobile Application QoE via Smart Band Switching in 5G/xG Networks
5G and future 6G networks support diverse combinations of access technologies, architectures, and radio frequencies, with each …
Ahmad Hassan, Anlan Zhang, Wei Ye, Jason Carpenter, Ruiyang Zhu, Shuowei Jin, Z. Morley Mao, Feng Qian, Zhi-Li Zhang
Vivisecting Mobility Management in 5G Cellular Networks
With 5G’s support for diverse radio bands and different deployment modes, e.g. standalone (SA) vs. non-standalone (NSA), mobility …
Ahmad Hassan, Arvind Narayanan, Anlan Zhang, Wei Ye, Ruiyang Zhu, Shuowei Jin, Jason Carpenter, Z. Morley Mao, Zhi-Li Zhang, Feng Qian
ResTune: Resource Oriented Tuning Boosted by Meta-Learning for Cloud Databases
Modern database management systems (DBMS) contain tens to hundreds of critical performance tuning knobs that determine the system …
Xinyi Zhang, Hong Wu, Zhuo Chang, Shuowei Jin, Jian Tan, Feifei Li, Tieying Zhang, Bin Cui