Shuowei Jin
Publications
CoMem: Context Management with A Decoupled Long-Context Model
Yuwei Zhang, Chengyu Dong, Shuowei Jin, Changlong Yu, Hejie Cui, Hongye Jin, Xinyang Zhang, Hamed Bonab, Colin Lockard, Jianshu Chen, Zhenyu Shi, Jingbo Shang, Xian Li, Bing Yin
T2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning
Haixin Wang, Hejie Cui, Chenwei Zhang, Xin Liu, Shuowei Jin, Shijie Geng, Xinyang Zhang, Nasser Zalmout, Zhenyu Shi, Yizhou Sun
LLMVisor: A Real-Time Latency Attribution Model for Multi-Tenant LLM Serving
Shuowei Jin, Xueshen Liu, Jiaxin Shan, Le Xu, Tieying Zhang, Liguang Xie, Z. Morley Mao
Plato: Plan to Efficiently Decode for Large Language Model Inference
Shuowei Jin, Xueshen Liu, Yongji Wu, Haizhong Zheng, Qingzhao Zhang, Atul Prakash, Matthew Lentz, Danyang Zhuo, Feng Qian, Z. Morley Mao
Compute Or Load KV Cache? Why Not Both?
Large Language Models (LLMs) are increasingly deployed in large-scale online services, enabling sophisticated applications. However, …
Shuowei Jin, Xueshen Liu, Qingzhao Zhang, Z. Morley Mao
HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs
The Mixture-of-Experts (MoE) architecture has become increasingly popular as a method to scale up large language models (LLMs). To save …
Yongji Wu, Xueshen Liu, Shuowei Jin, Ceyu Xu, Feng Qian, Z. Morley Mao, Matthew Lentz, Danyang Zhuo, Ion Stoica
Eagle: Efficient Training-Free Router for Multi-LLM Inference
The proliferation of Large Language Models (LLMs) with varying capabilities and costs has created a need for efficient model selection …
Zesen Zhao, Shuowei Jin, Z. Morley Mao
AutoSpec: Automated Generation of Neural Network Specifications
The increasing adoption of neural networks in learning-augmented systems highlights the importance of model safety and robustness, …
Shuowei Jin, Francis Y. Yan, Cheng Tan, Anuj Kalia, Xenofon Foukas, Z. Morley Mao
On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures
Collaborative perception, which greatly enhances the sensing capability of connected and autonomous vehicles (CAVs) by incorporating …
Qingzhao Zhang, Shuowei Jin, Ruiyang Zhu, Jiachen Sun, Xumiao Zhang, Qi Alfred Chen, Z. Morley Mao
OASIS: Collaborative Neural-Enhanced Mobile Video Streaming
Neural-enhanced video streaming (e.g. super-resolution) is an ongoing revolution which can provide extremely high-quality video …
Shuowei Jin, Ruiyang Zhu, Ahmad Hassan, Xiao Zhu, Xumiao Zhang, Z. Morley Mao, Feng Qian, Zhi-Li Zhang