LLMVisor: A Real-Time Latency Attribution Model for Multi-Tenant LLM Serving

Publication
Workshop on ML for Systems at NeurIPS 2025
Shuowei Jin
Applied Scientist at Amazon

My research interests include efficient algorithms and systems for LLM inference and training, LLM post-training recipes, and general machine learning systems.