LeMix: Unified Scheduling for LLM Training and Inference on Multi-GPU Systems

Abstract

To appear soon.

Type
Publication
In the 46th IEEE Real-Time Systems Symposium