PiLLM: Resource-efficient LLM Inference Using Workload Prediction
Jan 1, 2026
Yunqian Fan
Shihao Bai

Ruihao Gong (Corresponding Author)
Zaijun Wang
Rui Fan (Corresponding Author)
Publication: Proceedings of the 21st European Conference on Computer Systems (EuroSys)