IRIS: Interference and Resource Aware Predictive Orchestration for ML Inference Serving

2023 - IEEE Cloud (International Conference on Cloud Computing)
(THIS PAPER RECEIVED THE BEST PAPER AWARD OF 2023 IEEE CLOUD). Over the last years, the ever-growing number of Machine Learning (ML) and Artificial Intelligence (AI) applications deployed in the Cloud has led to high demands on the computing resources required for efficient processing. Multiple…