IRIS: Interference and Resource Aware Predictive Orchestration for ML Inference Serving

2023 - IEEE Cloud (International Conference on Cloud Computing)
(THIS PAPER RECEIVED THE BEST PAPER AWARD OF 2023 IEEE CLOUD). Over the last years, the ever-growing number of Machine Learning (ML) and Artificial Intelligence (AI) applications deployed in the Cloud has led to high demands on the computing resources required for efficient processing. Multiple…

On the Implications of Heterogeneous Memory Tiering on Spark In-Memory Analytics

2nd Workshop on Composable Systems (COMPSYS2023)
This study considers a multi-tier heterogeneous DRAM/NVM memory system with contrasting access latency, bandwidth and energy consumption capabilities. The paper examines the implications of heterogeneous memory tiering on Spark in-memory analytics and addresses the challenge of the growing demand…