C Lin, J Liu - 2024 IEEE/ACM 32nd International Symposium on …, 2024 - ieeexplore.ieee.org
Multi-tenant inference, as a prevalent inference paradigm nowadays, requires deploying
multiple deep learning models on the hardware platform to concurrently process inference …