Nanily: A QoS-Aware Scheduling for DNN Inference Workload in Clouds

X Tang, P Wang, Q Liu, W Wang, J Han - 2019 IEEE 21st International …, 2019 - computer.org
DNN inference is widely emerging as a service and must run at sub-second latency,
which requires GPU hardware for parallel acceleration. Prior works to improve the …