[HTML][HTML] Applications of machine vision in pharmaceutical technology: A review

DL Galata, LA Meszaros, N Kallai-Szabo… - European Journal of …, 2021 - Elsevier
The goal of this paper is to give an introduction to analysis of images acquired by a digital
camera with visible illumination and to review its applications as a Process Analytical …

Serving heterogeneous machine learning models on {Multi-GPU} servers with {Spatio-Temporal} sharing

S Choi, S Lee, Y Kim, J Park, Y Kwon… - 2022 USENIX Annual …, 2022 - usenix.org
As machine learning (ML) techniques are applied to a widening range of applications, high
throughput ML inference serving has become critical for online services. Such ML inference …

Deeprecsys: A system for optimizing end-to-end at-scale neural recommendation inference

U Gupta, S Hsia, V Saraph, X Wang… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Neural personalized recommendation is the cornerstone of a wide collection of cloud
services and products, constituting significant compute demand of cloud infrastructure. Thus …

Planaria: Dynamic architecture fission for spatial multi-tenant acceleration of deep neural networks

S Ghodrati, BH Ahn, JK Kim, S Kinzer… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) have reinvigorated real-world applications that rely on
learning patterns of data and are permeating into different industries and markets. Cloud …

Centaur: A chiplet-based, hybrid sparse-dense accelerator for personalized recommendations

R Hwang, T Kim, Y Kwon, M Rhu - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Personalized recommendations are the backbone machine learning (ML) algorithm that
powers several important application domains (eg, ads, e-commerce, etc) serviced from …

A multi-neural network acceleration architecture

E Baek, D Kwon, J Kim - 2020 ACM/IEEE 47th Annual …, 2020 - ieeexplore.ieee.org
A cost-effective multi-tenant neural network execution is becoming one of the most important
design goals for modern neural network accelerators. For example, as emerging AI services …

Heterogeneous dataflow accelerators for multi-DNN workloads

H Kwon, L Lai, M Pellauer, T Krishna… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Emerging AI-enabled applications such as augmented and virtual reality (AR/VR) leverage
multiple deep neural network (DNN) models for various sub-tasks such as object detection …

Auto-split: A general framework of collaborative edge-cloud AI

A Banitalebi-Dehkordi, N Vedula, J Pei, F Xia… - Proceedings of the 27th …, 2021 - dl.acm.org
In many industry scale applications, large and resource consuming machine learning
models reside in powerful cloud servers. At the same time, large amounts of input data are …

Band: coordinated multi-dnn inference on heterogeneous mobile processors

JS Jeong, J Lee, D Kim, C Jeon, C Jeong… - Proceedings of the 20th …, 2022 - dl.acm.org
The rapid development of deep learning algorithms, as well as innovative hardware
advancements, encourages multi-DNN workloads such as augmented reality applications …

Enable simultaneous dnn services based on deterministic operator overlap and precise latency prediction

W Cui, H Zhao, Q Chen, N Zheng, J Leng… - Proceedings of the …, 2021 - dl.acm.org
While user-facing services experience diurnal load patterns, co-locating services improve
hardware utilization. Prior work on co-locating services on GPUs run queries sequentially …