Serving multi-DNN workloads on FPGAs: A coordinated architecture, scheduling, and mapping perspective

S Zeng, G Dai, N Zhang, X Yang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Deep Neural Network (DNN) INFerence-as-a-Service (INFaaS) is the dominating workload
in current data centers, for which FPGAs become promising hardware platforms because of …

CD-MSA: cooperative and deadline-aware scheduling for efficient multi-tenancy on DNN accelerators

C Wang, Y Bai, D Sun - IEEE Transactions on Parallel and …, 2023 - ieeexplore.ieee.org
With DNN turning into the backbone of AI cloud services and propelling the emergence of
INFerence-as-a-Service (INFaaS), DNN-specific accelerators have become the …

Nimblock: Scheduling for fine-grained FPGA sharing through virtualization

M Mandava, P Reckamp, D Chen - Proceedings of the 50th Annual …, 2023 - dl.acm.org
As FPGAs become ubiquitous compute platforms, existing research has focused on
enabling virtualization features to facilitate fine-grained FPGA sharing. We employ an overlay …

Complexities for the Indian Economy of China's Growing Technological Competence

S Agarwal, A Gordon - 2022 - files.osf.io
This volume brings together the opinions of engineers, economists, and policymakers. It
contains chapters on approaches to assessing technological leadership in the China And …