Deep Learning on FPGAs with Multiple Service Levels for Edge Computing

Cong Gao, Sangeet Saha, Yufan Lu, Rappy Saha, Klaus D. Mcdonald-Maier, Xiaojun Zhai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

In the Internet of Things (IoT) era, deep learning is emerging as a promising approach for extracting information from IoT devices. Deep learning is also employed in the edge computing environment based on the demand for faster processing. In the edge server, various hardware accelerators have been proposed in recent studies to speed up the execution of such DNNs. One such accelerator is Xilinx's Deep Learning Processor Unit (DPU), designed for FPGA-based systems. However, the limited resource capacity of FPGAs in these edge servers imposes an enormous challenge for such implementation. Recent research has shown a clear trade-off between the 'resources consumed' vs. the 'performance achieved Taking a cue from these findings, we address the problem of efficient implementation of deep learning into the edge computing environment in this paper. The edge server employs FPGAs for executing the deep learning model. Each deep learning network is equipped with multiple distinct implementations represented by different service levels based on resource usage (where a higher service level implies higher performance with high resource consumption). To this end, we propose an Integer Linear Programming based optimal solution strategy for selecting a service level to maximize the overall performance subject to a given resource bound. Proof-of-concept case study with a deep learning network of multiple service levels of DPUs on a physical FPGA has also been provided.

Original languageEnglish
Title of host publication2022 27th International Conference on Automation and Computing
Subtitle of host publicationSmart Systems and Manufacturing, ICAC 2022
EditorsChenguang Yang, Yuchun Xu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Number of pages6
ISBN (Electronic)9781665498074
ISBN (Print)9781665498081
DOIs
Publication statusPublished - 10 Oct 2022
Externally publishedYes
Event27th International Conference on Automation and Computing - Bristol, United Kingdom
Duration: 1 Sep 20223 Sep 2022
Conference number: 27

Conference

Conference27th International Conference on Automation and Computing
Abbreviated titleICAC 2022
Country/TerritoryUnited Kingdom
CityBristol
Period1/09/223/09/22

Fingerprint

Dive into the research topics of 'Deep Learning on FPGAs with Multiple Service Levels for Edge Computing'. Together they form a unique fingerprint.

Cite this