High Throughput Hardware/Software Heterogeneous System for RRPN-based Scene Text Detection
Research output: Journal Publications and Reviews (RGC: 21, 22, 62) › 21_Publication in refereed journal › peer-review
Detail(s)
Original language | English |
---|---|
Journal / Publication | IEEE Transactions on Computers |
Online published | 24 Jun 2021 |
Publication status | Online published - 24 Jun 2021 |
Abstract
Rotation Region Proposal Networks (RRPN) generate rotated proposals that carry text-angle information for arbitrary-oriented scene text detection (STD). However, the computational complexity of RRPN inference is relatively high compared with other methods, which makes large-scale deployment difficult. In this paper, the first full-stack FPGA-CPU heterogeneous system design for an RRPN-based STD algorithm is proposed. A hardware/software partition method is presented to analyze and split the tasks so as to enhance the computation efficiency of the hardware. The fast 2D Winograd algorithm and block floating point are utilized to reduce computational complexity while maintaining relatively high precision. The implementation results show that the peak performance of the MAC arrays in the proposed architecture reaches 655.4 GOPS and the energy efficiency reaches 64.9 GOPS/W. By fully exploiting the parallelism and pipelining available in the algorithms, the first hardware architectures for the skew non-maximum suppression (S-NMS) layer and the rotation region-of-interest (RRoI) pooling layer are proposed. The throughput of the proposed hardware/software heterogeneous system achieves 40-fold and 1.4-fold improvements over CPU and GPU, respectively. Moreover, the comprehensive operating expense ratio of pure CPU, GPU, and the proposed system is 80.7:2.5:1, which indicates that the proposed system is suitable for large-scale deployment.
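As an illustration of the fast 2D Winograd algorithm mentioned in the abstract, the following is a minimal pure-Python sketch of the standard F(2x2, 3x3) variant. The tile size, the function names (`winograd_f2x2_3x3`, `direct_conv`), and the floating-point arithmetic are assumptions made for this example; the abstract does not state which Winograd configuration or number format the hardware uses. The sketch computes a 2x2 output tile from a 4x4 input tile with 16 element-wise multiplications, where direct computation needs 36.

```python
# Illustrative sketch only: standard Winograd F(2x2, 3x3) transform matrices.
# These are the textbook matrices, not values taken from the paper.
BT = [[1, 0, -1, 0],
      [0, 1, 1, 0],
      [0, -1, 1, 0],
      [0, 1, 0, -1]]
G = [[1.0, 0.0, 0.0],
     [0.5, 0.5, 0.5],
     [0.5, -0.5, 0.5],
     [0.0, 0.0, 1.0]]
AT = [[1, 1, 1, 0],
      [0, 1, -1, -1]]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(r) for r in zip(*m)]


def winograd_f2x2_3x3(tile, kernel):
    """Compute a 2x2 output tile from a 4x4 input tile and a 3x3 kernel."""
    u = matmul(matmul(G, kernel), transpose(G))    # kernel transform, 4x4
    v = matmul(matmul(BT, tile), transpose(BT))    # input transform, 4x4
    # Element-wise product: the only 16 "real" multiplications.
    m = [[a * b for a, b in zip(ur, vr)] for ur, vr in zip(u, v)]
    return matmul(matmul(AT, m), transpose(AT))    # output transform, 2x2


def direct_conv(tile, kernel):
    """Reference: direct 3x3 sliding-window correlation (36 multiplications)."""
    return [[sum(tile[i + r][j + c] * kernel[r][c]
                 for r in range(3) for c in range(3))
             for j in range(2)]
            for i in range(2)]


if __name__ == "__main__":
    tile = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
    kernel = [[1, 0, -1], [2, 0, -2], [1, 0, -1]]
    print(winograd_f2x2_3x3(tile, kernel))
    print(direct_conv(tile, kernel))
```

The kernel transform `u` depends only on the weights, so in an inference setting it could be precomputed once per filter; whether the proposed system does so is not stated in the abstract.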
Research Area(s)
- Central Processing Unit, Computer architecture, Field programmable gate arrays, Graphics processing units, Hardware, Proposals, Throughput
Citation Format(s)
High Throughput Hardware/Software Heterogeneous System for RRPN-based Scene Text Detection. / Xin, Yao; Chen, Donglong; Zeng, Chongyang; Zhang, Weichen; Wang, Yi; Cheung, Ray C.C.
In: IEEE Transactions on Computers, 24.06.2021.