Parallelized Implementation of Universal Visual Computer

Tze-Yui Ho, Ping-Man Lam, Chi-Sing Leung

Research output: Chapters, Conference Papers, Creative and Literary Works > RGC 12 - Chapter in an edited book (Author)

Abstract

A cellular neural network (CNN) consists of a number of identical cells, which are arranged in a two-dimensional structure and are connected only to neighboring cells, where each cell has input, current, and next states. Distant cells influence one another through data propagation between neighboring cells. With the CNN approach, different applications, such as visual processing and optimization, are achieved using the same algorithm with a different set of parameters. Although the local connectivity of the cells is well suited for implementation on a GPU, there are two additional issues that we have to address: first, the computational model of the GPU is based on four-channel data, but CNN data is conventionally organized in a one-channel format; second, the data transfer rate between the GPU and main memory is much slower than the transfer rate between the CPU and main memory.
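The two ideas in the abstract, a locally connected cell update and the mismatch between one-channel CNN data and four-channel GPU data, can be illustrated with a minimal CPU-side sketch. It is not the chapter's implementation. The update uses the standard CNN state equation with 3x3 feedback and control templates, integrated with a forward-Euler step. The function names, the zero boundary rule, the step size, and the choice of packing each 2x2 block of cells into one four-component texel are all assumptions made for illustration.

```python
def output(x):
    """Standard CNN piecewise-linear output: clamps the state to [-1, 1]."""
    return 0.5 * (abs(x + 1.0) - abs(x - 1.0))


def cnn_step(state, inp, A, B, z, dt=0.1):
    """One forward-Euler step of dx/dt = -x + A*y + B*u + z on a 2D grid.

    A and B are 3x3 templates (feedback and control), z is the bias.
    Cells outside the grid are treated as zero (an assumed boundary rule).
    """
    rows, cols = len(state), len(state[0])
    nxt = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            acc = -state[i][j] + z
            # Only the 3x3 neighborhood contributes: local connectivity.
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ni, nj = i + di, j + dj
                    if 0 <= ni < rows and 0 <= nj < cols:
                        acc += A[di + 1][dj + 1] * output(state[ni][nj])
                        acc += B[di + 1][dj + 1] * inp[ni][nj]
            nxt[i][j] = state[i][j] + dt * acc
    return nxt


def pack_rgba(grid):
    """Pack each 2x2 block of a one-channel grid into one 4-tuple texel.

    Hypothetical layout: (top-left, top-right, bottom-left, bottom-right).
    Both grid dimensions are assumed to be even.
    """
    rows, cols = len(grid), len(grid[0])
    return [[(grid[i][j], grid[i][j + 1], grid[i + 1][j], grid[i + 1][j + 1])
             for j in range(0, cols, 2)]
            for i in range(0, rows, 2)]


def unpack_rgba(texels):
    """Inverse of pack_rgba: expand 4-tuple texels back to a one-channel grid."""
    grid = []
    for row in texels:
        top, bottom = [], []
        for r, g, b, a in row:
            top += [r, g]
            bottom += [b, a]
        grid += [top, bottom]
    return grid
```

Under this assumed layout, a grid of W x H cells occupies a texture of (W/2) x (H/2) texels, so one four-channel operation touches four cells at once. Changing the application means changing only `A`, `B`, and `z`, which mirrors the abstract's point about one algorithm with different parameter sets.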
Original language: English
Title of host publication: GPU Pro
Subtitle of host publication: Advanced Rendering Techniques
Editors: Wolfgang Engel
Place of Publication: Natick, Mass.
Publisher: A K Peters
Pages: 613-622
ISBN (Electronic): 9780429108426
ISBN (Print): 9781568814728, 1568814720, 9781439865538
Publication status: Published - 2010
