Abstract
Biclustering is an important technique in data mining for searching similar patterns. Geometric biclustering (GBC) method is used to reduce the complexity of the NP-complete biclustering algorithm. This paper studies three commonly used modern platforms including multi-core CPU, GPU and FPGA to accelerate this GBC algorithm. By analyzing the parallelizing property of the GBC algorithm, we design 1) a multi-threaded software running on a server grade multi-core CPU system, 2) a CUDA program for GPU to accelerate the GBC algorithm, and 3) a novel parameterizable and scalable hardware architecture implemented on an FPGA. Genes microarray pattern analysis is employed as an example to demonstrate performance comparisons on different platforms. In particular, we compare the speed and energy efficiency of the three proposed methods. We found that 1) GPU achieves the highest average speedup of 48 × compared to single-threaded GBC program, 2) Our FPGA design can achieve higher speedup of 4× for the computation for large microarray, and 3) FPGA consumes the least energy, which is about 3.53 × more efficient than the single-threaded GBC program.
| Original language | English |
|---|---|
| Article number | 6579602 |
| Pages (from-to) | 2540-2550 |
| Journal | IEEE Transactions on Parallel and Distributed Systems |
| Volume | 25 |
| Issue number | 10 |
| DOIs | |
| Publication status | Published - 1 Oct 2014 |
Research Keywords
- field-programmable gate array (FPGA)
- Geometric biclustering (GBC)
- graphics processing unit (GPU)
- microarray data
Fingerprint
Dive into the research topics of 'Design exploration of geometric biclustering for microarray data analysis in data mining'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver