Geometric biclustering and its applications to cancer tissue classification based on DNA microarray gene expression data

Research output: Chapters, Conference Papers, Creative and Literary Works > RGC 12 - Chapter in an edited book (Author)

Abstract

Biclustering is an important tool in microarray data analysis when only a subset of genes is coregulated under a subset of conditions. It is a useful technique for cancer tissue classification based on gene expression data. Unlike standard clustering analysis, biclustering classifies simultaneously along both dimensions of a data matrix, genes and conditions. However, the biclustering problem is inherently intractable and computationally complex. In this chapter, we present a novel geometric perspective on the biclustering problem and the related geometric algorithms. Under this geometric interpretation, different types of biclusters can be mapped to linear geometric structures, such as points, lines, or hyperplanes, in a high-dimensional data space. Such a perspective makes it possible to unify the formulation of biclusters, so that the biclustering process can be interpreted as a search for linear geometries in that space. Based on the linear geometry formulation, we develop Hough transform-based biclustering algorithms. Considering the computational complexity of the search, the presence of noise in microarray data, and the biological meaning of biclusters, we propose several methods to improve the geometric biclustering algorithms. Simulation studies show that the algorithms can discover significant biclusters despite increased noise levels and regulatory complexity. Furthermore, the algorithms are also effective in extracting biologically meaningful biclusters from real microarray gene expression data.
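
The geometric view in the abstract can be illustrated with a minimal two-dimensional sketch. It is not the chapter's algorithm. It assumes an additive pattern between two conditions, so that coregulated genes satisfy y = x + c and fall on a line in the plane of the two conditions, and it uses a standard (theta, rho) Hough accumulator to find that line among noise genes. The function name `hough_line_2d`, the bin counts, and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np

def hough_line_2d(x, y, n_theta=180, n_rho=200):
    """Find the strongest line rho = x*cos(theta) + y*sin(theta) by voting.

    Returns the angle of the winning accumulator cell and the indices of
    the points (genes) that voted for it.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    thetas = np.linspace(0.0, np.pi, n_theta, endpoint=False)
    # |rho| can never exceed the largest distance of a point from the origin.
    rho_max = np.hypot(x, y).max()
    rho_edges = np.linspace(-rho_max, rho_max, n_rho + 1)
    # rhos[i, t] is the rho of the line through point i with angle thetas[t].
    rhos = x[:, None] * np.cos(thetas) + y[:, None] * np.sin(thetas)
    bins = np.clip(np.digitize(rhos, rho_edges) - 1, 0, n_rho - 1)
    # Accumulator: one row of rho-bin vote counts per angle.
    acc = np.stack([np.bincount(bins[:, t], minlength=n_rho)
                    for t in range(n_theta)])
    t_best, r_best = np.unravel_index(acc.argmax(), acc.shape)
    members = np.flatnonzero(bins[:, t_best] == r_best)
    return thetas[t_best], members

# Synthetic data (an assumption, not taken from the chapter):
# genes 0..39 follow the additive pattern y = x + 2, genes 40..79 are noise.
rng = np.random.default_rng(0)
x_line = rng.uniform(0, 8, 40)
x_noise = rng.uniform(0, 10, 40)
y_noise = rng.uniform(0, 10, 40)
x = np.concatenate([x_line, x_noise])
y = np.concatenate([x_line + 2.0, y_noise])

theta, members = hough_line_2d(x, y)
print("angle of strongest line (radians):", theta)
print("genes voting for it:", members)
```

In this parameterization the line y = x + 2 corresponds to theta = 3π/4, so the winning cell should sit near that angle and collect the planted genes. Extending the idea to more than two conditions means voting for hyperplanes rather than lines, which is where the search cost mentioned in the abstract grows quickly.
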
Original language: English
Title of host publication: Computational biology
Subtitle of host publication: issues and applications in oncology
Editors: Tuan D. Pham
Place of Publication: New York; London
Publisher: Springer
Pages: 19-53
ISBN (Print): 9781441908117, 1441908110
DOIs
Publication status: Published - 2009

Publication series

Name: Applied bioinformatics and biostatistics in cancer research

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
