Skip to main navigation Skip to search Skip to main content

Genome-scale NCRNA homology search using a Hamming distance-based filtration strategy

  • Yanni Sun*
  • , Osama Aljawad
  • , Jikai Lei
  • , Alex Liu
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

25 Downloads (CityUHK Scholars)

Abstract

Background: NCRNAs (noncoding RNAs) play important roles in many biological processes. Existing genome-scale ncRNA search tools identify ncRNAs in local sequence alignments generated by conventional sequence comparison methods. However, some types of ncRNA lack strong sequence conservation and tend to be missed or mis-aligned by conventional sequence comparison.Results: In this paper, we propose an ncRNA identification framework that is complementary to existing sequence comparison tools. By integrating a filtration step based on Hamming distance and ncRNA alignment programs such as FOLDALIGN or PLAST-ncRNA, the proposed ncRNA search framework can identify ncRNAs that lack strong sequence conservation. In addition, as the ratio of transition and transversion mutation is often used as a discriminative feature for functional ncRNA identification, we incorporate this feature into the filtration step using a coding strategy. We apply Hamming distance seeds to ncRNA search in the intergenic regions of human and mouse genomes and between the Burkholderia cenocepacia J2315 genome and the Ralstonia solanacearum genome. The experimental results demonstrate that a carefully designed Hamming distance seed can achieve better sensitivity in searching for poorly conserved ncRNAs than conventional sequence comparison tools.Conclusions: Hamming distance seeds provide better sensitivity as a filtration strategy for genome-wide ncRNA homology search than the existing seeding strategies used in BLAST-like tools. By combining Hamming distance seeds matching and ncRNA alignment, we are able to find ncRNAs with sequence similarities below 60%. © 2012 Sun et al.; licensee BioMed Central Ltd.
Original languageEnglish
Article numberS12
JournalBMC Bioinformatics
Volume13
Issue numberSUPPL.3
DOIs
Publication statusPublished - 21 Mar 2012
Externally publishedYes

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Publisher's Copyright Statement

  • This full text is made available under CC-BY 2.0. https://creativecommons.org/licenses/by/2.0/

Fingerprint

Dive into the research topics of 'Genome-scale NCRNA homology search using a Hamming distance-based filtration strategy'. Together they form a unique fingerprint.

Cite this