Accelerating de novo SINE annotation in plant and animal genomes

Herui Liao, Yanni Sun*, Shujun Ou*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

22 Downloads (CityUHK Scholars)

Abstract

Genome annotation is an important but challenging task. Accurate identification of short interspersed nuclear elements (SINEs) is particularly difficult due to their lack of highly conserved sequences. AnnoSINE is state-of-the-art software for annotating SINEs in plant genomes, but it is computationally inefficient for large genomes. Moreover, its applicability to animals is limited due to the absence of animal pHMMs in its HMM library. Therefore, we propose AnnoSINE_v2, which extends accurate SINE annotation for animal genomes with greatly optimized computational efficiency. Our results show that AnnoSINE_v2’s annotation of SINEs has over 20% higher F1-score compared to the existing tools on animal genomes and enables the processing of complicated genomes, like human and zebrafish, which were beyond the capabilities of AnnoSINE_v1. AnnoSINE_v2 is freely available on Conda and GitHub: https://github.com/liaoherui/AnnoSINE_v2. © The Author(s) 2024.
Original languageEnglish
Article number24
JournalMobile DNA
Volume15
Online published19 Oct 2024
DOIs
Publication statusPublished - 2024

Research Keywords

  • Genome annotation
  • SINE identification
  • Transposable element

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Accelerating de novo SINE annotation in plant and animal genomes'. Together they form a unique fingerprint.

Cite this