Skip to main navigation Skip to search Skip to main content

Protein subcellular localization based on PSI-BLAST and machine learning

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Subcellular location is an important functional annotation of proteins. An automatic, reliable and efficient prediction system for protein subcellular localization is necessary for large-scale genome analysis. This paper describes a protein subcellular localization method which extracts features from protein profiles rather than from amino acid sequences. The protein profile represents a protein family, discards part of the sequence information that is not conserved throughout the family and therefore is more sensitive than the amino acid sequence. The amino acid compositions of whole profile and the N-terminus of the profile are extracted, respectively, to train and test the probabilistic neural network classifiers. On two benchmark datasets, the overall accuracies of the proposed method reach 89.1% and 68.9%, respectively. The prediction results show that the proposed method perform better than those methods based on amino acid sequences. The prediction results of the proposed method are also compared with Subloc on two redundance-reduced datasets. © 2006 Imperial College Press.
Original languageEnglish
Pages (from-to)1181-1195
JournalJournal of Bioinformatics and Computational Biology
Volume4
Issue number6
DOIs
Publication statusPublished - Dec 2006

Research Keywords

  • Multiple sequence alignment
  • Position-specific scoring matrix
  • Probabillstic neural network
  • PSI-BLAST
  • Subcellular localization

Fingerprint

Dive into the research topics of 'Protein subcellular localization based on PSI-BLAST and machine learning'. Together they form a unique fingerprint.

Cite this