Abstract
Subcellular location is an important functional annotation of proteins. An automatic, reliable and efficient prediction system for protein subcellular localization is necessary for large-scale genome analysis. This paper describes a protein subcellular localization method which extracts features from protein profiles rather than from amino acid sequences. The protein profile represents a protein family, discards part of the sequence information that is not conserved throughout the family and therefore is more sensitive than the amino acid sequence. The amino acid compositions of whole profile and the N-terminus of the profile are extracted, respectively, to train and test the probabilistic neural network classifiers. On two benchmark datasets, the overall accuracies of the proposed method reach 89.1% and 68.9%, respectively. The prediction results show that the proposed method perform better than those methods based on amino acid sequences. The prediction results of the proposed method are also compared with Subloc on two redundance-reduced datasets. © 2006 Imperial College Press.
| Original language | English |
|---|---|
| Pages (from-to) | 1181-1195 |
| Journal | Journal of Bioinformatics and Computational Biology |
| Volume | 4 |
| Issue number | 6 |
| DOIs | |
| Publication status | Published - Dec 2006 |
Research Keywords
- Multiple sequence alignment
- Position-specific scoring matrix
- Probabillstic neural network
- PSI-BLAST
- Subcellular localization
Fingerprint
Dive into the research topics of 'Protein subcellular localization based on PSI-BLAST and machine learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver