Abstract
We attempt to establish geometrical methods for amino acid sequences. To measure the similarities of these sequences, a kernel on strings is defined using only the sequence structure and a good amino acid substitution matrix (e.g. BLOSUM62). The kernel is used in learning machines to predict binding affinities of peptides to human leukocyte antigen DR (HLA-DR) molecules. On both fixed allele (Nielsen and Lund in BMC Bioinform. 10:296, 2009) and pan-allele (Nielsen et al. in Immunome Res. 6(1):9, 2010) benchmark databases, our algorithm achieves the state-of-the-art performance. The kernel is also used to define a distance on an HLA-DR allele set based on which a clustering analysis precisely recovers the serotype classifications assigned by WHO (Holdsworth et al. in Tissue Antigens 73(2):95–170, 2009; Marsh et al. in Tissue Antigens 75(4):291–455, 2010). These results suggest that our kernel relates well the sequence structure of both peptides and HLA-DR molecules to their biological functions, and that it offers a simple, powerful and promising methodology to immunology and amino acid sequence studies.
| Original language | English |
|---|---|
| Pages (from-to) | 951-984 |
| Journal | Foundations of Computational Mathematics |
| Volume | 14 |
| Issue number | 5 |
| Online published | 17 Sept 2013 |
| DOIs | |
| Publication status | Published - Oct 2014 |
Research Keywords
- String kernel, Peptide binding prediction, Reproducing kernel Hilbert space, Major histocompatibility complex, HLA DRB allele classification
Fingerprint
Dive into the research topics of 'Introduction to the Peptide Binding Problem of Computational Immunology: New Results'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver