TY - JOUR
T1 - Fast prediction of protein-protein interaction sites based on Extreme Learning Machines
AU - Wang, Debby D.
AU - Wang, Ran
AU - Yan, Hong
PY - 2014/3/27
Y1 - 2014/3/27
N2 - Conventional machine learning methods can be used to identify protein-protein interaction sites and study the gene regulatory networks and functions. However, when applied to large datasets, the computational complexities of these methods become a major drawback. With a significantly reduced computational complexity, the Extreme Learning Machines provide an attractive balance between computational time and generalization performance. In the method proposed in this paper, after searching for interfacial residues using a dynamic strategy and extracting spatially neighboring residue profiles for a set of 563 non-redundant protein chains, we implement the interface prediction either on multi-chain sets or on single-chain sets, using the two methods Extreme Learning Machines and support vector machines for a comparable study. As a consequence, in both multi-chain and single-chain cases Extreme Learning Machines tend to obtain higher Recall values than support vector machines, and in the multi-chain case Extreme Learning Machines as well show a remarkable advantage in the computational speed. © 2013 Elsevier B.V.
AB - Conventional machine learning methods can be used to identify protein-protein interaction sites and study the gene regulatory networks and functions. However, when applied to large datasets, the computational complexities of these methods become a major drawback. With a significantly reduced computational complexity, the Extreme Learning Machines provide an attractive balance between computational time and generalization performance. In the method proposed in this paper, after searching for interfacial residues using a dynamic strategy and extracting spatially neighboring residue profiles for a set of 563 non-redundant protein chains, we implement the interface prediction either on multi-chain sets or on single-chain sets, using the two methods Extreme Learning Machines and support vector machines for a comparable study. As a consequence, in both multi-chain and single-chain cases Extreme Learning Machines tend to obtain higher Recall values than support vector machines, and in the multi-chain case Extreme Learning Machines as well show a remarkable advantage in the computational speed. © 2013 Elsevier B.V.
KW - Extreme Learning Machines
KW - Protein-protein interaction sites
KW - Residue sequence profile
KW - Support vector machines
UR - http://www.scopus.com/inward/record.url?scp=84893651255&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-84893651255&origin=recordpage
U2 - 10.1016/j.neucom.2012.12.062
DO - 10.1016/j.neucom.2012.12.062
M3 - RGC 21 - Publication in refereed journal
SN - 0925-2312
VL - 128
SP - 258
EP - 266
JO - Neurocomputing
JF - Neurocomputing
ER -