TY - JOUR
T1 - A Study of Parallel Data Mining in a Peer-to-Peer Network
AU - Guan, Huiwei
AU - Ip, Horace H. S.
PY - 2007/9/1
Y1 - 2007/9/1
N2 - In this article, a parallel data mining algorithm for a distributed Peer-to-Peer (P2P) network is designed and proposed. The algorithm has the following features: its implementation is the same on every node of the P2P network, which suits both the distributed nature of the network and the way peers randomly walk in and out of it; it balances the workload of the nodes well; and its code is easy to maintain and reuse. All processes of the algorithm execute in parallel over the P2P network to achieve high efficiency, fine scalability, and efficient communication. Data mining for large, distributed databases in P2P networks requires more efficient parallel or distributed algorithms, and dealing with a fast-changing P2P environment also demands more flexible and scalable methods. Our parallel algorithm provides a good solution. Parallel P2P data mining applications may play a key role in the next generation of distributed database networks, file sharing networks, and search engines.
AB - In this article, a parallel data mining algorithm for a distributed Peer-to-Peer (P2P) network is designed and proposed. The algorithm has the following features: its implementation is the same on every node of the P2P network, which suits both the distributed nature of the network and the way peers randomly walk in and out of it; it balances the workload of the nodes well; and its code is easy to maintain and reuse. All processes of the algorithm execute in parallel over the P2P network to achieve high efficiency, fine scalability, and efficient communication. Data mining for large, distributed databases in P2P networks requires more efficient parallel or distributed algorithms, and dealing with a fast-changing P2P environment also demands more flexible and scalable methods. Our parallel algorithm provides a good solution. Parallel P2P data mining applications may play a key role in the next generation of distributed database networks, file sharing networks, and search engines.
KW - Association rule mining
KW - Data mining
KW - Distributed computing
KW - Network applications
KW - Parallel algorithm
KW - Peer-to-peer network
UR - http://www.scopus.com/inward/record.url?scp=34848881619&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-34848881619&origin=recordpage
U2 - 10.1177/1063293X07083088
DO - 10.1177/1063293X07083088
M3 - RGC 21 - Publication in refereed journal
SN - 1063-293X
VL - 15
SP - 281
EP - 289
JO - Concurrent Engineering Research and Applications
JF - Concurrent Engineering Research and Applications
IS - 3
ER -