A Study of Parallel Data Mining in a Peer-to-Peer Network
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 281-289 |
Journal / Publication | Concurrent Engineering Research and Applications |
Volume | 15 |
Issue number | 3 |
Publication status | Published - 1 Sept 2007 |
Link(s)
Abstract
In this article, a parallel data mining algorithm in a distributed Peer-to-Peer (P2P) network is designed and proposed. The algorithm has the following advanced features: the implementation of the algorithm for all nodes in a P2P network is the same which satisfies not only the distribution but also the random walking in/out features of a P2P network; it balances the working load of each node in the P2P network well; it is easy for the maintenance and reuse of the codes. All processes of this algorithm are executed in parallel over a P2P network to reach high efficiency, fine scalability and efficient communication. Data mining for large and distributed databases in P2P networks requires more efficient parallel or distributed algorithms. Dealing with a fast changing P2P environment also demands more flexible and scaleable methods. Our parallel algorithm provides a good solution. Parallel P2P data mining applications may play a key role in the next generation of distributed database networks, file sharing networks, and search engines.
Research Area(s)
- Association rule mining, Data mining, Distributed computing, Network applications, Parallel algorithm, Peer-to-peer network
Citation Format(s)
In: Concurrent Engineering Research and Applications, Vol. 15, No. 3, 01.09.2007, p. 281-289.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review