Academic
Publications
Text Clustering for Peer-to-Peer Networks with Probabilistic Guarantees

Text Clustering for Peer-to-Peer Networks with Probabilistic Guarantees,10.1007/978-3-642-12275-0_27,Odysseas Papapetrou,Wolf Siberski,Norbert Fuhr

Text Clustering for Peer-to-Peer Networks with Probabilistic Guarantees   (Citations: 2)
BibTex | RIS | RefWorks Download
Text clustering is an established technique for improving quality in information retrieval, for both centralized and distributed envi- ronments. However, for highly distributed environments, such as peer-to- peer networks, current clustering algorithms fail to scale. Our algorithm for peer-to-peer clustering achieves high scalability by using a proba- bilistic approach for assigning documents to clusters. It enables a peer to compare each of its documents only with very few selected clusters, without signicant loss of clustering quality. The algorithm oers proba- bilistic guarantees for the correctness of each document assignment to a cluster. Extensive experimental evaluation with up to 100000 peers and 1 million documents demonstrates the scalability and eectiveness of the algorithm.
Conference: European Colloquium on IR Research - ECIR , pp. 293-305, 2010
Cumulative Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
Sort by: