Top publications in data mining 1–100 of 71,181 results
Publications Citations  
1
Classification and Regression Trees (1984) 9405
2
Data Mining: Concepts and Techniques (2000) 5979
3
UCI repository of machine learning databases (1998) 5440
4
Introduction to Modern Information Retrieval (1984) 4976
5
Modern Information Retrieval (1999) 4930
6
Mining association rules between sets of items in large databases (1993) 4908
7
A Tutorial on Support Vector Machines for Pattern Recognition (1998) 4844
8
The anatomy of a large-scale hypertextual Web search engine (1998) 4676
9
Fast Algorithms for Mining Association Rules (1994) 4575
10
Algorithms for Clustering Data (1988) 4196
11
Social Network Analysis: Methods and Applications (1994) 4081
12
Data Mining: Practical Machine Learning Tools and Techniques (2005) 3991
13
Indexing by Latent Semantic Analysis (1990) 3902
14
Authoritative sources in a hyperlinked environment (1999) 3773
15
Data clustering: a review (1999) 3497
16
Some methods for classification and analysis of multivariate observations (1967) 3438
17
The Elements of Statistical Learning (2001) 3423
18
Finding Groups in Data: An Introduction to Cluster Analysis (1990) 2937
19
Random Forests (2001) 2856
20
Robust regression and outlier detection (1987) 2746
21
Generalized linear models (1984) 2730
22
Working knowledge: how organizations manage what they know (2000) 2630
23
Text categorization with support vector machines: Learning with many relevant features (1998) 2357
24
The PageRank Citation Ranking: Bringing Order to the Web (1998) 2185
25
Mining Sequential Patterns (1995) 2019
26
Latent Dirichlet allocation (2003) 1957
27
Mining frequent patterns without candidate generation (2000) 1939
28
Machine learning in automated text categorization (2002) 1901
29
Cluster Analysis (1993) 1846
30
An Introduction to Variable and Feature Selection (2003) 1821
31
The elements of statistical learning: data mining, inference, and prediction (2002) 1782
32
Cluster analysis for applications (1973) 1779
33
A comparative study on feature selection in text categorization (1997) 1766
34
Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations (1999) 1742
35
Binary Codes Capable of Correcting Deletions, Insertions and Reversals (1966) 1733
36
The EM algorithm and extensions (2000) 1708
37
Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer (1989) 1703
38
Machine learning in automated text categorization (2002) 1676
39
Some Methods for Classification and Analysis of MultiVariate Observations (1967) 1665
40
A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise (1996) 1646
41
A vector space model for automatic indexing (1975) 1621
42
Multidimensional binary search trees used for associative searching (1975) 1617
43
Advances in Knowledge Discovery and Data Mining (1996) 1602
44
A survey of approaches to automatic schema matching (2001) 1591
45
Latent Dirichlet Allocation (2001) 1548
46
GroupLens: an open architecture for collaborative filtering of netnews (1994) 1541
47
Community structure in social and biological networks (2002) 1515
48
Empirical Analysis of Predictive Algorithms for Collaborative Filtering (1998) 1477
49
Spectral Graph Theory (1997) 1460
50
Statistical analysis of finite mixture distributions (1985) 1446
51
Fast Algorithms for Mining Association Rules in Large Databases (1994) 1387
52
Combining Labeled and Unlabeled Data with Co-training (1998) 1371
53
Outliers in statistical data (1994) 1358
54
BIRCH: an efficient data clustering method for very large databases (1996) 1331
55
Learning Bayesian networks: The combination of knowledge and statistical data (1994) 1320
56
Fast Discovery of Association Rules (1996) 1294
57
Fast Effective Rule Induction (1995) 1278
58
On Spectral Clustering: Analysis and an algorithm (2001) 1230
59
A re-examination of text categorization methods (1999) 1228
60
Using collaborative filtering to weave an information tapestry (1992) 1198
61
Introduction to Data Mining (2005) 1172
62
Algorithms for Nonnegative Matrix Factorization (2000) 1162
63
Human behavior and the principle of least effort 1153
64
Models and issues in data stream systems (2002) 1125
65
Human behavior and the principle of least effort 1122
66
Principles of Data Mining (2001) 1113
67
Item-based collaborative filtering recommendation algorithms (2001) 1104
68
What is your strategy for managing knowledge (1999) 1092
69
A comparison of event models for Naive Bayes text classification (1998) 1072
70
Text Classification from Labeled and Unlabeled Documents using EM (2000) 1058
71
Multi-interval discretization of continuous-valued attributes for classification learning (1993) 1038
72
Privacy-Preserving Data Mining (2000) 1030
73
Knowledge Acquisition via Incremental Conceptual Clustering (1987) 1025
74
Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions (2005) 1024
75
k-Anonymity: A Model for Protecting Privacy (2002) 1017
76
Discriminant analysis and statistical pattern recognition (1992) 1015
77
An overview of data warehousing and OLAP technology (1997) 1010
78
From Data Mining to Knowledge Discovery: An Overview (1996) 1003
79
Graph structure in the Web (2000) 996
80
An Information-Theoretic Definition of Similarity (1998) 990
81
Mining Sequential Patterns: Generalization and Performance Improvements (1996) 986
82
Evaluating collaborative filtering recommender systems (2004) 965
83
Detection of Abrupt Changes: Theory and Applications (1992) 964
84
Automatic subspace clustering of high dimensional data for data mining applications (1998) 957
85
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss (1997) 951
86
Binary codes capable of correcting deletions, insertions and reversals (1965) 945
87
Text Categorization with Support Vector Machines: Learning with Many Relevant Features (1998) 944
88
Selection of Relevant Features and Examples in Machine Learning (1997) 943
89
Multivariate Density Estimation: Theory, Practice, and Visualization (1992) 942
90
Probabilistic latent semantic indexing (1999) 941
91
Mining Generalized Association Rules (1995) 931
92
An evaluation of statistical approaches to text categorization (1999) 928
93
Comparing partitions (1985) 905
94
Ensemble Methods in Machine Learning (2000) 903
95
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants (1999) 895
96
Mixture models: inference and applications to clustering (1988) 881
97
Data Mining: An Overview from a Database Perspective (1996) 879
98
Efficient and Effective Querying by Image Content (1994) 878
99
Optimizing search engines using clickthrough data (2002) 877
100
Integrating Classification and Association Rule Mining (1998) 873