Sign in
Author
|
Conference
|
Journal
|
Organization
|
Year
|
DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all fields of study
Limit my searches in the following fields of study
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(6)
Document Clustering
Document Representation
frequent itemset
Hierarchical Clustering
High Dimensionality
Semantic Relations
Subscribe
Academic
Publications
Frequent Itemset Based Hierarchical Document Clustering Using Wikipedia as External Knowledge
Frequent Itemset Based Hierarchical Document Clustering Using Wikipedia as External Knowledge,10.1007/978-3-642-15390-7_2,Ravi Shankar,Vikram Pudi
Edit
Frequent Itemset Based Hierarchical Document Clustering Using Wikipedia as External Knowledge
(
Citations: 1
)
BibTex
|
RIS
|
RefWorks
Download
Ravi Shankar
,
Vikram Pudi
High dimensionality
is a major challenge in document clustering. Some of the recent algorithms address this problem by using frequent itemsets for clustering. But, most of these algorithms neglect the semantic relationship between the words. On the other hand there are algorithms that take care of the
semantic relations
between the words by making use of external knowledge contained in WordNet, Mesh, Wikipedia, etc but do not handle the high dimensionality. In this paper we present an efficient solution that addresses both these problems. We propose a
hierarchical clustering
algorithm using closed frequent itemsets that use Wikipedia as an external knowledge to enhance the document representation. We evaluate our methods based on F-Score on standard datasets and show our results to be better than existing approaches.
Conference:
Knowledge-Based Intelligent Information & Engineering Systems - KES
, pp. 11-20, 2010
DOI:
10.1007/978-3-642-15390-7_2
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
www.springerlink.com
)
(
www.springerlink.com
)
(
www.informatik.uni-trier.de
)
(
dx.doi.org
)
More »
References
(15)
Generalized Closed Itemsets for Association Rule Mining
(
Citations: 7
)
Vikram Pudi
,
Jayant R. Haritsa
Conference:
International Conference on Data Engineering - ICDE
, pp. 714-716, 2003
Finding groups in data
(
Citations: 494
)
L. Kaufman
,
P. Rousseeuw
Published in 1989.
Evaluation of hierarchical clustering algorithms for document datasets
(
Citations: 203
)
Ying Zhao
,
George Karypis
Conference:
International Conference on Information and Knowledge Management - CIKM
, pp. 515-524, 2002
Frequent term-based text clustering
(
Citations: 162
)
Florian Beil
,
Martin Ester
,
Xiaowei Xu
Conference:
Knowledge Discovery and Data Mining - KDD
, pp. 436-442, 2002
Hierarchical Document Clustering Using Frequent Itemsets
(
Citations: 57
)
Benjamin C. M. Fung
,
Ke Wang
,
Martin Ester
Published in 1999.
Sort by:
Citations
(1)
Semantics in social tagging systems: A review
Amna Majid
,
Shah Khusro
,
Azhar Rauf
Conference:
International Conference on Computer Networks and Information Technology - ICCNIT
, pp. 191-203, 2011