Author
|
Conference
|
Journal
|
Organization
|
Year
|
DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all domains
Limit my searches in the following domains
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(7)
Background Knowledge
Document Classification
Document Retrieval
Learning Process
Supervised Learning
Text Classification
Transfer Learning
Subscribe
Academic
Publications
Towards a Universal Text Classifier: Transfer Learning Using Encyclopedic Knowledge
Edit
Towards a Universal Text Classifier: Transfer Learning Using Encyclopedic Knowledge
(
Citations: 1
)
BibTex
|
RIS
|
RefWorks
Download
Pu Wang
,
Carlotta Domeniconi
Document classification
is a key task for many text min- ing applications. However, traditional
text classification
requires labeled data to construct reliable and accurate classifiers. Unfortunately, labeled data are seldom avail- able, and often too expensive to obtain. In this work, we propose a universal text classifier, which does not require any labeled training document. Our approach simulates the capability of people to classify documents based on background knowledge. As such, we build a classifier that can effectively group documents based on their content, underthe guidanceof few words, whichwe call discriminant words, describing the classes of inter- est. Backgroundknowledgeis modeledusing encyclope- dic knowledge, namely Wikipedia. Wikipedia's articles related to the specific problem domain at hand are se- lected, and used during the
learning process
for predict- ing labels of test documents. The universal text classifier can also be used to perform document retrieval, in which the pool of test documents may or may not be relevant to the topics of interest for the user. In ourexperimentswith real data we test the feasibility of our approach for both the classification and retrieval tasks. The results demon- strate the advantage of incorporating backgroundknowl- edge through Wikipedia, and the effectiveness of mod- eling such knowledge via probabilistic topic modeling. The accuracy achieved by the universal text classifier is comparable to that of a
supervised learning
technique for transfer learning.
Conference:
IEEE International Conference on Data Mining - ICDM
, pp. 435-440, 2009
DOI:
10.1109/ICDMW.2009.101
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
doi.ieeecomputersociety.org
)
(
cs.gmu.edu
)
(
ieeexplore.ieee.org
)
(
www.informatik.uni-trier.de
)
(
ieeexplore.ieee.org
)
More »
Citation Context
(1)
...As far as text classification without labelled data is concerned, several works have been proposed recently for building flat text classifier without labelled data such as [8,
23
, 25, 14, 13]...
Viet Ha-Thuc
,
et al.
Large-scale hierarchical text classification without labelled data
References
(28)
Latent dirichlet allocation
(
Citations: 1957
)
David M. Blei
,
Andrew Y. Ng
,
Michael I. Jordan
Journal:
Journal of Machine Learning Research - JMLR
, vol. 3, pp. 993-1022, 2003
Importance of Semantic Representation: Dataless Classification
(
Citations: 5
)
Ming-wei Chang
,
Lev-arie Ratinov
,
Dan Roth
,
Vivek Srikumar
Conference:
National Conference on Artificial Intelligence - AAAI
, pp. 830-835, 2008
Co-clustering based classification for out-of-domain documents
(
Citations: 48
)
Wenyuan Dai
,
Gui-rong Xue
,
Qiang Yang
,
Yong Yu
Conference:
Knowledge Discovery and Data Mining - KDD
, pp. 210-219, 2007
Information-theoretic co-clustering
(
Citations: 342
)
Inderjit S. Dhillon
,
Subramanyam Mallela
,
Dharmendra S. Modha
Conference:
Knowledge Discovery and Data Mining - KDD
, pp. 89-98, 2003
Transfer learning for text classification
(
Citations: 26
)
Chuong Do
,
Andrew Y. Ng
Conference:
Neural Information Processing Systems - NIPS
, 2005
Order by:
Citations
(1)
Large-scale hierarchical text classification without labelled data
Viet Ha-Thuc
,
Jean-Michel Renders
Conference:
Web Search and Data Mining - WSDM
, pp. 685-694, 2011