Sign in
Author
|
Conference
|
Journal
|
Organization
|
Year
|
DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all fields of study
Limit my searches in the following fields of study
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(5)
Error Analysis
Machine Learning
part-of-speech tagging
Semi Supervised Learning
Part of Speech
Subscribe
Academic
Publications
Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics?
Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics?,10.1007/978-3-642-19400-9_14,Christopher D. Manning
Edit
Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics?
BibTex
|
RIS
|
RefWorks
Download
Christopher D. Manning
I examine what would be necessary to move
part-of-speech tagging
performance from its current level of about 97.3% token accuracy (56% sentence accuracy) to close to 100% accuracy. I suggest that it must still be possible to greatly increase tagging performance and examine some useful improvements that have recently been made to the Stanford Part-of-Speech Tagger. However, an
error analysis
of some of the remaining errors suggests that there is limited further mileage to be had either from better
machine learning
or better features in a discriminative sequence classifier. The prospects for further gains from semi-supervised learning also seem quite limited. Rather, I suggest and begin to demonstrate that the largest opportunity for further progress comes from improving the taxonomic basis of the linguistic resources from which taggers are trained. That is, from improved descriptive linguistics. However, I conclude by suggesting that there are also limits to this process. The status of some words may not be able to be adequately captured by assigning them to one of a small number of categories. While conventions can be used in such cases to improve tagging consistency, they lack a strong linguistic basis.
Conference:
Conference on Intelligent Text Processing and Computational Linguistics - CICLing
, pp. 171-189, 2011
DOI:
10.1007/978-3-642-19400-9_14
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
www.springerlink.com
)
(
www.informatik.uni-trier.de
)
(
dx.doi.org
)