Keywords (6)
Cross Entropy
Model Adaptation
Prior Knowledge
Word Frequency
Word Recognition
Character Error Rate
Incorporating linguistic post-processing into whole-book recognition
(Citations: 1)
Pingping Xiu, Henry S. Baird
We describe a technique for linguistic post-processing of whole-book recognition results. Whole-book recognition improves the recognition of book images using fully automatic, cross-entropy-based model adaptation. In previously published work, word recognition was performed on each word separately, without taking into account passage-level information such as word-occurrence frequencies. As a result, some words that are rare in real text may appear much more often in recognition results, and vice versa. Differences between word frequencies in recognition results and those in prior knowledge may therefore indicate recognition errors over a long passage. In this paper, we propose a post-processing technique that enhances whole-book recognition results by minimizing the differences between word frequencies in the recognition results and prior word frequencies. The technique works better on longer passages, and in a 90-page experiment it drives the character error rate down by about 20%, from 1.24% to 0.98%.
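To make the idea of comparing word frequencies concrete, here is a minimal sketch of how a mismatch between recognized and prior word frequencies might be scored. It is not the authors' algorithm: the abstract does not say which difference measure is minimized, so the Kullback-Leibler divergence used here is an assumption, and the function names, the `floor` smoothing parameter, and the toy prior are all hypothetical.

```python
import math
from collections import Counter


def word_frequencies(words):
    """Relative frequency of each word in a recognized passage."""
    counts = Counter(words)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def divergence_terms(recognized_words, prior_freq, floor=1e-6):
    """Per-word terms of KL(observed || prior).

    Assumption: KL divergence stands in for the unspecified
    "difference between word frequencies". Words missing from the
    prior are given the small probability `floor` (hypothetical).
    """
    observed = word_frequencies(recognized_words)
    return {
        word: p * math.log(p / max(prior_freq.get(word, 0.0), floor))
        for word, p in observed.items()
    }


def frequency_divergence(recognized_words, prior_freq, floor=1e-6):
    """Total mismatch between observed and prior word frequencies."""
    return sum(divergence_terms(recognized_words, prior_freq, floor).values())


def most_overrepresented(recognized_words, prior_freq, top=3, floor=1e-6):
    """Words contributing most to the mismatch: candidate recognition errors."""
    terms = divergence_terms(recognized_words, prior_freq, floor)
    return sorted(terms, key=terms.get, reverse=True)[:top]


if __name__ == "__main__":
    # Toy prior (hypothetical): "cot" is rare in real text.
    prior = {"the": 0.5, "cat": 0.25, "sat": 0.24, "cot": 0.01}
    clean = ["the", "cat", "sat", "the"]
    noisy = ["the", "cot", "sat", "the"]  # "cat" misrecognized as "cot"
    print(frequency_divergence(clean, prior))
    print(frequency_divergence(noisy, prior))
    print(most_overrepresented(noisy, prior, top=1))
```

In this toy case the passage containing the misrecognized rare word scores a larger divergence than the clean one, and the rare word is ranked first as a candidate error. A post-processor in the spirit of the abstract would then search for alternative word hypotheses that reduce this score. Longer passages give more reliable frequency estimates, which is consistent with the abstract's remark that the technique works better on longer passages.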
Conference: Document Recognition and Retrieval - DRR, vol. 7534, pp. 1-10, 2010
DOI: 10.1117/12.839099
Citation Context (1)
...The earliest version of this strategy was described in [11], increasingly large-scale experiments in [10] and [12], and algorithm refinements in [13]...
Pingping Xiu, et al. Analysis of whole-book recognition
References (13)
Recent Work in the Document Image Decoding Group at Xerox PARC (Citations: 5)
Thomas M. Breuel, Kris Popat
Published in 2001.
A Survey of Methods and Strategies in Character Segmentation (Citations: 330)
Richard G. Casey, Eric Lecolinet
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence - PAMI, vol. 18, no. 7, pp. 690-706, 1996
Pattern Classification (2nd ed.) (Citations: 2201)
Richard O. Duda, Peter E. Hart, David G. Stork
Published in 2000.
Degraded Text Recognition Using Visual And Linguistic Context (Citations: 24)
Tao Hong
Published in 1995.
Document Image Decoding Using Markov Source Models (Citations: 125)
Gary E. Kopec, Philip A. Chou
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence - PAMI, vol. 16, no. 6, pp. 602-617, 1994
Citations (1)
Analysis of whole-book recognition (Citations: 2)
Pingping Xiu, Henry S. Baird
Conference: Document Analysis Systems - DAS, pp. 199-206, 2010