A Comprehensive Isolated Farsi/Arabic Character Database for Handwritten OCR Research

Saeed Mozaffari, Karim Faez, Farhad Faradji, Majid Ziaratban, S. Moh

Citations: 8
This paper presents a new comprehensive database of isolated offline handwritten Farsi/Arabic numerals and characters for use in optical character recognition research. The database is freely available for academic use; until now, no such freely available database has existed for the Farsi language. It includes grayscale images of 52,380 characters and 17,740 numerals. Each image was scanned at 300 dpi from Iranian school entrance exam forms during the years 2004-2006. The only restriction imposed on the writers was to write each character within a rectangular box. The number of samples per class is non-uniform, reflecting the real-life distribution of the characters. For comparison purposes, each dataset has been divided into training and test sets. To validate the effectiveness of a proposed system for Farsi (Arabic) OCR, it is necessary to compare it with other approaches. Such a comparison is now possible by implementing the competing approaches and applying them, together with the proposed method, to the same database. A standard database is therefore needed in the field of Farsi (Arabic) OCR to facilitate research.
    • ...In this regard, we have utilized a standard Persian handwritten character dataset [16] and we have extracted two separate feature sets (as explained later) based on the directional chain code information of the skeleton as well as the contour points of the input image...
    • ...IFHCDB database [16], which includes 36682 isolated characters for training and 15338 isolated characters to test...
    • ...The database is not a uniform dataset and hence number of samples in each class is different in both training as well as testing parts [16]...
    • ...Likewise, we have tested the system on the same dataset [16] using different classifiers (NN, 3-NN and 5-NN) and different...
    • ...Based on another experiment on the Persian handwritten character dataset introduced in [16], recognition accuracy of 98.10% is obtained when 32 Persian handwritten isolated characters are categorized into 8 groups (8-class Persian handwritten character recognition problem)...
    • ...The results of our proposed method and the system presented in [5] while using the same dataset [16] for experiment are tabulated in Table VII...
    • ...The proposed scheme has provided 98.10% accuracy but the method due to Dehghan and Faez [5] has given only 81.47% accuracy when the same dataset [16] is used for experiment...

    Alireza Alaei et al. A New Two-Stage Scheme for the Recognition of Persian Handwritten Char...

    • ...Arabic numerals, particularly, Bangla and Farsi numeral recognition because some public databases are available [7, 8, 9]. Though some research works have contributed to Bangla numeral recognition [10, 11, 12] and Farsi numeral recognition [13], they rarely used common sample databases, and some of the reported accuracies are not very high...
    • ...We evaluate our recognition methods on three databases: ISI Bangla numerals [7], CENPARMI Farsi numerals [8], and IFHCDB Farsi numerals [9]...

    Cheng-lin Liu et al. A new benchmark on the recognition of handwritten Bangla and Farsi num...

    • ...It consists of 52,380 grayscale images of characters and 17,740 digits [8]...

    Puntis Jifroodian Haghighi et al. A New Large-Scale Multi-purpose Handwritten Farsi Database

    • ...Until recently, there was no standard Farsi database available for researchers; however, very recently two standard databases have been developed for research on Farsi off-line handwritten recognition in [15] and [16]...

    Javad Sadri et al. State-of-the-art in Farsi script recognition

    • ...In addition to these data sets, some free databases were also presented for Farsi digits and characters [17, 24, 11]...

    Saeed Mozaffari et al. IfN/Farsi-Database: A Database of Farsi Handwritten City Names
