Resources for Turkish morphological processing

Resources for Turkish morphological processing,10.1007/s10579-010-9128-6,Language Resources and Evaluation,Hasim SakTunga GungorMurat Saraclar,Tunga G

Resources for Turkish morphological processing   (Citations: 1)
BibTex | RIS | RefWorks Download
We present a set of language resources and tools - a morphological parser, a morphological disambiguator, and a text corpus - for exploiting Turkish morphology in natural language processing applications. The morphological parser is a state-of-the-art finite-state transducer-based implementation of Turkish morphology. The disambigua- tor is based on the averaged perceptron algorithm and has the best accuracy reported for Turkish in the literature. The text corpus has been compiled from the web and contains about 500 million tokens. This is the largest Turkish web corpus published.
Journal: Language Resources and Evaluation - LANG RESOUR EVAL , vol. 45, no. 2, pp. 249-261, 2011
Cumulative Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
    • ...The lexical transducer of the morphological parser maps the letter sequences to lexical morphemes annotated with morphological features [47]...
    • ...The text corpora that we used for estimating the parameters of statistical language models are composed of 182.3 million-words BOUN NewsCor corpus collected from news portals in Turkish [47] and 1.3 million-words text corpus (BN Corpus) obtained from the transcriptions of the Turkish Broadcast News speech database [6]...

    Marek Hrúzet al. Automatic fingersign-to-speech translation system

Sort by: