University of Vienna PHAIDRA, object o:293845
Title
Challenges in Accessing Information in Digitized 19th-Century Czech Texts
Subtitle (en)
Paper - iPRES 2012 - Digital Curation Institute, iSchool, Toronto
Language
English
Description (en)
This short paper describes problems that arise in optical character recognition of, and information retrieval from, historical texts in languages with rich morphology, rather discontinuous lexical development, and a long history of spelling reforms. The problems and the proposed linguistic solutions are presented as work in progress, using the example of a current project focused on improving access to digitized Czech prints from the 19th century and the first half of the 20th century.
Keywords (en)
iPRES, iSchool, Toronto, Canada, Information Retrieval, Known-Item Retrieval, Historical Text, Lemma, Hyperlemma
Author of the digital object
Karel Kucera
Martin Stluka
Format
application/pdf
Size
685.3 kB
Licence Selected
CC BY-NC-SA 3.0 AT
Conferences
Conference 2012
Name of Publication (en)
"iPres 2012 - Proceedings of the 9th International Conference on Preservation of Digital Objects." Editors: Reagan Moore, Kevin Ashley, Seamus Ross
From Page
226
To Page
229
Name of Collection/Monograph (en)
"iPres 2012 - Proceedings of the 9th International Conference on Preservation of Digital Objects." Editors: Reagan Moore, Kevin Ashley, Seamus Ross
Publishing Address
140 St. George Street, Toronto, ON M5S 3G6
Publisher
Digital Curation Institute, iSchool, University of Toronto
Publication Date
2012-11-01
Link to bibliographic information
https://ipres.ischool.utoronto.ca/sites/ipres.ischool.utoronto.ca/files/iPres%202012%20Conference%20Proceedings%20Final.pdf
Object type
PDFDocument
Created
15.06.2013 08:55:03