IMPACT Language Resources

Impact Centre of Competence


Description
Access a remarkable collection of historical and named entity lexica developed within the IMPACT project, covering Bulgarian, Czech, Dutch, English, French, German, Polish, Slovene, Spanish and Latin.
Dataset content type

Dataset scope
Books
Historical documents
Newspapers
Other
Postcorrection
Text recognition
HTR
OCR
Typewritten recognition
Language
Bulgarian
Czech
Dutch
English
French
German
Polish
Slovene
Spanish
Latin
Size
Historical and/or named entity lexica for 10 languages and 2 historical corpora
Dataset License
Various licenses
Dataset owner
Several owners, depending on the dataset
Dataset distributor
IMPACT Centre of Competence
Link
https://www.digitisation.eu/knowledge/language-resources/
Contact
info@digitisation.eu