IMPACT Language Resources

Impact Centre of Competence

A collection of historical and named-entity lexica for Bulgarian, Czech, Dutch, English, French, German, Polish, Slovene, Spanish and Latin.

Natural History Museum Lepidoptera

Impact Centre of Competence

This dataset contains contains scans of index cards from the UK’s Natural History Museum lepidoptera index

Layout Analysis Dataset

Impact Centre of Competence

This dataset has been created primarily for the evaluation of layout analysis (physical and logical) methods.

Census 1961 Project Dataset

Impact Centre of Competence

Images containing tables from the 1961 Census for England and Wales.

RDCL2017

Impact Centre of Competence

Example and evaluation dataset used for the ICDAR2017 Competition on Recognition of Documents with Complex Layouts

RASM2018

Impact Centre of Competence

Example and evaluation dataset used for the ICFHR2018 Competition on Recognition of Historical Arabic Scientific Manuscripts.