The 2-day CIS OCR Workshop on “OCR and postcorrection of early printings for digital humanities” was originally held at LMU Munich on 14/15 September 2015 (see http://www.cis.lmu.de/ocrworkshop).
GT4HistOCR: Ground Truth for training OCR engines on historical documents in German Fraktur and Early Modern Latin
GT4HistOCR contains ground truth for research in Optical Character Recognition (OCR) technology applied to historical printings in German Fraktur and Early Modern Latin.
HDLAC2011
Example and evaluation dataset used for the ICDAR2011 Historical Document Layout Analysis Competition.
HBR2013
Example and evaluation dataset used for the ICDAR2013 Competition on Historical Book Recognition.
HNLA2013
Example and evaluation dataset used for the ICDAR2013 Competition on Historical Newspaper Layout Analysis.
REID2017
Example and evaluation dataset used for the ICDAR2017 Competition on Recognition of Early Indian Printed Documents.
RDCL2017
Example and evaluation dataset used for the ICDAR2017 Competition on Recognition of Documents with Complex Layouts.
RASM2018
Example and evaluation dataset used for the ICFHR2018 Competition on Recognition of Historical Arabic Scientific Manuscripts.
Census 1961 Project Dataset
Images containing tables from the 1961 Census for England and Wales.
Europeana Newspapers
This online repository is the main point of reference for all activities related to evaluation within the scope of the Europeana Newspapers project.