Tools for text digitisation

More than
250
state-of-the-art tools for text digitisation.

62 results in

Tools

collaborative correction platform (concert)

  • Description:A web-based platform suitable for massive volunteer participation which validates and corrects OCR results
  • Group: text recognition
  • Type: postcorrection
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: ibm israel - science and technology ltd

cs499ocr

  • Description:Performs OCR with image processing and statistical pattern recognition.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPL
  • Language: -
  • Developer: -

cuneiform

  • Description:Cuneiform is an OCR system. In addition to text recognition it also does layout analysis and text format recognition. Cuneiform supports several languages.
  • Group: text recognition
  • Type: core text recognition
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: -

cutouts

  • Description:Cutouts is a web application which allows to crowdsource preparation of training data for Tesseract OCR engine.
  • Group: text recognition
  • Type: postcorrection
  • Subtype: utilities for training and customization
  • License:
  • Language: n/a
  • Developer: poznań supercomputing and networking center

gamera ocr

  • Description:OCR toolkit for Gamera: This is a Gamera toolkit for building standard text recognition applications. It is based on the Gamera framework and requires a working Gamera installation.
  • Group: text recognition
  • Type: core text recognition
  • Subtype: framework
  • License:
  • Language: n/a
  • Developer: -

gocr

  • Description:GOCR is an OCR (Optical Character Recognition) program developed under the GNU Public License. It converts scanned images of text back to text files.
  • Group: text recognition
  • Type: core text recognition
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: -

hOCR

  • Description:HOCR is a Hebrew optical character recognition library.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPLv3
  • Language: -
  • Developer: -

korrektor

  • Description:GUI-based software for viewing and correcting document analysis results
  • Group: text recognition
  • Type: postcorrection
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: fraunhofer iais
  • Wiki

ocrad

  • Description:GNU Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method. It reads images in pbm (bitmap) pgm (greyscale) or ppm (color) formats and produces text in byte (8-bit) or UTF-8 formats. Also includes a layout analyser able to separate the columns or blocks of text normally found on printed pages. Ocrad can be used as a stand-alone console application or as a backend to other programs.
  • Group: text recognition
  • Type: core text recognition
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: -

ocre

  • Description:Spanish OCR prototype
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: unknown
  • Language: English Euskara/Basque French German Polish Português Russian Spanish
  • Developer: -

ocropus

  • Description:OCRopus is an OCR system focusing on the use of large scale machine learning for addressing problems in document analysis
  • Group: text recognition
  • Type: core text recognition
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: ocropus project

post correction tool

  • Description:Interactive post-correction of OCRed documents
  • Group: text recognition
  • Type: postcorrection
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: centrum für informations und sprachverarbeitung (cis) university of munich


Would you like to add any tool?

Registered users can add new tools through a simple form login or register.

Search or filter tools

Group:

Type:

Subtype:

In demonstrator platform: