Tools for text digitisation

More than 250 state-of-the-art tools for text digitisation.

61 results in Tools

ALTO-Edit

  • Description: ALTO Editor for text and segmentation
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: -
  • License: GPL
  • Language: Not applicable
  • Developer: -

Asprise

  • Description: Asprise OCR SDK library for Java enables you to equip your Java applications (Java applets, web applications, standard applications, J2EE enterprise applications) with optical character recognition (OCR) ability.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: Own license
  • Language: -
  • Developer: -

BIT-Alpha

  • Description: Small French company that offered trainable OCR based on neural networks, with support for Fraktur.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: Commercial
  • Language: German, French
  • Developer: -

Carleton OCR

  • Description: Code repository for the Carleton OCR comps project, 2010-2011
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: MIT
  • Language: -
  • Developer: -

ClaraOCR

  • Description: Clara OCR is an Optical Character Recognition program. It features both a powerful GUI for the X Window System and a Web interface. The Web interface is able to collect revision efforts from the Internet using a simple revision model. It is intended to be used in the cooperative optical recognition of old books. It tries to facilitate fine-tuning, so that a recognition project can invest resources in tuning the OCR to achieve better recognition results for one specific book and reduce the overall revision cost.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPL
  • Language: -
  • Developer: -

Expervision OpenRTK

  • Description: OpenRTK 7.0 (Open Recognition Toolkit) is a C/C++ toolkit that provides an innovative solution to application developers, system integrators, and OEM customers who need to integrate OCR capability into their applications with minimum engineering effort.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: OCR
  • License:
  • Language: -
  • Developer: ExperVision

Expervision OpenRTK - OCR

  • Description: OpenRTK 7.0 (Open Recognition Toolkit) is a C/C++ toolkit that provides an innovative solution to application developers, system integrators, and OEM customers who need to integrate OCR capability into their applications with minimum engineering effort.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: OCR
  • License: Commercial
  • Language: English, French, German, Italian, Spanish, Portuguese, Danish, Dutch, Swedish, Norwegian, Hungarian, Polish, Finnish
  • Developer: ExperVision

EyeOCR

  • Description: An OCR (Optical Character Recognition) application written in Java. Eye is easy and fun to use; no in-depth knowledge is required. Eye is known to work on Linux, Windows, and Mac OS X.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: Own license
  • Language: -
  • Developer: -

Franken+

  • Description: The Initiative for Digital Humanities, Media, and Culture (IDHMC) at Texas A&M University, as part of its Early Modern OCR Project (eMOP), has created a new tool called Franken+ that provides a way to create font training for the Tesseract OCR engine using page images. This is in contrast to Tesseract's documented method of font training, which involves using a word processing program with a modern font. Franken+ works in conjunction with PRImA's Aletheia tool and allows users to easily and quickly identify one or more idealized forms of each glyph found on a set of page images. These identified forms are then used to generate a set of Franken-page images matching the page characteristics documented in Tesseract's training instructions, but using a font used in an actual early modern printed document.
  • Group: Text Recognition
  • Type: Training
  • Subtype: -
  • License: Open source
  • Language: -
  • Developer: Bryan Tarpley

FromThePage

  • Description: FromThePage is free software that allows volunteers to transcribe handwritten documents online.
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: Transcription
  • License:
  • Language: -
  • Developer: Ben Brumfield

FromThePage - Text Recognition

  • Description: FromThePage is free software that allows volunteers to transcribe handwritten documents online.
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: Transcription
  • License: GNU AGPL v3
  • Language: -
  • Developer: Ben Brumfield

IBM Adaptive OCR Engine

  • Description: IBM Adaptive OCR is a comprehensive software system that significantly improves the recognition of historical texts by making adaptivity one of the main features of the text recognition process. It integrates several other tools, such as the image enhancement toolkit, the ABBYY FineReader Engine, the post-correction tool, and the lexical resources developed during the IMPACT project.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: Commercial
  • Language: English, Dutch, German
  • Developer: IBM Israel - Science and Technology Ltd
