Tools for text digitisation

More than
250
state-of-the-art tools for text digitisation.

62 results in

Tools

IBM Adaptive OCR Engine

  • Description:IBM Adaptive OCR is a comprehensive software system which improves the recognition of historical texts significantly by applying adaptivity as one of the main features to the text recognition process. It integrates several other tools such as the image enhancement toolkit the ABBYY FineReader Engine the post correction tool and the lexical resources developed during the IMPACT project.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: commercial
  • Language: English Dutch German
  • Developer: IBM Israel - Science and Technology Ltd

Inventory Extraction

  • Description:Allows for the extraction of a complete list of characters from a document without reference to a specific language dictionary or a library of fonts.
  • Group: Text Recognition
  • Type: -
  • Subtype:
  • License: ASL 2.0
  • Language: Not applicable
  • Developer: University of Innsbruck

JavaOCR

  • Description:This OCR engine is implemented as a Java library along with a demo application which shows the library in action. The core concept at the character level is image matching with automatic position and aspect ratio correction using a least-square-error matching algorithm. It is a very simple yet reasonably effective implementation.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: BSD
  • Language: -
  • Developer: -

Kognition

  • Description:An omnifont OCR software for KDE. Due to the fact that each step of the OCR process can be visualized you can get a quick idea of how OCR works and where the problems lie. However the program may be of minor/no use for end users in its current state.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPLv2
  • Language: -
  • Developer: -

Lios

  • Description:Lios is a free and open source software for converting print into text using either scanner or a camera. It can also produce text out of scanned images from other sources such as pdfs images or folders containing images.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPLv3
  • Language: Bulgarian Croatian Czech Danish Dutch English Estonian French German Hungarian Italian Latvian Lithuanian Polish Portuguese Romanian Russian Russian-English bilingual Serbian Slovene Spanish Swedish Turkish and Ukrainian.
  • Developer: -

Longan

  • Description:A flexible pure-Java OCR implementation. The aim of this project is to write a reasonably (competent modular understandable) OCR system.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: ASL 2.0
  • Language: -
  • Developer: -

NeuroOCR

  • Description:Demo neural network OCR
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPLv3
  • Language: -
  • Developer: -

NewOCR

  • Description:NewOCR.com is a free online OCR service based on Tesseract. It can analyze the text in any image file that you upload and then convert the text from the image into text that you can easily edit on your computer
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: Own license
  • Language: Same as Tesseract 3 see also website
  • Developer: -

OCR gem

  • Description:Recognize text and characters from image files using web services.
  • Group: text recognition
  • Type: core text recognition
  • Subtype:
  • License: MIT
  • Language:
  • Developer: -

OCRFeeder

  • Description:OCRFeeder is a document layout analysis and optical character recognition system
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPL
  • Language: -
  • Developer: The GNOME Project

OCRchie

  • Description:The original OCR package could learn from a tif file and ascii translation then recognize a document in the same font. This semester we added interactive learning interactive segmentation of mathematics page zoning (the ability to automatically or manually zone columns or regions of text and interactive read-order specification.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: unknown
  • Language: -
  • Developer: -

OmniPage

  • Description:State-of-the-art OCR engine
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: commercial
  • Language: 123 languages
  • Developer: Nuance


Would you like to add any tool?

Registered users can add new tools through a simple form login or register.

Search or filter tools

Group:

Type:

Subtype:

In demonstrator platform: