Tools for text digitisation

More than 250 state-of-the-art tools for text digitisation.

62 results in

Tools

SimpleOCRSDK

  • Description: The SimpleOCR SDK is a fast, lightweight OCR engine designed to let developers add basic OCR functions to an application with minimal cost and none of the drawbacks of open source solutions.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: own license
  • Language: English, French
  • Developer: -

SmartScore

  • Description: Music OCR: Recognizes scores without any restriction on the number of parts. Processes band arrangements, operas, hymns, musicals, instrumental and solo parts, as well as full conductor’s scores.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: commercial
  • Language: -
  • Developer: MUSITEK

T-PEN

  • Description: T-PEN is a web-based tool for working with images of manuscripts. Users attach transcription data (new or uploaded) to the actual lines of the original manuscript in a simple, flexible interface.
  • Group: Text Recognition
  • Type: -
  • Subtype: -
  • License: ECL
  • Language: -
  • Developer: Saint Louis University

Text and Error Profiler

  • Description: The Text and Error Profiler is software that analyses the OCR output from historical documents, using statistical modelling of document characteristics to improve OCR accuracy. It works by attuning itself to a particular document rather than to common traits of printed documents from a certain era, resulting in a highly adaptive process. The tool uses its document-specific knowledge to allow the batch processing of erroneous words.
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: -
  • License: Licence pending. For further information please contact the IMPACT Centre of Competence
  • Language: Language-independent
  • Developer: Centrum für Informations- und Sprachverarbeitung (CIS), University of Munich

Transcript

  • Description: Transcript is a desktop-based manuscript transcription tool that supports word-processor style formatting.
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: -
  • License: free or 15 EUR
  • Language: -
  • Developer: Jacob Boerema

TypeReader

  • Description: TypeReader® has been on the global market since 1991 and has received hundreds of appraisals from industry technology magazines. At the heart of this award-winning OCR product is ExperVision®’s OpenRTK®, the only OCR engine to have won the UNLV test in consecutive years. Commercial (server/desktop).
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: Own license
  • Language: -
  • Developer: -

TypeWright

  • Description: TypeWright is a tool for correcting the text version of a document made up of page images.
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: -
  • License: ASL 2.0
  • Language: English
  • Developer: -

Virtual Transcription Laboratory

  • Description: Virtual Transcription Laboratory is a Virtual Research Environment that works as a crowdsourcing platform for developing high-quality textual representations of digital documents. It gives access to an online OCR service and an easy-to-use transcription editor. Images can be imported from various sources, including direct import from digital libraries.
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: -
  • License: free
  • Language: -
  • Developer: Poznań Supercomputing and Networking Center

WeOCR

  • Description: WeOCR is a platform for Web-enabled OCR (Optical Character Reader/Recognition) systems. It enables people to use character recognition over networks. A WeOCR server receives document images from users, recognizes text in the images, and returns the recognition results to the users. WeOCR does not have its own character recognition engine. Instead, it is intended to accommodate various existing character recognition engines.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: Web service
  • License: ASL 2.0
  • Language: -
  • Developer: -

Word Spotting

  • Description: This tool provides an integrated GUI for indexing historical documents without an OCR engine. It works by segmenting documents into individual words and compiling a list of the most common words (keywords) in the text. Users are then asked to classify the keywords
  • Group: Text Recognition
  • Type: -
  • Subtype: -
  • License: commercial
  • Language: Not applicable
  • Developer: National Center for Scientific Research (NCSR) "Demokritos"

Wordsnap OCR

  • Description: An app for OCR-based camera input on Android.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPLv3
  • Language: -
  • Developer: -

ABBYY FineReader Engine 10

  • Description: State-of-the-art OCR engine.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: ner
  • License: -
  • Language: n/a
  • Developer: ABBYY

