Tools for text digitisation

More than
250
state-of-the-art tools for text digitisation.

62 results in

Tools

Paradiit

  • Description:The PaRADIIT (Pattern Redundancy Analysis for Document Image Indexing and Transcription) project is a research project conducted by the RFAI Team of the Computer Science Laboratory of Tours. The project focused on layout analysis text/graphics separation Optical Character Recognition (OCR) and text transcription processes dedicated to old books and historical documents. Additions: This is very much like the IBM concert tool also has ideas related to the inventory extraction! It consists of two processing steps: AGORA which extracts clusters of characters and RETRO which presents something like IBM's carpets.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: Framework
  • License: GPL
  • Language: -
  • Developer: -

Photoscore

  • Description:Music OCR: music scanning & PDF to notation
  • Group: text recognition
  • Type: core text recognition
  • Subtype:
  • License: commercial
  • Language:
  • Developer: Neuratron

Plasma OCR

  • Description:An omnifont OCR engine. The long-term goal is recognition of formulas.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: GPL
  • Language: -
  • Developer: -

PrimeOCR

  • Description:Prime Recognition's production OCR product PrimeOCR is a Windows OCR engine that claims to reduce OCR error rates by up to 65-80% over conventional OCR by implementing "Voting" OCR technology.
  • Group: Text recognition
  • Type: Core Text Recognition
  • Subtype: OCR
  • License: Commercial
  • Language: Danish English German Norwegian Spanish Dutch French Italian Portuguese Swedish
  • Developer: PrimeRecognition

Proofread page

  • Description:Proofread Page is an extension for MediaWiki which allows you to edit transcriptions side by side with the page images. It is used on WikiSource for manuscript and early print transcription projects. Proofread Page supports workflow but no markup.
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: -
  • License: GPL v2
  • Language: -
  • Developer: ThomasV (original author)''Tpt (current maintainer)

ReadIris

  • Description:Readiris is a OCR solution designed for private users and small to large office users
  • Group: Text recognition
  • Type: Core Text Recognition
  • Subtype: OCR
  • License: Commercial
  • Language: 140 languages
  • Developer: IRIS

Rescribe OCR

  • Description:Rescribe\\\'s open source Latin OCR software is based on Google\\\'s Tesseract and has been developed particularly for text recognition of historic Latin printed texts. Detailed instructions and additional helpful open source tools for Windows, Linux and OSX can be found on latinocr.org
  • Group: text recognition
  • Type: ocr (text)
  • Subtype:
  • License:
  • Language: latin
  • Developer: nick white, antonia karaisl

Rescribe OCR

  • Description:Rescribe\'s open source Latin OCR software is based on Google\'s Tesseract and has been developed particularly for text recognition of historic Latin printed texts. Detailed instructions and additional helpful open source tools for Windows, Linux and OSX can be found on latinocr.org
  • Group: text recognition
  • Type: ocr (text)
  • Subtype:
  • License:
  • Language: latin
  • Developer: nick white, antonia karaisl

Schnell OCR

  • Description:A lightweight ocr module written in C
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: unknown
  • Language: -
  • Developer: -

Scripto

  • Description:A free open source tool enabling community transcriptions of document and multimedia files
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: Transcription
  • License: GPLv3
  • Language: -
  • Developer: Roy Rosenzweig Center for History and New Media

SharpEye 2

  • Description:Music OCR: You can use SharpEye to scan and convert printed sheet music into a music notation file or a MIDI file which can then be imported into a music notation program or MIDI sequencer
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: commercial $169
  • Language: -
  • Developer: Graham Jones

SimpleOCR

  • Description:SimpleOCR is the popular freeware OCR software with hundreds of thousands of users worldwide. SimpleOCR is also a royalty-free OCR SDK for developers to use in their custom applications.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: Own license
  • Language: English French
  • Developer: -


Would you like to add any tool?

Registered users can add new tools through a simple form login or register.

Search or filter tools

Group:

Type:

Subtype:

In demonstrator platform: