Tools for text digitisation

More than 250 state-of-the-art tools for text digitisation.

287 results

Tools

Rescribe OCR

  • Description: Rescribe's open source Latin OCR software is based on Google's Tesseract and has been developed particularly for text recognition of historic Latin printed texts. Detailed instructions and additional helpful open source tools for Windows, Linux and OSX can be found on latinocr.org
  • Group: text recognition
  • Type: ocr (text)
  • Subtype:
  • License:
  • Language: Latin
  • Developer: Nick White, Antonia Karaisl

Rosette

  • Description: Automatically detects the language of any digital text. Rosette® Language Identifier analyzes text, identifying the language and the character encoding scheme. Detecting the language of documents is a critical first step in any process that handles multilingual text. Our software recognizes 55 languages and 45 encodings and processes files extremely quickly and accurately.
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: Language Identification
  • License: commercial
  • Language: 55
  • Developer: http://www.basistech.com/

Rosette Base Linguistics

  • Description: Sophisticated morphological analysis, segmentation and tagging of Arabic, Asian and European language text
  • Group: Text processing
  • Type: NLP Tools
  • Subtype: Tokenizer
  • License: Commercial
  • Language: 40
  • Developer: http://www.basistech.com/

Rosette Entity Extractor (REX)

  • Description: Identify names, places, organizations and other entities in your text
  • Group: Text processing
  • Type: NLP Tools
  • Subtype: NER
  • License: Commercial
  • Language: 17
  • Developer: http://www.basistech.com/

Rosette Linguistic Platform

  • Description: Comprehensive linguistic analysis of unstructured text in Asian, European and Middle Eastern languages for enhancing information retrieval, text mining and other applications
  • Group: text processing
  • Type: NLP Tools
  • Subtype: NLP toolset and resources
  • License:
  • Language: 0
  • Developer: http://www.basistech.com/

Rosette Linguistic Platform - Language Identification

  • Description: Rosette® Language Identifier analyzes text, identifying the language and the character encoding scheme. Detecting the language of documents is a critical first step in any process that handles multilingual text. Our software recognizes 55 languages and 45 encodings and processes files extremely quickly and accurately.
  • Group: Text processing
  • Type: NLP Tools
  • Subtype: Language Identification
  • License: Commercial
  • Language: 55
  • Developer: http://www.basistech.com/

Rosette Linguistic Platform - NLP toolset and resources

  • Description: Comprehensive linguistic analysis of unstructured text in Asian, European and Middle Eastern languages for enhancing information retrieval, text mining and other applications.
  • Group: Text processing
  • Type: NLP Tools
  • Subtype: NLP toolset and resources
  • License: Commercial
  • Language: -
  • Developer: http://www.basistech.com/

Scan Tailor

  • Description: Scan Tailor is an interactive post-processing tool for scanned pages. It performs operations such as page splitting, deskewing, adding/removing borders, and others.
  • Group: Image Processing
  • Type: Image Processing and Enhancement
  • Subtype: -
  • License: GPL v3
  • Language: -
  • Developer: Joseph Artsimovich

Schnell OCR

  • Description: A lightweight OCR module written in C
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: unknown
  • Language: -
  • Developer: -

Scribe

  • Description: Scribe is a framework for generating crowd-sourced transcriptions of image-based documents. It provides a system for generating templates which, combined with a magnification tool, guide a user through the process of transcribing an asset (an image).
  • Group: Miscellaneous Utilities
  • Type: -
  • Subtype: Transcription
  • License: ASL 2.0
  • Language: -
  • Developer: Zooniverse

Scripto

  • Description: A free, open source tool enabling community transcriptions of document and multimedia files
  • Group: Text Recognition
  • Type: Postcorrection
  • Subtype: Transcription
  • License: GPLv3
  • Language: -
  • Developer: Roy Rosenzweig Center for History and New Media

SharpEye 2

  • Description: Music OCR: You can use SharpEye to scan and convert printed sheet music into a music notation file or a MIDI file, which can then be imported into a music notation program or MIDI sequencer
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: commercial $169
  • Language: -
  • Developer: Graham Jones

