Tools for text digitisation

More than
250
state-of-the-art tools for text digitisation.

283 results

Tools

Dates Recognizer

  • Description:Accepts a string thought to contain a date (or a date range or a period) and parses it returning a date range.
  • Group: Text Processing
  • Type: -
  • Subtype: dates recognizer
  • License: free service
  • Language: -
  • Developer: N/A

Dictionary Attestation Tool

  • Description:This tool is meant to be used for manual correction of large quantities of automatically matched occurrences of a headword in the quotations of the particular article in a comprehensive dictionary.
  • Group: text processing
  • Type: nlp tools
  • Subtype: annotation tool
  • License:
  • Language: n/a
  • Developer: IVdNT

DigitLab

  • Description:DigitLab (http://digitlab.psnc.pl) is an especially adapted operating system based on Linux Ubuntu. The main aim of its creation was to create a complete system which can be used for collections digitisation with the usage of free and widely available tools. DigitLab is a perfect solution for both everyday work and hands-on trainings. It allows to work with images textual content (OCR included) and audio-visual collections. Gives access to three example digital libraries based on DSpace dLibra and Greenstone.
  • Group: Miscellaneous Utilities
  • Type: -
  • Subtype: toolset helping with digitisation activities
  • License: free
  • Language: -
  • Developer: Poznań Supercomputing and Networking Center

DjVu tools

  • Description:Suit of open source tools and utilities related to the DjVu format
  • Group: Miscellaneous Utilities
  • Type: -
  • Subtype: DjVu toolset
  • License: unknown
  • Language: -
  • Developer: Warsaw University

Document layout analysis tools

  • Description:Intented to be used in Mapa76 processing pipeline for detecting the clusters of text in a PDF file to correctly perform NE dectection to the body of text excluding other unrelated text lines (like page numbers titles footnotes etc)
  • Group: Layout Analysis
  • Type: -
  • Subtype:
  • License: unknown
  • Language: -
  • Developer: Damián Silvani

Exiftool

  • Description:ExifTool is a free software program for reading, writing, and manipulating image, audio, and video metadata. It is platform independent, available as both a Perl library (Image::ExifTool) and command-line application. ExifTool is commonly incorporated into different types of digital workflows and supports many types of metadata including Exif, IPTC, XMP, JFIF, GeoTIFF, ICC Profile, Photoshop IRB, FlashPix, AFCP and ID3, as well as the manufacturer-specific metadata formats of many digital cameras.
  • Group: metadata processing
  • Type: images
  • Subtype: 0
  • License:
  • Language: n/a
  • Developer: Phil Harvey

Expervision OpenRTK

  • Description:OpenRTK 70 Open Recognition Toolkit is a CC toolkit that provides an innovative solution to application developers system integrators and OEM customers who need to integrate OCR capability into their applications with minimum engineering efforts
  • Group: text recognition
  • Type: Core Text Recognition
  • Subtype: OCR
  • License:
  • Language: 0
  • Developer: ExperVision

Expervision OpenRTK - OCR

  • Description:OpenRTK 7.0 (Open Recognition Toolkit) is a C/C++ toolkit that provides an innovative solution to application developers system integrators and OEM customers who need to integrate OCR capability into their applications with minimum engineering efforts.
  • Group: Text recognition
  • Type: Core Text Recognition
  • Subtype: OCR
  • License: Commercial
  • Language: English French German Italian Spanish Portuguese Danish Dutch Swedish Norwegian Hungarian Polish Finnish
  • Developer: ExperVision

Expervision WebOCR

  • Description:In 1999 Expervision released WebOCR Online OCR 10 providing her users with flexible and easy modes of OCR application WebOCR OnlineOCR20 updated later is able to provide 4 kinds of Web OCR Online OCR application modes based on different business environment and processing requirements of her users
  • Group: text processing
  • Type: Core Text Recognition
  • Subtype: Web service
  • License:
  • Language: 0
  • Developer: Expervision

Expervision WebOCR - Web service

  • Description:In 1999 Expervision released WebOCR (Online OCR) 1.0 providing her users with flexible and easy modes of OCR application. WebOCR (OnlineOCR)2.0 updated later is able to provide 4 kinds of Web OCR (Online OCR) application modes based on different business environment and processing requirements of her users.
  • Group: Text Processing
  • Type: Core Text Recognition
  • Subtype: Web service
  • License: Own license
  • Language: -
  • Developer: -

EyeOCR

  • Description:An OCR (Optical Character Recognition) application written in Java. Eye is easy and fun to use - no in-depth knowledge required. Eye is known to work on Linux Windows and Mac OS X.
  • Group: Text Recognition
  • Type: Core Text Recognition
  • Subtype: -
  • License: Own license
  • Language: -
  • Developer: -

FM-SBLEX

  • Description:FM-SBLEX consists of three computational morphology tools for modern Swedish (SALDO) for 19th century Swedish (Dalin) and for Old Swedish. FM-SBLEX has been developed using the Functional Morphology library.
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: Morphological Analysis
  • License: GPL3
  • Language: 1
  • Developer: sb-info@svenska.gu.se


Would you like to add any tool?

Registered users can add new tools through a simple form login or register.

Search or filter tools

Group:

Type:

Subtype:

In demonstrator platform: