Tools for text digitisation

More than 250 state-of-the-art tools for text digitisation.

35 results in

Tools

Alchemy API NER

  • Description:AlchemyAPI provides the world's most popular natural language processing service via an easy-to-use SaaS API. Integrate advanced text mining and analytics functionality into your application, service, or data-processing pipeline.
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: NER
  • License:
  • Language: null
  • Developer: http://www.alchemyapi.com/

Apache OpenNLP - NER

  • Description:The Name Finder can detect named entities and numbers in text. To detect entities, the Name Finder needs a model. The model depends on the language and the entity type it was trained for. The OpenNLP project offers a number of pre-trained name finder models, which are trained on various freely available corpora. They can be downloaded from the project's model download page. To find names in raw text, the text must first be segmented into tokens and sentences. A detailed description is given in the sentence detector and tokenizer tutorial. It is important that the tokenization of the training data and of the input text is identical.
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: NER
  • License: Apache License 2
  • Language: Any
  • Developer: http://opennlp.apache.org/

Chaos

  • Description:CHAOS: A robust syntactic parser for Italian and for English. The system implements a modular and lexicalised approach to the syntactic parsing problem. It is based on the notion of eXtended Dependency Graph (XDG) that has been seen as a useful representation mechanism in a shallow parsing approach. The system offers a collection of modules for designing parsing architectures. The pool of modules consists of:
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: NER
  • License: Unclear
  • Language: Italian English
  • Developer: http://art.uniroma2.it/external/chaosproject/

CiceroLite

  • Description:Language Computer's CiceroLite recognizes hundreds of different types of named entities in English, Arabic, and Chinese texts with nearly 90% precision and recall. It is available as one of many plug-in NLP components that operate within the Cicero On-Demand server.
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: NER
  • License: Commercial
  • Language: 7
  • Developer: http://www.languagecomputer.com/

FreeLing - NER

  • Description:There are two different modules able to perform NE recognition. They can be instantiated directly or via a wrapper that will create the right module depending on the configuration file.
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: NER
  • License: GPL
  • Language: Asturian Catalan English Galician Italian Portuguese Russian Spanish Welsh
  • Developer: http://www.talp.upc.edu/

Frog

  • Description:Frog's current version will tokenize, tag, lemmatize, and morphologically segment word tokens in Dutch text files, assign a dependency graph to each sentence, identify the base phrase chunks in the sentence, and attempt to find and label all named entities.
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: NER
  • License: GPL
  • Language: Dutch
  • Developer: ILK Research Group

Liner2 (NER)

  • Description:Liner2 is a customizable, open-source framework for the recognition of proper names. The framework consists of several universal methods for sequence chunking, which include dictionary look-up, pattern matching, and statistical processing. The statistical processing is performed using Conditional Random Fields and a rich set of features, including morphological, lexical, and semantic information. We present an application of the framework to the task of recognizing proper names in Polish texts (5 common categories of proper names, i.e. first names, surnames, city names, road names, and country names) and an extended model that recognizes 56 categories of proper names, which was used to bootstrap the manual annotation of the KPWr corpus.
  • Group: Text Processing
  • Type: -
  • Subtype: NER
  • License: unknown
  • Language: Polish
  • Developer: The WrocUT Language Technology Group G4.19
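The first two chunking methods named in the Liner2 description, dictionary look-up and pattern matching, can be illustrated with a minimal Python sketch. This is not Liner2's code or API: the gazetteer entries, the "ul." road pattern, the label names, and the `chunk` function are all hypothetical, and the statistical (Conditional Random Fields) step is omitted.

```python
import re

# Hypothetical gazetteer: token tuples mapped to a category label.
# A real system would load much larger dictionaries.
GAZETTEER = {
    ("Jan",): "first_name",
    ("Kowalski",): "surname",
    ("Nowy", "Sącz"): "city",
}

# Hypothetical pattern: the street abbreviation "ul." followed by
# a capitalised token is treated as a road name.
ROAD_PREFIX = re.compile(r"^ul\.$")
CAPITALISED = re.compile(r"^[A-ZĄĆĘŁŃÓŚŹŻ]\w+$")


def chunk(tokens):
    """Return (start, end, label) spans over tokens; end is exclusive."""
    spans = []
    max_len = max(len(key) for key in GAZETTEER)
    i = 0
    while i < len(tokens):
        match = None
        # Dictionary look-up: try the longest candidate first.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            label = GAZETTEER.get(tuple(tokens[i:i + n]))
            if label:
                match = (i, i + n, label)
                break
        # Pattern matching: only tried when the dictionary found nothing.
        if (match is None and i + 1 < len(tokens)
                and ROAD_PREFIX.match(tokens[i])
                and CAPITALISED.match(tokens[i + 1])):
            match = (i, i + 2, "road")
        if match:
            spans.append(match)
            i = match[1]  # continue after the recognised chunk
        else:
            i += 1
    return spans


tokens = ["Jan", "Kowalski", ",", "ul.", "Długa", ",", "Nowy", "Sącz"]
print(chunk(tokens))
```

In a fuller pipeline, spans like these would typically serve as features or candidates for the statistical model rather than as final output.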

LingPipe - NER

  • Description:LingPipe is a tool kit for processing text using computational linguistics. LingPipe is used for tasks such as finding the names of people, organizations, or locations in news; automatically classifying Twitter search results into categories; and suggesting correct spellings of queries.
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: NER
  • License: Limited version free; production version at a fee
  • Language: all in principle
  • Developer: http://alias-i.com/lingpipe/index.html

NCSR binarisation and colour reduction

  • Description:Perform image binarisation using an algorithm developed at NCSR.
  • Group: image processing
  • Type: image processing and enhancement
  • Subtype: ner
  • License:
  • Language: xhosa
  • Developer: National Center for Scientific Research (NCSR) "Demokritos"

NCSR evaluation tool for ocr

  • Description:This tool evaluates the performance of an optical character recognition system on character and word level.
  • Group: evaluation
  • Type: ocr (text)
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: National Center for Scientific Research (NCSR) "Demokritos"
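The entry does not say which metrics the NCSR tool computes. As an illustration of what character-level and word-level OCR evaluation can look like, the sketch below uses a common approach, edit distance normalised by the length of the ground truth. The function names and the sample strings are assumptions made for this example.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(ref, hyp):
    """Edit distance divided by the length of the reference."""
    if len(ref) == 0:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical ground truth and OCR output.
ground_truth = "the quick brown fox"
ocr_output = "the qu1ck brown f0x"

cer = error_rate(ground_truth, ocr_output)                  # character level
wer = error_rate(ground_truth.split(), ocr_output.split())  # word level
print(f"CER={cer:.3f} WER={wer:.3f}")
```

The same `edit_distance` function serves both levels because it works on any sequence: a string is compared character by character, a list of words token by token.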

NCSR geometric correction: page curl

  • Description:This tool rectifies document images which suffer from warping and perspective distortions
  • Group: image processing
  • Type: image processing and enhancement
  • Subtype: ner
  • License:
  • Language: n/a
  • Developer: National Center for Scientific Research (NCSR) "Demokritos"

NLTK - NER

  • Description:-
  • Group: Text Processing
  • Type: NLP Tools
  • Subtype: NER
  • License: Free
  • Language: Any
  • Developer: http://www.nltk.org/index.html

