Impact at CERL ANNUAL SEMINAR 2014
Impact at CERL ANNUAL SEMINAR 2014 Impact at CERL ANNUAL SEMINAR 2014

Impact will be next 28th of October in CERL ANNUAL SEMINAR 2014 ‘The Application of Text Encoding Facilities to Digital Versions of European Early Books’, that will take place in the National Library of Oslo (Norway). In the conference, Tomasz Parkoła (Digital Libraries Team, Supercomputing and Networking Center, Poznan) will talk about ‘The Impact Centre of Competence: tools for text digitisation and transcription’.

Continue reading

Working together to improve text digitisation techniques
Working together to improve text digitisation techniques Working together to improve text digitisation techniques

2nd Succeed hackathon at the University of Alicante

Is there anyone out there still thinking that a hackathon is a malicious break-in?

Far from it. It is the best way for developers and researchers to get together and work on new tools and innovations. The2nd developers workshop / hackathon organised on 10-11 April by the Succeed Project was a case in point: bringing together people to work on new ideas and new inspiration for better OCR. The event was held in the Claude Shannon room of the Department of Software and Computing Systems (DLSI) of the University of Alicante, Spain. Claude Shannon was a famous mathematician and engineer and is also known as the “father of information theory”. So it seemed a good place to have a hackathon!

Continue reading

IMPACT Final Conference – Evaluation of lexicon supported OCR and information retrieval

Jesse de Does

Jesse de Does – Evaluation of lexicon supported OCR

Jesse De Does from the INL gave a brief but rich presentation on the evaluation of lexicon supported OCR and the project’s recent improvements. To evaluate lexica in OCR, the FineReader SDK 10 is used. In short, the software measures OCR with a default included dictionary, and, for each word or fuzzy set, it gives a number of alternatives and segmentations. It is then up to the user to manually select the most suitable or probable option. Lexicon, however, may include errors and the fuzzy sets created by FineReader may be too small (we will never have all spelling variations or compounds). Thus, a number of actions, including word recall, dictionary cleaning and implementation of historical dictionaries, are taken in order to increase performance, even if by small percentages.

Continue reading

IMPACT Final Conference: 1st Keynote: The Strategic Digital Overview

Richard Bouderstone from British Library

Richard Boulderstone from Brtish Library gives the opening keynote

Richard Boulderstone, Director of eStrategy and Programs at the British Library, kicked off the IMPACT Conference this morning with a suitably impactful statement of scope: the British Library, he estimates, has nearly 5 billion physical pages in a 150 million object collection.

 

 

 

Continue reading