This is a set of software tools for manipulating scanned images in order to improve the recognition results of OCR engines.
Defects in document images fall into three broad categories, each of which can be reduced or eliminated to improve the OCR results obtained from scanned documents:
- Binarisation and Colour Reduction
- Border Detection and Removal
- Geometric Correction: Page Curl & Arbitrary Warping
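
To give a feel for the first category, here is a minimal sketch of global binarisation using Otsu's method, a common textbook technique. It is not taken from these tools and may differ from what they actually implement. The function names `otsu_threshold` and `binarise` are hypothetical, and the input is assumed to be an 8-bit greyscale image stored as a list of rows.

```python
from typing import List


def otsu_threshold(pixels: List[int]) -> int:
    """Return the 8-bit grey level that maximises between-class variance.

    Assumption: pixel values are integers in the range 0..255.
    """
    # Build a 256-bin histogram of grey levels.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0    # number of pixels with value <= t
    sum_dark = 0  # sum of the values of those pixels
    for t in range(256):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        w_light = total - w_dark
        if w_dark == 0:
            continue  # no dark class yet
        if w_light == 0:
            break     # every pixel is already in the dark class
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t


def binarise(image: List[List[int]]) -> List[List[int]]:
    """Map each pixel to 0 (ink) or 255 (paper) using one global threshold."""
    flat = [p for row in image for p in row]
    t = otsu_threshold(flat)
    return [[0 if p <= t else 255 for p in row] for row in image]


if __name__ == "__main__":
    # Tiny hypothetical image: dark text pixels on a light background.
    page = [[10, 12, 200],
            [11, 210, 205]]
    print(binarise(page))
```

A single global threshold like this tends to work only on evenly lit scans; pages with shadows or stains usually need a locally adaptive approach instead.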