Page Curl Correction

Author: Sebastian Kirch (Fraunhofer IAIS)

For old books and newspapers, it is often not possible to remove the binding before digitizing the individual pages. The resulting page images are frequently distorted by the warping caused by the book's binding (Figure 1).

Figure 1: A scanned book page with page curl

Other causes of distorted pages include environmental conditions such as humidity (which can cause pages to shrink over time) and an incorrect camera setup.

Page Curl Correction is a command-line tool that detects such distortions and corrects them automatically. This can significantly improve the results of a subsequent text recognition step, since most OCR algorithms do not account for this kind of distortion. More information about the tool can be found here: