OCR Engine

A developer can recognize an image and write the result to a file, or enumerate the recognized lines, words, and characters together with their confidence.
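The enumeration described above can be pictured as a walk over nested result objects. The following is a minimal sketch in Python, not the library's API: the class names (Page, Line, Word, Char), the 0.0 to 1.0 confidence scale, and the rule that a word's confidence equals that of its weakest character are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Tuple


@dataclass
class Char:
    text: str
    confidence: float  # assumed scale: 0.0 (no trust) to 1.0 (certain)


@dataclass
class Word:
    chars: List[Char] = field(default_factory=list)

    @property
    def text(self) -> str:
        return "".join(c.text for c in self.chars)

    @property
    def confidence(self) -> float:
        # Assumption: a word is only as reliable as its weakest character.
        return min((c.confidence for c in self.chars), default=0.0)


@dataclass
class Line:
    words: List[Word] = field(default_factory=list)


@dataclass
class Page:
    lines: List[Line] = field(default_factory=list)


def iter_words(page: Page) -> Iterator[Tuple[int, str, float]]:
    """Yield (line number, word text, word confidence) in reading order."""
    for line_no, line in enumerate(page.lines, start=1):
        for word in line.words:
            yield line_no, word.text, word.confidence


def make_word(text: str, confidences: List[float]) -> Word:
    """Hypothetical helper that pairs each character with a confidence."""
    return Word([Char(t, c) for t, c in zip(text, confidences)])


# Hand-built result standing in for real recognition output.
page = Page([
    Line([
        make_word("OCR", [0.99, 0.95, 0.97]),
        make_word("text", [0.90, 0.60, 0.80, 0.85]),
    ])
])

for line_no, text, conf in iter_words(page):
    print(f"line {line_no}: {text!r} confidence={conf:.2f}")
```

A caller could filter on the yielded confidence to flag words that need manual review.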

Data sources for the engine can be scanned images or files. The engine output consists of either a file or a class hierarchy. This model is illustrated below.

Because the OcrEngine object is abstract, you cannot create an instance of it directly. Nevertheless, its definition contains most of the functionality that a concrete subclass needs, so a subclass can work with a minimum of extra code.
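The relationship between an abstract engine and a concrete subclass can be sketched as follows. This is an illustration of the general pattern in Python, not the actual OcrEngine definition: the names OcrEngineSketch, FixedTextEngine, recognize_to_file, and _recognize are hypothetical, and the toy subclass returns canned text instead of performing recognition.

```python
from abc import ABC, abstractmethod
from pathlib import Path
import tempfile


class OcrEngineSketch(ABC):
    """Hypothetical stand-in for an abstract OCR engine base class."""

    def recognize_to_file(self, image: bytes, out_path: Path) -> None:
        # Shared behavior lives in the base class, so subclasses stay small.
        text = self._recognize(image)
        out_path.write_text(text, encoding="utf-8")

    @abstractmethod
    def _recognize(self, image: bytes) -> str:
        """The one engine-specific step each concrete subclass must supply."""


class FixedTextEngine(OcrEngineSketch):
    """Toy subclass: returns canned text rather than doing real OCR."""

    def __init__(self, text: str) -> None:
        self._text = text

    def _recognize(self, image: bytes) -> str:
        return self._text


# The abstract base cannot be instantiated.
try:
    OcrEngineSketch()
except TypeError as exc:
    print("cannot instantiate abstract engine:", type(exc).__name__)

# The concrete subclass inherits the file-output logic unchanged.
with tempfile.TemporaryDirectory() as tmp:
    out = Path(tmp) / "result.txt"
    FixedTextEngine("hello").recognize_to_file(b"", out)
    print(out.read_text(encoding="utf-8"))
```

The subclass supplies a single method and inherits everything else, which is the sense in which the base definition carries most of the required functionality.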

The OcrEngine object has five primary components as illustrated below:

Preprocessing options
Document translators
Page element factory
Font mapping
Font building
