INFO: OcrEngine - Overview


OCR Engine

developer can recognize an image and output that image to a file, or enumerate its lines, words, and characters with confidence.

Data sources for the engine can be scanned images or files. The engine output consists of either a file or a class hierarchy. This model is illustrated below.

As OcrEngine object is abstract, you cannot create an instance of this object. Nevertheless, the object definition contains most of the necessary functionality needed for a concrete subclass to function with a minimum of extra code.

The OcrEngine object has five primary components as illustrated below:

  • Preprocessing options
  • Document translators
  • Page element factory
  • Font mapping
  • Font building

 

See Also

Original Article:
Q10364 - INFO: OcrEngine - Overview