Supported Fonts for OCR

General

ABBYY OCR technology for printed text is available for 198 languages, including:

  • European languages (Latin, Cyrillic, Armenian, Greek alphabets)
  • FineReader XIX (19) — an OCR module designed specifically for digitizing and archiving old documents, books and newspapers published in the 19th century.

Not only the language is an important factor for OCR, but also the way how the characters/letters are printed. Internally ABBYY technology is based on omni-font technology and not trained on a special font type.
Never the less it is important to tell the OCR Engine what text type is used in the documents that are processed, because then it is optimised for speed and quality on the text-type selection.

"European" Text Types

ABBYY products support following types of text printing:

  • Normal - a common typographic type of text, such as Arial, Times New Roman or Courier.
  • Typewriter - text typed on a typewriter.
  • Matrix - text printed on a dot-matrix printer.
  • Gothic
    Text printed with the Gothic type and used for Gothic recognition. It may look as follows:
  • Index
    Special set of characters including only digits written in ZIP-code style. They look as follows:
  • OCR_A
    A monospaced font designed specifically for OCR. It is largely used by banks, credit card companies and similar businesses. It may look as follows:
  • OCR_B
    A font designed specifically for OCR. It may look as follows:
  • MICR_E13B
    Special numeric characters printed in magnetic ink. MICR (Magnetic Ink Character Recognition) characters are found in a variety of places, including personal checks. They may look as follows:
  • MICR_CMC7
    Special MICR barcode font (CMC-7). It is used on the bank checks. It may look as follows:

Asian Languages

  • Chinese (Simplified and Traditional), Japanese, and Korean (CJK)
  • Thai, Vietnamese and Hebrew
  • Arabic — technical preview version


Back to: Feature Overview