ABBYY OCR technology for printed text is available for 198 languages, including:
European languages (Latin, Cyrillic, Armenian, Greek alphabets)
FineReader XIX (19) — an OCR module designed specifically for digitizing and archiving old documents, books and newspapers published in the 19th century.
Not only the language is an important factor for OCR, but also the way how the characters/letters are printed. Internally ABBYY technology is based on omni-font technology and not trained on a special font type.
Never the less it is important to tell the OCR Engine what text type is used in the documents that are processed, because then it is optimised for speed and quality on the text-type selection.
ABBYY products support following types of text printing:
Gothic
Text printed with the Gothic type and used for
Gothic recognition. It may look as follows:

Index
Special set of characters including only digits written in ZIP-code style. They look as follows:

OCR_A
A monospaced font designed specifically for OCR. It is largely used by banks, credit card companies and similar businesses. It may look as follows:

OCR_B
A font designed specifically for OCR. It may look as follows:

MICR_E13B
Special numeric characters printed in magnetic ink. MICR (Magnetic Ink Character Recognition) characters are found in a variety of places, including personal checks. They may look as follows:

MICR_CMC7
Special MICR barcode font (CMC-7). It is used on the bank checks. It may look as follows:

Chinese (Simplified and Traditional), Japanese, and Korean (CJK)
Thai, Vietnamese and Hebrew
Arabic — technical preview version
Back to: Feature Overview