New in Engine 9.0 Windows

Adaptive Document Recognition Technology (ADRT)

New recognition technology based on an innovative set of document synthesis algorithms. Automatically builds a logical model of the document structure and identifies:

  • Formatting purpose of elements like headers and footers, footnotes, page numbers etc.
  • How document elements should be reconstructed

ADRT automatically locates, determines and reconstructs various structural parts of the document and formatting of the elements.

Benefits

  • ADRT is a major step forward in intelligent, easy to use document recognition. Documents generated by ADRT have consistent formatting across all pages of a document since they are processed as a unit.
  • ADRT preserves integrity with logical relations between elements. For example, during the export to DOC(X) page numbers, headers and footers are automatically formatted and linked in an intelligent way.

Multi-Page Processing through new Document specific API

The new API objects allow you to set up the parameters of page and document synthesis separately. The pages will be processed as a logical unit and not isolated from each other. This approach preserves the logical organisation of the document, retaining not only the original text and columns, but also fonts, styles, etc.

Benefit
Makes multi page document processing very simple, only 3 steps are needed:

  1. Open multiple pages or multi page documents
  2. Start processing and
  3. Export them.

Multi CPU / Multi Core Recognition Architecture

Utilises all CPU cores during analysis and recognition of multi page documents.

Benefit
Automatically combines and executes steps of distributing pages, as well as coordinating recognition and the synthesis. Provides easy scalability and utilisation of multi core hardware.

New 2D barcode types

New support for additional barcode formats:

  • Aztec 2D Barcodes
    • 2 dimensional matrix barcode which can contain up to 3000 characters
    • Often used on tickets by railways for example Deutsche Bahn and Swiss Federal Railways.
  • Data Matrix 2D Barcodes
    • 2 dimensional matrix barcode capable of storing and encoding fifty characters in a symbol that is readable at size of 2 or 3 mm2.The code can be read with only a 20% contrast ratio.
    • Labelling small items such as electronic components, also printed materials such as labels and letters.
  • QR code 2D Barcodes
    • Next generation format used already in Japan for both image and text data with ability to store URLs which can be read and launched by appropriate software
    • Packaging, product and advertising info, and mobile-based applications.

Benefits
The new 2D barcode support in FineReader Engine 9.0 Linux opens new areas for your applications, like postal environments, ticket or voucher processing applications, document capturing with digital cameras.

New Image pre-processing capabilities

Additional image pre-processing functionalities:

  • Detection of an image rotation up to 20 degrees
  • Deskew by horizontal and vertical pairs of black squares
  • Deskew by horizontal and vertical lines
  • Deskew by horizontal and vertical lines of text

Benefit
Better OCR results on skewed images generated during scanning or document capture with a camera.

Visual Components

ABBYY FineReader Engine 9.0 Visual Components offer a set of graphical user interface components which developers can take advantage of when creating their own applications. FineReader Engine 9.0 Visual Components were created using ABBYY’s expertise in end-user applications and are based on interfaces created for our award-winning FineReader Professional Edition application. The Visual Components are ActiveX controls for easy integration of user interface elements to existing applications. Developers can give users direct but controlled access to recognition results and functions for validation or checking of documents.

  • Scan Interface
  • Document Viewer
  • Image Viewer
  • Text Editor
  • Text Validator

Improved Asian Language OCR Support

New ABBYY OCR technology for

  • Chinese,
  • Japanese and
  • Korean languages.

Functionality enhancements include

  • enhanced accuracy,
  • faster processing and
  • the ability to combine these hieroglyphic Asian languages with other languages.

Benefit
High accuracy and faster processing results for Asian documents. New possibility to work with multi-language documents like Chinese – French, Japanese – German etc.

New Export Formats

XML-based Office 2007 File Formats

  • Export to new XML-based Microsoft Office formats – DOCX, XLSX and PPTX.

Benefit
Users can export recognised documents to the new, open, interoperable, robust XML based formats that were introduced in Microsoft Office 2007.

MRC (Mixed Raster Content) Compression for PDF and PDF/A

Export to PDF with MRC compression. Supports the ability to set the MRC compression level. The parameters of compression for background, colour and text mask can be set and modified.

Benefit
MRC compression achieves significantly better file compression without visible degradation of document representation. Significant reduced file size, up to 10 times smaller compared to JPEG compression. Ideal when colour documents are scanned and processed.

New and Extended Licensing Options

Extended CPU Core Support

Out-of-the box support for multi-core CPUs. New licensing scheme allows an unlimited number of cores with page limited licences.

Benefit

  • Developers and customers can easily use multi-core hardware.
  • Network licences allow flexible OCR distribution in an enterprise with increased speed and throughput.

CPU core based licences

New offer of licences without a page counter – instead pricing is based on the maximum number of CPU cores that can be used. Pricing is based on the maximum number of CPU cores that can be used instead.

Benefit

  • CPU licences are often required in server based processing scenarios.

Maximum Speed Limitation

  • This new option sets maximum recognition speed limitation on characters per second.

Benefit

  • If your business model is based on the maximum speed of the ORC Engine, then the new licensing model is perfect for your needs.

Architectural changes in FineReader Engine 9.0

ABBYY will now offer separate development environments for document recognition/document conversion and data capture scenarios with ABBYY template matching technology: ABBYY FlexiCapture Engine

Back to FineReader Engine 9.0 Windows Overview