Chapter 8 - Recognizing documents, Introduction – I.R.I.S. Readiris Corporate 12 for Mac User Guide

Readiris™ Corporate 12 – User Guide

CHAPTER 8

RECOGNIZING DOCUMENTS

INTRODUCTION

To recognize documents, Readiris applies linguistics during the
recognition phase. As a result, Readiris recognizes text, tables,
graphics, barcodes and handprinted text in all kinds of documents.
Readiris even copes with complex multi-column documents, low-quality
documents, faxes, dot matrix printouts, and badly scanned or copied
documents in which the characters are too light or too dark.

Readiris supports 125 languages: all American and European
languages are supported, including the Central-European, Baltic and
Cyrillic languages as well as Greek and Turkish. Optionally,
Readiris can read Hebrew documents and four Asian languages:
Japanese, Simplified Chinese, Traditional Chinese and Korean.
Readiris even copes with mixed alphabets: the software detects
“Western” words that occur in Greek, Cyrillic, Hebrew and Asian
documents, where many proper names and brand names that cannot be
transcribed are written in the Western alphabet.

Readiris is based on the most advanced recognition technologies.
Font-independent text recognition is complemented by self-learning
techniques: the system is able to learn new characters and words
through contextual and linguistic analysis. This means that the
accuracy of the recognition system improves as recognition proceeds.

In addition, Readiris has a user verification function called
Interactive learning. When activated, this function not
only flags characters the recognition system isn't sure of but also