beautypg.com

Kofax INDICIUS 6.0 User Manual

Page 114

background image

Chapter 4

104

Getting Started Guide (Classification and Separation)

ƒ

The first, a “reference batch,” is created during export of the Test Documents
from Transformation Studio. It contains accurate document types and
structure for all the Test Documents.

ƒ

The second, a “comparison batch,” contains the same documents. However,
the document types and structure are exported from Recognition Test Tool,
after a test has been run using the new configuration.

Once the two offline batches have been generated, they are compared using the
BatchCompare utility and the results of the comparison are output to a MS Excel
workbook.

Within Excel, additional statistics can be generated from the raw data using built in
macros.

Statistics

It is important to consider the complete set of statistics when analyzing the
performance, as no single value indicates good or bad performance. For example, if
the separation statistics show several missed or additional splits, the classification
statistics will not accurately represent the performance.

For information on all the statistics, refer to the BatchCompare Reference in the
INDICIUS Help.

Once the separation statistics have been taken into account, the key statistics are the
accuracy and classification rate for each document type.

ƒ

Accuracy should be as high as possible for each document type.

ƒ

Classification rate should be as high as possible for each document type.

A compromise must be found between the two values, since:

ƒ

In order to improve the classification rate the accuracy may drop.

ƒ

In order to improve the accuracy the classification rate may drop.

Note

In order to use the BatchCompare utility you need to have Microsoft Office

Excel installed.

X

To evaluate and improve performance

1

Export the comparison batch from Recognition Test Tool.

a

Select Results | Export Results to Batch File.