Dictionaries – Kofax Getting Started with Ascent Xtrata Pro User Manual

Extraction
Ascent Xtrata Pro User's Guide
In the example above, an eight-digit invoice number (Rechnungsnummer) was
specified using a simple regular expression that also matches other numeric values.
Because no keyword is defined, the order number “65005285” was not the only value
identified. The result of the test is shown above.
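The ambiguity described above can be reproduced outside the product with a short Python sketch. It is only an illustration: the sample text and the value 20060117 are made up, only “65005285” comes from the example, and the pattern syntax is Python's, which may differ from the format syntax used by Ascent Xtrata Pro.

```python
import re

# Hypothetical OCR text; only the value 65005285 is taken from the manual.
sample_text = """Order 65005285
Invoice 20060117
Date 17.01.2006"""

# Exactly eight consecutive digits; the lookarounds keep longer digit
# runs from being split into eight-digit pieces.
EIGHT_DIGITS = re.compile(r"(?<!\d)\d{8}(?!\d)")

matches = EIGHT_DIGITS.findall(sample_text)
print(matches)
```

Both eight-digit values are returned, so the pattern alone cannot tell the invoice number from the order number.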
Figure 4-35. Locate an 8-Digit Invoice Number (With Keyword “Invoice”)
Adding the keyword “Invoice”, which appears above the expected item, changes the
confidence of all other format matches to 0%, while the invoice number is returned as
the best match with a confidence of 100%.
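The effect of a keyword located above the expected item can be sketched as follows. This is a simplified illustration under stated assumptions, not the scoring used by Ascent Xtrata Pro: the `Word` class, the `score_candidates` function, the coordinates, the horizontal tolerance, and the value 20060117 are all hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    x: int  # left edge, arbitrary units
    y: int  # top edge; a smaller value is higher on the page


def score_candidates(words, keyword, pattern, x_tolerance=50):
    """Give 100 to format matches with the keyword above them, 0 otherwise."""
    anchors = [w for w in words if w.text.lower() == keyword.lower()]
    scores = {}
    for w in words:
        if not re.fullmatch(pattern, w.text):
            continue  # not a format match at all
        keyword_above = any(
            a.y < w.y and abs(a.x - w.x) <= x_tolerance for a in anchors
        )
        scores[w.text] = 100 if keyword_above else 0
    return scores


# Hypothetical layout: two column headings with one value under each.
words = [
    Word("Invoice", 100, 10), Word("Order", 400, 10),
    Word("20060117", 100, 40), Word("65005285", 400, 40),
]
scores = score_candidates(words, "Invoice", r"\d{8}")
print(scores)
```

Only the value positioned under the keyword keeps a non-zero confidence; the other format match drops to 0.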
Note
Keywords are located using a fault-tolerant search, so misspelled
keywords and words containing OCR errors are still found. The better a
word matches a keyword, the higher the confidence of the matched format.
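A fault-tolerant keyword search of this kind can be approximated with Python's standard `difflib` module. The sketch below is an assumption about the general idea only; the manual does not say which similarity measure the product uses, and the function names and the 0.7 threshold are hypothetical.

```python
from difflib import SequenceMatcher


def keyword_similarity(word, keyword):
    """Return a case-insensitive similarity between 0.0 and 1.0."""
    return SequenceMatcher(None, word.lower(), keyword.lower()).ratio()


def best_keyword_match(words, keyword, threshold=0.7):
    """Return (word, score) for the closest word, or (None, score) if too weak."""
    if not words:
        return (None, 0.0)
    score, word = max((keyword_similarity(w, keyword), w) for w in words)
    return (word, score) if score >= threshold else (None, score)


# "lnvoice" simulates an OCR error (lowercase L read instead of capital I).
ocr_words = ["Order", "lnvoice", "Date"]
match, score = best_keyword_match(ocr_words, "Invoice")
print(match, round(score, 2))
```

An exact match scores 1.0 and the OCR-damaged word scores somewhat lower, which mirrors the rule that a better keyword match yields a higher confidence.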
Dictionaries
Dictionaries can be used to locate complex expressions that also contain words. It is
quite easy to define a format for dates, as long as a numeric format is used. But when