PRO Indexer
The PRO Indexer selects and extracts pertinent data from within a print stream. Supported input file types include PDF, PostScript, PCL, AFP, and Xerox
LCDS/Metacode. The generated index files can be written in a variety of formats, including XML, comma-separated values (CSV), fixed-length records, and a user-defined file format, which accommodates almost any layout a user requires.
Any number of index keys can be extracted.
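To make the output formats concrete, the following Python sketch writes the same set of index records as CSV, fixed-length records, and XML. It is only an illustration and does not use PRO Indexer itself: the record contents, the field names (`account`, `name`, `page`), the column widths, and the XML element names are all assumptions chosen for the example.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical index records: one dict per document, keyed by index key name.
RECORDS = [
    {"account": "100234", "name": "A. Smith", "page": "1"},
    {"account": "100777", "name": "B. Jones", "page": "4"},
]
FIELDS = ["account", "name", "page"]
WIDTHS = {"account": 10, "name": 20, "page": 5}  # assumed column widths


def to_csv(records):
    """Return the records as CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


def to_fixed(records):
    """Return the records as fixed-length lines, padded or truncated per field."""
    lines = []
    for rec in records:
        lines.append("".join(rec[f][:WIDTHS[f]].ljust(WIDTHS[f]) for f in FIELDS))
    return "\n".join(lines) + "\n"


def to_xml(records):
    """Return the records as an XML string with one <document> per record."""
    root = ET.Element("index")
    for rec in records:
        doc = ET.SubElement(root, "document")
        for f in FIELDS:
            ET.SubElement(doc, f).text = rec[f]
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(to_csv(RECORDS))
    print(to_fixed(RECORDS))
    print(to_xml(RECORDS))
```

Because the list of fields drives all three writers, adding another index key in this sketch means extending `FIELDS` and `WIDTHS` rather than changing each writer.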
After selection, fields can be manipulated as required before being written to the index file.
For example, PRO Indexer can create the commands needed to add records to a database.
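As a rough illustration of that workflow, the Python sketch below cleans up a few extracted fields and then builds a parameterized SQL `INSERT` command from them. It is not PRO Indexer's scripting language; the field names, the input date format (`MM/DD/YYYY`), the table name `doc_index`, and the helper functions `normalize` and `insert_command` are hypothetical.

```python
import sqlite3
from datetime import datetime


def normalize(raw):
    """Clean up raw extracted fields before they are written out."""
    return {
        # Trim whitespace and drop leading zeros from the account number.
        "account": raw["account"].strip().lstrip("0"),
        # Collapse runs of whitespace and upper-case the name.
        "name": " ".join(raw["name"].split()).upper(),
        # Convert an assumed MM/DD/YYYY date to ISO format.
        "statement_date": datetime.strptime(
            raw["date"].strip(), "%m/%d/%Y"
        ).strftime("%Y-%m-%d"),
    }


def insert_command(table, record):
    """Build a parameterized INSERT statement and its tuple of values."""
    cols = ", ".join(record)
    marks = ", ".join("?" for _ in record)
    return f"INSERT INTO {table} ({cols}) VALUES ({marks})", tuple(record.values())


if __name__ == "__main__":
    raw = {"account": " 000123 ", "name": "  jane   doe ", "date": "03/07/2024"}
    sql, params = insert_command("doc_index", normalize(raw))
    print(sql, params)

    # Run the command against an in-memory database to show it is valid SQL.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE doc_index (account TEXT, name TEXT, statement_date TEXT)")
    conn.execute(sql, params)
    print(conn.execute("SELECT * FROM doc_index").fetchall())
```

Keeping the values separate from the SQL text, as the placeholders do here, avoids quoting problems when an extracted field contains an apostrophe or comma.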
PRO Indexer supports the creation of highly complex index files in any format, including XML, thanks to its powerful new scripting language. With the PRO
Indexer, users can script if/else statements that apply specific text extraction rules and business logic routines to any particular page.
PRO Indexer can also be used to add TLEs to AFP files and bookmarks to PDF files.
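The idea of page-specific rules can be sketched in Python as follows. This is not the PRO Indexer scripting language, whose syntax is not shown here; the page markers (`STATEMENT SUMMARY`, `TRANSACTION DETAIL`), the regular expressions, and the function name `index_page` are assumptions made purely to show how conditional logic can choose which extraction rules apply to a given page.

```python
import re

# Assumed patterns for the fields of interest on a summary page.
ACCOUNT_RE = re.compile(r"Account\s*(?:No\.?|Number)?\s*:\s*(\d+)")
TOTAL_RE = re.compile(r"Total Due\s*:\s*\$?([\d,]+\.\d{2})")


def index_page(page_number, text):
    """Apply different extraction rules depending on what the page contains."""
    entry = {"page": page_number}
    if "STATEMENT SUMMARY" in text:
        # Summary pages carry the index keys we want to capture.
        entry["type"] = "summary"
        account = ACCOUNT_RE.search(text)
        entry["account"] = account.group(1) if account else None
        total = TOTAL_RE.search(text)
        entry["total_due"] = total.group(1).replace(",", "") if total else None
    elif "TRANSACTION DETAIL" in text:
        # Detail pages are recorded but no keys are extracted from them.
        entry["type"] = "detail"
    else:
        entry["type"] = "other"
    return entry


if __name__ == "__main__":
    pages = [
        "STATEMENT SUMMARY\nAccount No: 100234\nTotal Due: $1,250.00",
        "TRANSACTION DETAIL\n03/01 Payment received",
        "Important notices",
    ]
    for number, text in enumerate(pages, start=1):
        print(index_page(number, text))
```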
PRO Indexer can create index files for any of the FileNet import utilities and the Xerox DocuShare import utility.
The PRO Indexer can be used to create XML files for either document indexing or for displaying document information online.
This can be used to build your own Electronic Bill Presentment System.
The PRO Indexer can also be used to "scrape" all or selected fields of text and place them in an index file or text file.
When configured to run with any of the PRO transform products, this powerful indexing tool adds valuable functionality to our continuous one-step processing.
The following are a few examples of how our customers are applying the power of PRO Indexer capabilities:
- Intelligent reading of the existing index information found in TLEs in an AFP document and creation of bookmarks when transforming to PDF format for web
presentment and/or archiving
- Intelligent extraction of text fields within a document and creation of TLEs in an AFP output file or bookmarks in a PDF output file based on the
extracted text
- Data mining from complex dynamically assembled documents