I.R.I.S. Readiris Corporate 12 User Manual

Readiris
TM
Corporate 12
User Guide
ReadirisTM Corporate 12 – User Guide
Table of Contents
Copyrights ........................................................................................... 1
Chapter 1 Introducing Readiris ................................................ 3
Save time, avoid retyping.................................................. 3
The Readiris series ............................................................ 7
Chapter 2 Installing Readiris .................................................. 11
System requirements ....................................................... 11
Software installation ....................................................... 12
Uninstalling the software ................................................ 13
Software registration ....................................................... 13
Product support ............................................................... 14
Chapter 3 Getting started ........................................................ 15
Running Readiris ............................................................ 15
Using the OCR Wizard ................................................... 15
User interface .................................................................. 17
Changing the user interface language ............................. 20
Chapter 4 The Readiris SmartTasks ...................................... 21
Chapter 5 Scanning documents ............................................... 26
Selecting the document type ........................................... 26
Selecting the options ....................................................... 27
iii
Table of Contents
Opening image files ........................................................ 28
Scanning paper documents .............................................. 30
Chapter 6 Adjusting scanned documents ............................... 37
Chapter 7 Saving documents as image files .......................... 43
Chapter 8 Windowing documents ........................................... 45
Windowing documents automatically ............................. 45
Windowing documents manually .................................... 47
Using windowing templates ............................................ 51
Chapter 9 User indexing .......................................................... 55
Chapter 10 Recognizing documents ........................................ 57
Introduction ..................................................................... 57
Selecting the document language .................................... 58
Using user lexicons ......................................................... 61
Defining the document characteristics ............................ 63
Using interactive learning ............................................... 65
Using font dictionaries .................................................... 67
Chapter 11 Formatting and saving documents ...................... 69
Formatting documents .................................................... 69
Formatting text documents .............................................. 71
Formatting table-based documents ................................. 75
Creating PDF documents ................................................ 79
iv
ReadirisTM Corporate 12 – User Guide
Selecting the PDF options ............................................... 80
iHQC compressing PDF documents ............................... 81
Password protecting PDF documents .............................. 83
Digitally signing PDF documents ................................... 84
Repurposing PDF documents .......................................... 85
Creating XPS documents ................................................ 86
Selecting the XPS options ............................................... 87
iHQC compressing XPS documents ............................... 88
Selecting the graphics options ......................................... 89
Chapter 12 Saving and loading settings ................................. 91
Chapter 13 Recognizing multipage documents ...................... 93
Opening and recognizing multiple image files ................ 93
Scanning and recognizing multipage documents ............ 95
Editing multipage documents .......................................... 96
Chapter 14 Recognizing large volumes of scanned images .. 99
Executing Batch OCR ..................................................... 99
Setting up a watched folder ........................................... 101
Chapter 15 Separating and indexing document batches ..... 103
Separating document batches ........................................ 103
Indexing document batches ........................................... 106
Chapter 16 Recognizing handprinted text ........................... 109
v
Table of Contents
Chapter 17 Recognizing barcodes ......................................... 113
Chapter 18 Recognizing business cards................................ 117
Index .................................................................................. 121
vi
ReadirisTM Corporate 12 – User Guide
Copyrights
ReadirisCorporate12-dgi-110209-04
Copyrights © 1987–2009 I.R.I.S. All Rights Reserved.
I.R.I.S. owns the copyrights to the Readiris software, to the online help system and to this publication.
The information contained in this document is the property of I.R.I.S. Its content is subject to change without notice and does not represent a commitment on the part of I.R.I.S. The software described in this document is furnished under a license agreement which states the terms of use of this product. The software may be used or copied only in accordance with the terms of that agreement. No part of this publication may be reproduced, transmitted, stored in a retrieval system, or translated into another language without the prior written consent of I.R.I.S.
This user guide utilizes fictitious names for purposes of demonstration; references to actual persons, companies or organizations are strictly coincidental.
Trademarks
The Readiris logo, Readiris and IRISCard are trademarks of Image
Recognition Integrated Systems S.A.
OCR, ICR and barcode technology by I.R.I.S.
AutoFormat and Linguistic technology by I.R.I.S.
BCR and field analysis technology by I.R.I.S.
iHQC compression technology by I.R.I.S.
XML parser developed by Apache. This product includes software developed
by the Apache Software Foundation.
All other products mentioned in this user guide are trademarks or registered trademarks of their respective owners.
1
ReadirisTM Corporate 12 – User Guide
CHAPTER 1
NTRODUCING READIRIS
I

SAVE TIME, AVOID RETYPING

Congratulations on acquiring Readiris. This software package will undoubtedly be of great help in recapturing your texts, tables, graphics, barcodes and handprinted text.
As efficient as computers are, you have to key in your information first. If you have ever retyped a 15 page report or a large table of figures, you know how tedious and time-consuming it can be. Use this state-of-the-art OCR package to automatically convert paper documents or scanned image files into text searchable and editable documents that can be archived and shared. Two recognition modes are available: one ensures maximal speed, the other guarantees optimal OCR accuracy.
Scan a printed or typed document, indicate the zones you want to recognize with Readiris - or have the system detect them for you ­execute the character recognition and export the document to your word processor. Documents composed of many pages are processed from start to finish in a single effort. A few mouse clicks beat long hours of work as Readiris converts your paper documents into editable computer files: it’s up to 40 times faster than manual retyping.
The wizard smoothly guides you through the settings required to operate Readiris, allowing you to obtain quick and easy results. Or use the SmartTasks to speed up the process even more. You can send the reading results directly to your word processor or
3
Chapter 1 – Introducing Readiris
spreadsheet, archive them as PDF or XPS files, etc. To recognize faxes and convert PDF documents, drag their image files from Windows Explorer to the Readiris application window. Or send an image promptly to Readiris via the context menu.
Readiris recognizes tabular data and recreates them as worksheets in your spreadsheet software or as table objects inside your word processor; your numeric data are immediately ready for further processing.
Readiris is based on the most advanced recognition technologies. Font-independent text recognition is complemented by self-learning techniques. The system is able to learn new characters and words through contextual and linguistic analysis. This means that the OCR accuracy of the recognition system will improve as it goes along.
Readiris supports up to 128 languages: all American and European languages are supported, including the Central-European, Baltic and Cyrillic languages as well as Greek and Turkish. Optionally, Readiris can read Arabic, Farsi and Hebrew documents and four Asian languages - Japanese, Simplified and Traditional Chinese and Korean. Readiris even copes with mixed alphabets: the software detects “Western” words that occur in Greek, Cyrillic, Arabic, Hebrew and Asian documents - many untranscribable proper names, brand names, etc. are written using the Western symbols.
Readiris uses linguistics during the recognition phase, not afterwards. As a result, Readiris recognizes all kinds of documents with top accuracy, including low-quality documents, faxes and dot matrix printouts. It copes beautifully with badly scanned and copied documents containing too light or dark font shapes. Joined characters are resolved while fragmented characters, such as dot matrix symbols, are recomposed.
Besides that, Readiris has an (optional) user verification function. When activated, the user verification function (“Interactive learning”) not only flags the characters the recognition system isn't sure of but also allows to increase the system's accuracy. All
4
ReadirisTM Corporate 12 – User Guide
solutions you confirm are memorized, increasing the system speed and confidence and rendering the system more intelligent as you go along. This powerful learning tool also allows you to train Readiris on special characters such as mathematical symbols and dingbats and to handle distorted fonts.
To increase your productivity further, Readiris not only recognizes your texts, but can format them for you as well. Various levels of formatting are available. When you make use of “autoformatting”, Readiris recreates a facsimile copy of the scanned document: the word, paragraph and page formatting of the original document are retained. Similar typefaces are used, the point sizes and type styles as used in the source document are maintained across the recognition. The placement of columns, text blocks and graphics follows your original documents. Readiris can even include the background photo of a scanned page in the recognized document. And as Readiris supports grayscale and color scanning effortlessly, you can recapture any graphics - be they line art, black-and-white photos or color illustrations. When a document contains tables, Readiris reorganizes them in real cells and recreates the cell borders of the original tables.
In other words, Readiris allows you to archive a true copy of your documents, be it editable and compact text files instead of scanned images.
Barcodes that occur on a scanned page can also be read, and the same goes for handprinted text, provided you write well-spaced “block letters”.
You can even recognize business cards with Readiris: scan your business cards, recognize them and convert them into an address database.
The cards’ data is extracted automatically from the image and the recognition results are assigned to specific database fields. Readiris extensively uses a knowledge database, thus acquiring the necessary intelligence to distinguish between first and last names, cities and
5
Chapter 1 – Introducing Readiris
states, telephone and fax numbers, etc. The resulting data can be sent directly to your contact management software such as Microsoft Outlook (Express) or any vCard compliant application.
Readiris is Twain compliant and supports a wide range of flatbed and sheetfed scanners, “all-in-one” devices or “MFPs” (“multifunctional peripherals”) and digital cameras. Interval scanning allows you to scan multipage documents efficiently when your scanner is not equipped with a document feeder.
Readiris also supports high-speed scanners and executes batch OCR on large image collections: blank pages can be used to segment scanned batches into separate documents, automatic barcode reading ensures the proper indexing of the recognized documents.
6
ReadirisTM Corporate 12 – User Guide

THE READIRIS SERIES

The table below gives an overview of the available versions:
Readiris Home 12
Limited features
25 recognition languages
Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF , BMP, PCX images
Generates PDF Image-Text, DOCX, ODT, WordML, SpreadsheetML, RTF, HTM, XML, TXT, TIFF, etc. output
Readiris Pro 12 Readiris Corporate 12
Basic features Basic features
128 recognition languages 128 recognition languages
Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF , BMP, PCX
Generates four types of PDF files , PDF­iHQC (level I), four types of XPS, XPS­iHQC (level I), DOCX, ODT, XLS, WordML, SpreadsheetML, RTF, HTM, XML, TXT, TIFF, etc.
Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF , BMP, PCX
Generates four types of PDF files , PDF­iHQC (level I-III), PDF/A, four types of XPS, XPS-iHQC (level I), DOCX, ODT, XLS, WordML, SpreadsheetML, RTF, HTM, XML, TXT, TIFF, etc.
Large volume recognition
Automated processing
Document indexing
Business card recognition
7
Chapter 1 – Introducing Readiris
Readiris Pro 12 Asian Readiris Corporate 12 Asian
Basic features Basic features
128 recognition languages 128 recognition languages
Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF , BMP, PCX.
Generates four types of PDF files , PDF­iHQC (level I), four types of XPS, XPS­iHQC (level I), DOCX, ODT, XLS, WordML, SpreadsheetML, RTF, HTM, XML, TXT, TIFF, etc.
Traditional and Simp lified Chinese recognition
Japanese recognition
Korean recognition
Business card recognition
Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF , BMP, PCX.
Generates four types of PDF files , PDF­iHQC (level I-III), PDF/A, four types of XPS, XPS-iHQC (level I), DOCX, ODT, XLS, WordML, SpreadsheetML, RTF, HTM, XML, TXT, TIFF, etc.
Traditional and Simp lified Chinese recognition
Japanese recognition
Korean recognition
Large volume recognition
Automated processing
Document indexing
Readiris Pro 12 Middle-East* Readiris Corporate 12 Middle-East*
Basic features Basic features
128 recognition languages
Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF ,
128 recognition languages
Supports PDF, DCX, DJV, DJVU, JPG, JPEG, J2C, J2K, JP2, PNG, TIF, TIFF ,
8
ReadirisTM Corporate 12 – User Guide
BMP, PCX.
Generates four types of PDF files , PDF­iHQC (level I), four types of XPS, XPS­iHQC (level I), DOCX, ODT, XLS, WordML, SpreadsheetML, RTF, HTM, XML, TXT, TIFF, etc.
Arabic and Farsi recognition
Hebrew recognition
Business card recognition
*No Mac version available
BMP, PCX.
Generates four types of PDF files , PDF­iHQC (level I-III), PDF/A, four types of XPS, XPS-iHQC (level I), DOCX, ODT, XLS, WordML, SpreadsheetML, RTF, HTM, XML, TXT, TIFF, etc.
Arabic and Farsi recognition
Hebrew recognition
Large volume recognition
Automated processing
Document indexing
9
ReadirisTM Corporate 12 – User Guide
CHAPTER 2
NSTALLING READIRIS
I

SYSTEM REQUIREMENTS

This is the minimal system configuration required to use Readiris:
a 486-based Intel PC or compatible. A Pentium-based PC is
recommended.
256 MB RAM. 120 MB free disk space.
(105 MB of disk space suffices when you do not install the sample files)
the Windows Vista, Windows XP or Windows 2000 operating
system.
Note: Readiris Corporate is optimised to use a screen resolution of at least 1,024 x 768 pixels.
Note that some scanner drivers may not work under the latest version(s) of Windows. See the documentation supplied with your scanner to find out which platforms are supported.
11
Chapter 2 – Installing Readiris

SOFTWARE INSTALLATION

How to install Readiris:
Log on to Windows as administrator or make sure you have the
necessary administration rights.
Connect your scanner to your PC and install the corresponding
software. Test your scanner. If you experience any problem contact your scanner manufacturer.
Insert the Readiris CD-ROM in the CD-ROM drive and follow
the on-screen instructions to install the software.
Click Readiris to start the installation (additional software
products are offered: Copernic Desktop Search Home Edition and Cardiris 4 LE).
Select the installation language and click OK. Accept the terms of the license agreement. A complete and a custom installation are offered. Select the
required options and click Next each time you are ready to go to the next screen. All lexicons and sample images will be installed by default, as well as an electronic user guide and an online help.
Click Finish to complete the installation.
The submenu I.R.I.S. Applications - Readiris on the Windows Programs menu is created automatically by the installation program. The installation program also creates a shortcut to the Readiris application on the Windows desktop.
12
ReadirisTM Corporate 12 – User Guide
Repeat the installation process to install any additional software
from the CD-ROM.

UNINSTALLING THE SOFTWARE

There is only one correct way to uninstall Readiris: by using the Windows (un)install wizard. You are strongly recommended not to uninstall Readiris or any of its software modules by manually erasing the program files.
To uninstall Readiris:
Close the application. On the Start menu, click Control Panel. Under the Programs icon, click Uninstall a program. Select Readiris in the list and click the Uninstall button. Follow the on-screen instructions.

SOFTWARE REGISTRATION

In order to use Readiris Corporate you are required to register. By doing so, you will:
be kept informed of future product developments and related
I.R.I.S. products;
13
Chapter 2 – Installing Readiris
be entitled to product support; be entitled to special offers on I.R.I.S. products.
To register:
Use the Registration wizard on the Register menu. Follow the on­screen instructions.

PRODUCT SUPPORT

Once you have registered your product, you are entitled to product support from I.R.I.S. on all basic software functionalities. Contact I.R.I.S. at:
Europe: support.pro@irislink.com Tel:+32 10 45 13 64
USA: support.pro@irisusa.com Tel.:+1 800 447 4744
Asia-Pacific: support.pro@irislink.com Tel.: +852 22646133
I.R.I.S. Software Maintenance and Support Services
I.R.I.S. also offers a Software Maintenance and Support Services Program, which allows you to obtain major software upgrades of Readiris.
To obtain the I.R.I.S. Software Maintenance and Support Services Program application form, please contact I.R.I.S. at readiris.maintenance@irislink.com.
14
ReadirisTM Corporate 12 – User Guide
CHAPTER 3
ETTING STARTED
G

RUNNING READIRIS

To run Readiris:
Start Readiris from the Windows Start menu or double-click the
shortcut on your desktop.
If you acquired Readiris Corporate you will be prompted to
register.
Click anywhere in the startup screen to launch Readiris.
The OCR Wizard automatically opens.

USING THE OCR WIZARD

The OCR Wizard allows you to quickly define all the settings needed to operate Readiris.
When you start Readiris, click anywhere in the startup screen to start the OCR Wizard.
15
Chapter 3 – Getting started
Step 1
Select the type of document you want to recognize. Readiris recognizes text pages, business cards and multiple business cards in a single scan.
For more information, see the section Selecting the document type.
Click Next to go to the next step.
Step 2
Select the image source. You can capture images using your scanner or open image files. Select the rotation and deskewing options you want to use.
For more information, see the section Selecting the options.
To familiarize yourself with Readiris, use the sample images provided with the software. They can be found on the Readiris CD-ROM and in the subfolder Samples of the Readiris installation folder.
Click Next to go to the next step.
Step 3
In case you selected a scanner, click the Change button to select the scanner settings.
For more information on the scanner settings, see the section Scanning paper documents.
Click OK to save the settings. Click Next to go to the next step.
Step 4
Click the Change button to change the document language. The document language is set to American English by default. Select the required language or language combination and secondary languages in the list and click OK. Use the slider to set the required Speed-Accuracy settings.
16
ReadirisTM Corporate 12 – User Guide
For more information, see the section Selecting the document language.
Click OK to save the settings. Click Next to go to the next step.
Step 5
Click the Change button to change the output format or target application. The default target application is Microsoft Word. Select the required output format or application in the Send to or External file list. Click the various tabs and select the options of your choice. Options that are unavailable for the chosen format/application appear dimmed.
For more information, see the chapter Formatting and saving documents.
Click OK to save the settings. Click Next to go to the next step.
Step 6 Click GO to open/scan and recognize the document.

USER INTERFACE

To explore the Readiris interface, click anywhere in the Readiris startup screen and click Cancel when the OCR Wizard launches.
The empty Readiris interface will be displayed.
17
Chapter 3 – Getting started
The Readiris interface is composed of: the SmartTasks (in the middle)
The SmartTasks are predefined commands that allow you to use the most frequent Readiris functions at the touch of a button.
Use the SmartTasks to scan, recognize and send your documents to the target application or output format of your choice.
The SmartTasks apply default settings but can be configured easily by right-clicking to fit more particular needs.
the main toolbar (left toolbar)
Use the main toolbar commands and options to scan and recognize documents manually. The order in which you are advised to do so is given in the OCR Wizard.
the image toolbar (right toolbar)
18
ReadirisTM Corporate 12 – User Guide
Use the image toolbar buttons to edit documents in the Readiris interface. Point to the different buttons to display their tooltips.
When a document has been opened or scanned in Readiris, three main zones are added to the interface:
the page toolbar (right of the main toolbar)
The page toolbar displays the page thumbnails, which provide settings information if pointed to.
the image window (in the middle) the document panel (at the bottom)
The document panel displays statistical information about the documents that are open in Readiris, such as the scan and OCR time, the resolution, width and height of the documents etc.
19
Chapter 3 – Getting started

CHANGING THE USER INTERFACE LANGUAGE

The user interface of Readiris is available in a wide range of languages.
To change the user interface language:
On the Settings menu, click User Interface Language. In the Language list, select the required language, then click OK
to confirm.
Note: If you selected an incorrect language, click Ctrl+U. The Language dialog box will open and you will be able to select another language in the list.
20
ReadirisTM Corporate 12 – User Guide
CHAPTER 4
HE READIRIS SMARTTASKS
T
When starting Readiris, click anywhere in the Readiris startup screen and click Cancel when the OCR Wizard launches. The Readiris SmartTasks will be displayed.
The SmartTasks are predefined commands that allow you to use the most frequent Readiris functions at the touch of a button.
The various SmartTask buttons allow you to:
21
Chapter 4 – The Readiris SmartTasks
1. Scan and recognize documents and send them directly to
Word for text processing;
Microsoft Word is the default target application. See the section Formatting text documents to learn more about the other available applications.
2. Scan and recognize documents and send them directly to
OpenOffice for text processing;
OpenOffice.org Writer is the default target application. See the section Formatting text documents to learn more about the other available applications.
3. Scan and recognize tables and send them directly to Excel and
other spreadsheets;
Microsoft Excel is the default target application. See the section Formatting table-based documents to learn more.
4. Scan and recognize documents and archive them as PDF
files;
Adobe Acrobat PDF Image-Text is the default output format. See the section Creating PDF documents to learn more about the other available formats.
5. Scan and recognize documents and archive them as XPS
files;
XPS Image-Text is the default output format. See the section Creating XPS documents to learn more about the other available formats.
6. Scan and recognize documents and send them directly by e-
mail;
The documents will be sent as PDF Image-Text by default via your default e-mail application. See the section Formatting documents to learn more about the other available formats.
22
ReadirisTM Corporate 12 – User Guide
7. Scan and recognize business cards.
The documents will be sent in the vCard format by default. See the section Recognizing business cards to learn more about the other available formats.
8. Scan and recognize document batches and apply document
separation and indexing options.
TIFF is the default output format. See the sections Separating document batches and Indexing document batches for more information.
When you are using Readiris for the first time you must configure the SmartTasks.
To configure the SmartTasks:
Right-click the SmartTask you want to use. Select Scanner or Image files as image source.
o When you select Scanner, Readiris will start your scanner
as soon as you click the SmartTask. The scanned document(s) will be displayed in the interface, processed and saved.
Your scanner must be configured correctly in order for the SmartTasks to work.
To do so:
Click the Scanner button on the main toolbar. Click Scanner model and select your scanner in the
list.
If your scanner is not in the list, select Twain other models.
23
Chapter 4 – The Readiris SmartTasks
Click Configure if applicable to select the Twain
source.
Then click OK to save the settings.
For more information on the scanner settings and on scanning paper
documents, see the section Scanning paper documents.
o When you select Image files and click the SmartTask,
Readiris opens the Input dialog box in which you can select the image files you want to process.
For more information on opening image files, see the section Opening image files.
Click Configure to change the output format and its options.
Note that the available output formats and options depend on the selected SmartTask.
See the chapter Formatting and saving documents to learn more about the available formats and options.
When you are using Business Card Recognition, select the card
style and output format.
For more information, see the chapter Recognizing business cards.
When you are using Document separation and indexing, click
Document processing to access the separation and indexing
options.
For more information, see the chapter Separating and indexing document batches.
When you are done configuring the SmartTasks, use the buttons
on the main toolbar to specify the language settings and image enhancement options, and if still needed the Scanner settings.
24
Loading...
+ 100 hidden pages