Nuance OMNIPAGE WEB User Manual

OmniPage® Web
User’s Manual
Caere Corporation
100 Cooper Court Los Gatos, California 95032-7603 USA www.caere.com
Caere GmbH
Innere Wiener Strasse 5 81667 München, Germany
Caere UK Information Centre
3 Catherine Place Westminster, London SW1E 6DX
Caere France
72, rue Baratte-Cholet 94100 Saint-Maur, France
Please Note
To use this program, you should know how to work in the Microsoft Windows environment. Please refer to Windows documentation if you have questions about how to use menu commands, dialog boxes, scroll bars, edit boxes, and so on.
OmniPage Web
Version 1
Copyright © 1999 Caere Corporation. All rights reserved. The Caere logo, Caere®, OmniPage®, OmniPage Web®, PageKeeper®, Language Analyst®, 3D OCR®, AutoWeb Toolbar™, and OCR Proofreader are trademarks of Caere Corporation.
All other trademarks are the property of their respective companies.
800-1308-030A1

Table of Contents

Welcome
Using This Manual................................................................................................................2
Getting Online Help .............................................................................................................3
Help Menu ......................................................................................................................3
Context-Sensitive Help..................................................................................................3
Product Support ....................................................................................................................4
Chapter 1 Installation and Setup
Minimum System Requirements.........................................................................................6
Installing OmniPage Web ...................................................................................................6
Setting Up Your Scanner with OmniPage Web................................................................7
Starting OmniPage Web.......................................................................................................8
Registering OmniPage Web ................................................................................................9
Chapter 2 Introduction to OmniPage Web
What Is Optical Character Recognition (OCR)?..............................................................12
What Is Outlining? ..............................................................................................................12
Basic Steps of Creating a Web Page..................................................................................13
The OmniPage Web Desktop ............................................................................................14
AutoWeb Toolbar ........................................................................................................16
Standard Toolbar..........................................................................................................17
Zone Toolbar.................................................................................................................17
Outline Toolbar ............................................................................................................18
Options Dialog Box......................................................................................................18
HTML Options Dialog Box.........................................................................................19
Chapter 3 Processing Documents
Ways to Process Documents..............................................................................................22
Using the Web Wizard ................................................................................................22
Automatic Processing .................................................................................................23
Performing Multiple Tasks at Once...........................................................................24
Bringing Document Images into OmniPage Web .........................................................24
Scanning Pages .............................................................................................................24
Loading Image Files ....................................................................................................25
Creating Zones for OCR ....................................................................................................27
Creating Zones Automatically...................................................................................27
Performing OCR on a Document .....................................................................................28
Proofreading OCR Results ................................................................................................29
Modifying Words.........................................................................................................30
Outlining a Document........................................................................................................30
Editing Outline Results ......................................................................................................31
Selecting HTML Components ...........................................................................................33
Working with Documents .................................................................................................34
Resizing a Page View .................................................................................................35
Changing Pages ...........................................................................................................36
Reordering Pages ........................................................................................................37
Deleting Pages .............................................................................................................38
Printing a Document ..................................................................................................38
Closing a Document ...................................................................................................38
Saving a Document ............................................................................................................39
Testing Your HTML Document ........................................................................................41
Chapter 4 OmniPage Web Settings
Setting AutoWeb Toolbar Commands.............................................................................44
AUTO Button Commands ..........................................................................................45
Image Button Commands...........................................................................................46
Zone Button Commands.............................................................................................47
OCR Button Commands .............................................................................................48
Outline Button Commands.........................................................................................49
Export Button Commands..........................................................................................50
Selecting Options ................................................................................................................51
Accuracy Settings.........................................................................................................52
Scanner Settings ...........................................................................................................53
Page Format Settings ...................................................................................................54
Language Settings........................................................................................................55
Process Settings ............................................................................................................56
Selecting HTML Options....................................................................................................57
General Settings ...........................................................................................................58
Components Settings...................................................................................................59
Component Styles Settings.........................................................................................60
Chapter 5 Customizing Your Web Page
Making Your Web Page More Effective...........................................................................62
Using Themes ......................................................................................................................64
Adjusting Page Images Before OCR.................................................................................66
Customizing Zones .............................................................................................................67
Zone toolbar..................................................................................................................67
Reordering Zones.........................................................................................................68
Modifying Zones..........................................................................................................68
Deleting Zones..............................................................................................................69
Changing Zone Properties..........................................................................................69
Creating User Dictionaries.................................................................................................72
Chapter 6 Technical Information
General Troubleshooting Solutions .................................................................................74
Solutions to Try First ...................................................................................................74
Testing OmniPage Web...............................................................................................75
Low Memory Problems...............................................................................................76
Low Disk Space Problems ..........................................................................................76
Supported File-Format Types............................................................................................77
Scanner Setup Issues...........................................................................................................78
Scanner Drivers Supplied by the Manufacturer......................................................78
Scanner Drivers Supplied by Caere ..........................................................................79
Scan Manager is Needed with OmniPage Web ......................................................79
Problems Connecting OmniPage Web to Your Scanner ........................................80
Missing Scan Image Command ...................................................................................80
Scanner Message on Launch ......................................................................................81
System Crash Occurs While Scanning......................................................................81
Scanner Not Listed in Supported Scanners List Box ..............................................81
Scanning Tips................................................................................................................82
OCR Problems .....................................................................................................................83
System Crash During OCR.........................................................................................83
Text Does Not Get Recognized Properly..................................................................83
Problems With Fax Recognition ................................................................................84
Index

Welcome

Welcome to OmniPage Web, and thank you for using our software! The following documentation has been provided to help you learn about OmniPage Web.
This User’s Manual
This manual introduces you to the basics of using OmniPage Web. It includes installation and setup instructions, an introduction to OmniPage Web, task-oriented instructions, ways to customize processing, settings guidelines, and technical information.
OmniPage Web Online Help
The online Help contains information on features, settings, and procedures. It has been designed for quick and easy information retrieval. Please see “Getting Online Help” on page 3 for more information.
OmniPage Web Readme File
The OmniPage Web Readme file contains last-minute information about the software. The Readme file has been provided to you in HTML format. To open the Readme file after installation, click Start in the Windows taskbar and choose Programs > Caere Applications > Caere Documents > OmniPage Web Readme.
Scanner Setup Notes
The Scanner Setup Notes is an HTML file that contains information about the Caere Scan Manager and supported scanners. To open this HTML file after installation, click Start in the Windows taskbar and choose Programs > Caere Applications > Caere Documents > Scanner Setup Notes.

Using This Manual

This manual is written with the assumption that you know how to work in the Microsoft Windows environment. Please refer to your Windows documentation if you have questions about how to use dialog boxes, menu commands, scroll bars, drag and drop functionality, shortcut menus, and so on.
The following conventions are used in this manual.
Convention        Purpose
Italicized text   • Emphasizes menu commands, dialog box options, labeled buttons, and file names
                    For example: “Choose Open... in the File menu.”
                  • Emphasizes new terms the first time they are used
                  • Emphasizes important words in a sentence
monospaced text   • Indicates physical file names and folder locations
Note symbol       Introduces a tip or an item of note about the current topic
Warning symbol    Introduces important information about the current topic

Getting Online Help

In addition to using this manual, you can use OmniPage Web’s online Help to learn about features, settings, and procedures. Online Help is available after you install OmniPage Web.

Help Menu

One way to open OmniPage Web’s online Help is to choose commands in the Help menu.
OmniPage Web Help Topics is the first command in the Help menu. Choose it to get contents and index listings for all Help topics.
Choose How to Use Help... in OmniPage Web’s Help menu to get information on using Windows Help.
Choose Caere on the Web > OmniPage Web to connect to our Web site for the latest information on using OmniPage Web.

Context-Sensitive Help

You can get on-the-spot information about a particular OmniPage Web command, toolbar button, or dialog box option in the following ways:
• Click the Help button in the Standard toolbar and then click any toolbar button, menu command, or area of the OmniPage Web desktop to display a Help topic explaining that item.
• Click the question-mark button in the upper-right corner of a dialog box and then click an item in the dialog box to get a pop-up explanation for that item.

Product Support

For the fastest and easiest way to get help, please look for solutions in this manual or in the online Help. See “General Troubleshooting Solutions” on page 74 for more information.
If you need additional help, please use the following resources.
• Caere on the World Wide Web
To access Caere’s corporate Web site for general product and company information, choose Caere on the Web > Caere Web site in the Help menu. Caere’s Web site address is www.caere.com.
To connect to the OmniPage Web site for the latest information on using OmniPage Web, choose Caere on the Web > OmniPage Web in the Help menu.
To contact Product Support for common questions and answers, updates, patches, and troubleshooting procedures, choose Caere on the Web > Product Support in the Help menu.
• OmniPage Web Readme file
Read the OmniPage Web Readme file for last-minute information about the software. The Readme file has been provided to you in HTML format. This file is available after installing OmniPage Web. To open the file, click Start in the Windows taskbar and choose Programs > Caere Applications > Caere Documents > OmniPage Web Readme.
• Caere Product Support document
Read the Caere Product Support document to get a list of support telephone numbers, including ones for international product support. This document has been provided to you as an electronic document in HTML format. To open this document, click Start in the Windows taskbar and choose Programs > Caere Applications > Caere Documents > Product Support.
Chapter 1

Installation and Setup

This chapter provides information on installing and starting OmniPage Web.
Please continue reading for information on these topics:
• Minimum System Requirements
• Installing OmniPage Web
• Setting Up Your Scanner with OmniPage Web
• Starting OmniPage Web
• Registering OmniPage Web

Minimum System Requirements

You need the following setup, at minimum, to install and run OmniPage Web:
• Computer with a Pentium or higher processor
• Microsoft Windows 95, Windows 98, or Windows NT 4.0
• 16MB of memory (RAM) for Windows 95 and 98
32MB of memory (RAM) for Windows NT 4.0
• 45MB of free hard disk space to install application files, the Scan Manager, and one language
55MB to install above files and all languages
• SVGA or VGA monitor with 256 colors and 800 x 600 resolution
• Windows-compatible pointing device
• CD-ROM drive for installation
• A compatible scanner if you plan to scan documents
Please see the Scanner Setup Notes for a list of tested scanners.
• A Web browser to view your HTML documents
You may also want to use an HTML editor to make changes to the HTML code that is generated by OmniPage Web.
Performance and speed will be enhanced if your computer’s processor, memory, and available disk space exceed the minimum requirements.
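The disk space figures above can be checked before installation. The following Python sketch is only an illustration and is not part of OmniPage Web: the function names are hypothetical, and it assumes a megabyte is 1024 x 1024 bytes, which this manual does not specify.

```python
import shutil

MB = 1024 * 1024  # assumption: the manual does not say how it counts a megabyte

# Figures taken from the requirements list above.
REQUIRED_MB_ONE_LANGUAGE = 45   # application files, Scan Manager, one language
REQUIRED_MB_ALL_LANGUAGES = 55  # the files above plus all languages


def required_bytes(all_languages=False):
    """Return the free space needed for the chosen install size, in bytes."""
    megabytes = REQUIRED_MB_ALL_LANGUAGES if all_languages else REQUIRED_MB_ONE_LANGUAGE
    return megabytes * MB


def has_enough_space(free_bytes, all_languages=False):
    """Report whether the given free space covers the install."""
    return free_bytes >= required_bytes(all_languages)


if __name__ == "__main__":
    # Check the drive that holds the current directory.
    free = shutil.disk_usage(".").free
    print("One language:", has_enough_space(free))
    print("All languages:", has_enough_space(free, all_languages=True))
```

Run the script from the drive where you plan to install; it reports only free disk space and does not check memory, processor, or display requirements.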

Installing OmniPage Web

OmniPage Web’s Setup program takes you through installation with instructions on the screen at every step.
Before installing OmniPage Web:
• If you are using a scanner with OmniPage Web, make sure it is connected, turned on, and compatible with your system.
• Close all other applications, especially anti-virus programs.
• Log into your computer with administrator privileges if you are installing on Windows NT.

To install OmniPage Web:
1 Insert OmniPage Web’s CD-ROM in the CD-ROM drive.
OmniPage Web’s Setup program should start automatically. If it does not start, locate your CD-ROM drive in Windows Explorer and double-click the Setup.exe program at the top level of the CD-ROM.
2 Follow the instructions on the screen to finish installation.
By default, OmniPage Web installs all its program files in this folder location:
c:\Program Files\Caere\OmniPageWeb10
Setting Up Your Scanner with OmniPage Web
Caere Scan Manager is installed during OmniPage Web installation. This sets up your scanner (or another input device such as a digital camera) so you can use it with OmniPage Web. During installation, Caere Scan Manager prompts you to select your scanner manufacturer and model or other input device.
Read the Scanner Setup Notes for more information on Caere Scan Manager and supported scanners. You can open the Scanner Setup Notes after OmniPage Web has been installed by clicking Start in the Windows taskbar and choosing Programs > Caere Applications > Caere Documents > Scanner Setup Notes.
After installation, you can use Caere Scan Manager to set up additional scanners with OmniPage Web and change various scanner settings. Caere Scan Manager is located in the Windows Control Panel.
To open Caere Scan Manager:
1 Click Start in the Windows taskbar and choose Settings > Control Panel.
2 Double-click the Caere Scan Manager icon.

Starting OmniPage Web

To start OmniPage Web, click Start in the Windows taskbar and choose Programs > Caere Applications > OmniPage Web 1.0. Or, double-click the OmniPage Web icon on your Windows desktop.
OmniPage Web’s desktop appears when you open OmniPage Web. See “The OmniPage Web Desktop” on page 14 for an introduction to OmniPage Web’s user interface.
[Figure: the OmniPage Web desktop, with callouts for the Standard toolbar, the Zone toolbar, and the AutoWeb toolbar.]
The thumbnail view displays the pages in an open document.
The image view displays the current page’s original image.
The text view displays the current page’s recognized text and retained graphics.

Registering OmniPage Web

Register your copy of OmniPage Web with Caere Corporation to receive access to product support, notification of special offers, and the best prices on product upgrades.
To register OmniPage Web:
1 Click the Register menu to open the Register dialog box.
2 Click Register Now.
3 Fill out the information requested on the screen and then click Next.
4 Follow the instructions on the screen.
OmniPage Web will decide on the best method of registration according to your country and computer system. It may try using modem, FTP, or HTTP connections to transmit your registration information directly. Or, it may prompt you to call a phone number or print out and mail in your registration information.
After registration is complete, you will be given a registration number. Be sure to write that number down and keep it handy in case you need to use it for reinstallation.
To re-register OmniPage Web after reinstallation:
1 Click the Register menu to open the Register dialog box.
2 Click Re-register.
3 Type in your registration number and click OK.
Chapter 2

Introduction to OmniPage Web

You probably have documents lying on your desk that you would like to share with the rest of your company, or, perhaps, the rest of the world. You could photocopy the information and mail it to anyone who might be interested, or you could retype it and hand-code it in HTML format. Neither of these is an appealing option.
OmniPage Web offers a smart solution to increase your productivity and the visibility of your documents. OmniPage Web’s optical character recognition (OCR) technology accurately and easily converts scanned paper documents and image files into editable text. Once you have converted your paper documents into electronic ones, OmniPage Web outlines the document structure and creates a complete, dynamic Web site with separate Web pages for each chapter or section. OmniPage Web even creates hypertext links, navigation tools, and a hyperlinked table of contents.
Within minutes you have an HTML file that can be published to the World Wide Web, or your company’s intranet, making your documents instantly available to anyone in the world.
Please continue reading this chapter for information on these topics:
• What Is Optical Character Recognition (OCR)?
• What Is Outlining?
• Basic Steps of Creating a Web Page
• The OmniPage Web Desktop

What Is Optical Character Recognition (OCR)?

Optical character recognition (OCR) is the process of turning an image into computer-editable text. An image is an electronic picture of text such as a scanned paper document or an electronic fax file. Images do not have editable text characters; they have many tiny dots (pixels) that together form a picture of text.
During OCR, OmniPage Web analyzes an image and defines characters to produce editable text. After OCR, you can convert the resulting text to HTML format using OmniPage Web’s outlining feature.

What Is Outlining?

Outlining is the process of examining the structure of a document, detecting original document elements (called objects in OmniPage Web), and creating hypertext links.
OmniPage Web can recognize and outline these objects in the original document during outlining:
• Headline (the title of the document)
• Headings (levels 1 - 6)
• Body text
• Captions
• Tables
• Graphics
• Headers and footers
• URLs and e-mail addresses
• Cross-references
Once outlining is complete, the document outline appears in outline view where you can do the following:
• Filter which objects appear in outline view
• Change the hierarchy of the objects
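To make the idea of an object hierarchy concrete, the Python sketch below models a tiny outline and renders it as HTML. It is an illustration only and does not show how OmniPage Web works internally: the OutlineObject class, the "heading" and "body" labels, and the mapping of heading level n to an <hn> element are all assumptions made for this example.

```python
from dataclasses import dataclass
from html import escape

MIN_LEVEL, MAX_LEVEL = 1, 6  # the manual lists heading levels 1 - 6


@dataclass
class OutlineObject:
    kind: str       # hypothetical labels: "heading" or "body"
    text: str
    level: int = 0  # only meaningful for headings


def promote(obj):
    """Move a heading one level up the hierarchy, stopping at level 1."""
    if obj.kind == "heading" and obj.level > MIN_LEVEL:
        obj.level -= 1
    return obj


def demote(obj):
    """Move a heading one level down the hierarchy, stopping at level 6."""
    if obj.kind == "heading" and obj.level < MAX_LEVEL:
        obj.level += 1
    return obj


def to_html(objects):
    """Render headings as <h1>..<h6> and everything else as paragraphs."""
    lines = []
    for obj in objects:
        text = escape(obj.text)  # keep characters such as & and < valid in HTML
        if obj.kind == "heading":
            lines.append(f"<h{obj.level}>{text}</h{obj.level}>")
        else:
            lines.append(f"<p>{text}</p>")
    return "\n".join(lines)
```

For example, promoting a level 2 heading and then calling to_html would emit it as an <h1> element, which is one plausible way a change in outline view could carry through to the exported page.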

Basic Steps of Creating a Web Page

These are the basic steps of OmniPage Web’s HTML-conversion process.
1 Bring a document image into OmniPage Web.
You can scan a paper document or load an image file. The resulting image appears in OmniPage Web’s image view. See “Bringing Document Images into OmniPage Web” on page 24 for more information.
2 Create zones to identify areas you want to recognize as text or retain as graphics.
Zones are borders that enclose the areas of a document image that will get processed. You can create zones automatically, manually, or with a template. Any areas not enclosed by zones are ignored during OCR. See “Creating Zones for OCR” on page 27 for more information.
3 Perform OCR to convert text information into editable text characters.
During OCR, OmniPage Web interprets text characters in an image. After OCR, you can check and correct errors in the text using the OCR Proofreader. See “Performing OCR on a Document” on page 28 for more information.
4 Outline the original document structure.
During outlining, OmniPage Web examines the document structure and creates an outline of the original document objects based upon the zone order. See “Outlining a Document” on page 30 for more information.
5 Select additional HTML components and set output formatting options.
After outlining, you can select how you want the objects formatted and include additional HTML components to make your Web page more interesting and effective. See “Selecting HTML Components” on page 33 for more information.
6 Save and test your Web page.
Save your document and test it in more than one browser to be sure that the formatting is acceptable. You can choose Save and Launch... in the Export button’s drop-down list to automatically launch your browser when you save the HTML document. See “Testing Your HTML Document” on page 41 for more information.
The OmniPage Web Desktop

Before a document is outlined, OmniPage Web’s desktop displays the pages of the open document in its thumbnail view, image view, and text view. You can use buttons in the Standard, AutoWeb, and Zone toolbars to perform various tasks on the document.
[Figure: the OmniPage Web desktop before outlining, with callouts for the Standard toolbar, the Zone toolbar, and the AutoWeb toolbar.]
The thumbnail view displays a picture of each page in the document. The current page is highlighted with a light border around it.
The image view displays the current page’s original image.
The text view displays the current page’s recognized text and retained graphics.
Drag the splitter between views to the left or right, or up or down, to resize a view.
After a document is outlined, OmniPage Web’s desktop displays the document outline in outline view, the original image in image view, and a preview of the HTML document in HTML view.
[Figure: the OmniPage Web desktop after outlining, with a callout for the Outline toolbar.]
The outline view displays an outline of the original document objects.
The image view displays the current page’s original image.
The HTML view displays a preview of the HTML-formatted document.
Drag the splitter between views to the left or right, or up or down, to resize a view.

AutoWeb Toolbar

The AutoWeb toolbar contains buttons that can activate each step of the HTML-conversion process.
[Figure: the AutoWeb toolbar, with callouts for the AUTO, Image, Zone, OCR, Outline, and Export buttons. Click the down arrow to display the commands in a button’s drop-down list.]
You can set different commands in the AutoWeb toolbar buttons for the operations you want to perform. Choose a command using each button’s drop-down list.
• The AUTO button allows you to activate automatic processing or use the Web Wizard.
• The Image button allows you to bring in images by scanning or loading pages.
• The Zone button allows you to automatically create zones on images based on their original page layouts or predefined templates.
• The OCR button allows you to perform OCR and check OCR results.
• The Outline button allows you to outline the document structure and create hypertext links.
• The Export button allows you to save your document or save and launch your Web browser or HTML editor to view the HTML document.
Please see “Setting AutoWeb Toolbar Commands” on page 44 for more information on each toolbar button.

Standard Toolbar

The Standard toolbar contains buttons and a drop-down list for performing standard tasks.
[Figure: the Standard toolbar, with callouts for New, Open, Save, Print, Copy, Undo, Proofread OCR, View Image Editor, Options, HTML Options, Straighten Image, Rotate Image, Zoom, and Help.]

Zone Toolbar

The Zone toolbar contains buttons that allow you to draw and define zones on a page image.
[Figure: the Zone toolbar, with callouts for Draw Rectangular Zones, Draw Irregular Zones, Add to Zone, Subtract from Zone, Reorder Zones, Zone Properties, and the Table tools: Insert Row Dividers, Insert Column Dividers, Move Row or Column Dividers, Remove Row or Column Dividers, and Remove/Replace All Row and Column Dividers.]
See “Customizing Zones” on page 67 for more information.

Outline Toolbar

The Outline toolbar contains buttons that allow you to filter which objects are visible in the outline, making the outline easier to read and edit. You can also use the Outline toolbar to change the object hierarchy by promoting, demoting, changing, or deleting objects.

[Figure: the Outline toolbar, with callouts for Promote, Demote, Demote to Body Text, Change to Header, Change to Footer, Delete, Filter Objects..., and Remove/Apply Filter.]

Options Dialog Box

You can select settings for processing in the Options dialog box. To open it, click the Options button or choose Options... in the Tools menu.
[Figure: the Options dialog box. Click the tabs in the Options dialog box to view and select different settings.]
See Chapter 4, OmniPage Web Settings, for more information on settings.

HTML Options Dialog Box

You can select settings for HTML components in the HTML Options dialog box. To open it, click the HTML Options button or choose HTML Options... in the Tools menu.
[Figure: the HTML Options dialog box. Click the tabs in the HTML Options dialog box to view and select different settings.]
See Chapter 4, OmniPage Web Settings, for more information on settings.
Chapter 3

Processing Documents

This chapter describes how to work with documents in OmniPage Web, including each step of converting paper documents to HTML.
There are different ways to accomplish the same tasks in OmniPage Web. You can use toolbar buttons or menu commands to start procedures. OmniPage Web can perform all steps automatically, or you can start each step individually. You can even do different tasks at the same time.
Please continue reading this chapter for information on these topics:
• Ways to Process Documents
• Bringing Document Images into OmniPage Web
• Creating Zones for OCR
• Performing OCR on a Document
• Proofreading OCR Results
• Working with Documents
• Outlining a Document
• Editing Outline Results
• Saving a Document
• Testing Your HTML Document
For complete information on all OmniPage Web commands, settings, and procedures, please use OmniPage Web’s online Help.

Ways to Process Documents

OmniPage Web instantly turns a paper document into an HTML file that you can publish as a Web page. The basic steps of OmniPage Web’s HTML-conversion process are explained on page 13. The following is a summary of those steps.
1 Bring a document image into OmniPage Web.
See page 24 for more information.
2 Create zones to identify areas you want to recognize as text or
retain as graphics. See page 27 for more information.
3 Perform OCR to convert text information into editable text
characters. See page 28 for more information.
4 Outline the document to detect structural elements such as headings, body text, headers and footers, and to link cross-references, e-mail addresses, and URLs to their destinations.
See page 30 for more information.
5 Select Web page formatting and design options in the HTML
Options dialog box. See page 33 for more information.
6 Export the document to the desired location.
See page 39 for more information.
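The six steps above always run in the same order, so they can be pictured as a simple pipeline. The Python sketch below is a hypothetical model, not OmniPage Web code: the stage names are labels invented for this example, and the function only works out which steps are still left for a document.

```python
# Hypothetical labels for the six steps listed above, in processing order.
STAGES = ["image", "zone", "ocr", "outline", "html_options", "export"]


def remaining_stages(completed):
    """Return the stages still to run, in order, given the completed ones."""
    done = set(completed)
    return [stage for stage in STAGES if stage not in done]
```

Under this model, a document whose pages have already been loaded and zoned would continue from OCR, because remaining_stages(["image", "zone"]) leaves the OCR, outline, HTML options, and export stages.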

Using the Web Wizard

The Web Wizard guides you through the entire HTML-conversion process by asking you questions about your document and selecting the appropriate settings for you.
To process your document using the Web Wizard:
1 Set Web Wizard as the command in the AUTO button’s drop-down list.
2 Click AUTO or choose Web Wizard in the Process menu.
The first wizard screen appears.
3 Answer the question in the first screen and click Next.
4 Continue answering questions in the screens that follow.

Automatic Processing

Use the AUTO button to process a new document from start to finish or to finish processing an open document.
To process your document automatically:
1 Set AutoWeb as the command in the AUTO button’s drop-down list.
2 Set the desired Image, Zone, OCR, Outline, and Export commands. See “Setting AutoWeb Toolbar Commands” on page 44 for more information.
3 Choose Options... in the Tools menu and check that settings are appropriate for your document. See “Selecting HTML Options” on page 57 for more information.
4 Place your document in your scanner if you are scanning.
5 Click AUTO or choose AutoWeb in the Process menu.
Each page of the document is processed and finished in order according to the selected commands. If page images in an open document already have zones, OmniPage Web will skip zoning those pages and continue with the selected OCR, outline, and export operations.