The software described in this book is furnished under license and may
be used or copied only in accordance with the terms of such license.
IMPORTANT NOTICE
TRADEMARKS AND CREDITS
Scansoft, Inc. provides this publication “as is” without warranty of any
kind, either express or implied, including but not limited to the implied
warranties of merchantability or fitness for a particular purpose. Some
states or jurisdictions do not allow disclaimer of express or implied
warranties in certain transactions; therefore, this state-ment may not
apply to you. Scansoft reserves the right to revise this publication and to
make changes from time to time in the content hereof without obligation
of Scansoft to notify any person of such revision or changes.
TextBridge is a registered trademark, and Smart Zones, Instant Access
OCR, and Custom Proof are trademarks, of Scansoft, Inc., a Xerox
Company. Xerox, The Document Company, and the Stylized X are
trademarks of Xerox Corp.
Excel, Word, and Windows are trademarks of Microsoft Corp.
WordPerfect is a registered trademark of WordPerfect Corp.
Other terms used in this manual are the trademarks of their respective
holders.
Using the Help System ............................. 3–4
Using the Sample Documents ........................ 3–6
Session 1: Processing a Simple Document
Using Auto Processing ........................ 3–12
Session 2: Using Instant Access OCR.................. 3–19
Session 3: Processing a Complex Document Using Manual
Processing ................................. 3–25
Session 4: Processing Text, Pictures, and a Table ........ 3–35
Session 5: Training OCR and Using the Page toolbar...... 3–42
Where to Go From Here............................ 3–51
INDEX
ivTextBridge Pro 98 User’s Guide
PREFACE
ScanSoft, Inc., a Xerox Company, welcomes you to TextBridge®
Pro 98 for Windows 95™ and Windows NT. (Hereinafter
TextBridge Pro 98 will be referred to as “TextBridge.”)
Before going on to find out more about TextBridge, please read
this preface because it describes these important items:
◆About this user’s guide
◆Related documentation
◆Technical support
ABOUT THIS USER’S GUIDE
This user’s guide includes introductory information designed
primarily for non-technical users as well as information designed
for more technical users. It assumes that you are familiar with
the management and operation of your computer and Windows.
The documentation that comes with TextBridge should provide
all the information you need to operate TextBridge. TextBridge
documentation includes this user’s guide, a Help system, and
Release Notes. ScanSoft invites your comments about the
information provided in the documentation. Please make sure to
register your software and provide any comments to ScanSoft.
TextBridge Pro 98 User’s Guidev
Organization of this user’s guide
This user’s guide is designed as a reference tool to provide basic
information about TextBridge. It is organized as follows:
◆Chapter 1, “Introduction to TextBridge,” discusses TextBridge’s
features. It also describes: documents TextBridge can recognize,
what comes with TextBridge, supported scanners, system
requirements, installation, setting up Instant Access, uninstalling
TextBridge, and input and output file formats.
◆Chapter 2, “OCR and TextBridge,” provides an explanation of the
concepts of document recognition and OCR and the basic
functionality of TextBridge.
◆Chapter 3, “Learning to Use TextBridge,” walks you through
several practice sessions designed to provide a firm basis on
which to learn and use the important features of TextBridge.
This user’s guide also provides a comprehensive index for you to
quickly locate the information you need.
viTextBridge Pro 98 User’s Guide
Documentation conventions
As described in Table P–1, TextBridge documentation uses
certain graphical elements and formatting to emphasize
information and give more meaning to text.
Table P–1. Documentation Conventions
boldIntroduces a new term or the first use of an
italicDenotes titles of other user’s guides or books
monospaceDenotes text that appears on the computer
“ ” (quotes)Denotes titles of chapters and sections in this
important term in a chapter. Sometimes used
to denote strong in-line emphasis.
and generic representations of file name entries
in examples; for example, filename
screen such as examples, menu text, and
messages plus actual file names.
user’s guide.
☞
NoteIntroduces information of note about the
RELATED DOCUMENTATION
TextBridge provides a comprehensive set of printed and online
documentation designed to assist you in learning and operating
the product. The documentation provided with TextBridge covers
all aspects of installation and operation.
In addition to this TextBridge Pro 98 User’s Guide, refer to the
following documentation for more information:
Prefacevii
Introduces tips that provide useful information
about a procedural step or system function.
current subject.
◆Online Release Notes—After you install TextBridge, read the
online Release Notes first. These provide the most up-to-date
information about TextBridge. Release Notes automatically
appears in the TextBridge 98 folder. Simply point to Release
Notes in the TextBridge 98 folder to open the Release Notes so
that you can read them.
◆Help—An extensive online Help system comes with TextBridge.
The Help provides you with information about the software in
general; the menus, commands, and tools; step-by-step
procedures; and a glossary.
◆TextBridge online electronic documentation—This includes
an electronic version of this TextBridge Pro 98User’s Guide in
Adobe Acrobat format (.pdf). The documentation resides on the
compact disk in the directory TextBridgePro Documents.
Please refer to the Release Notes in that directory for information
about using the online documentation.
◆Multimedia Guided Tour—The Guided Tour provides you with
an introduction to TextBridge.
NoteYou may need to refer to additional publications, such as the
manufacturer’s documentation for your scanner.
TECHNICAL SUPPORT
If you should experience problems with TextBridge that you
cannot resolve with the documentation and software, contact
TextBridge Technical Support.
You can contact TextBridge Technical Support by the Internet,
telephone, or fax.
This information will assist Technical Support in solving the
problem:
viiiTextBridge Pro 98 User’s Guide
◆Your software version number
(This is on the back of the CD-ROM case and in the Help menu
under About TextBridge.)
◆Your software serial number
(This is the serial number on the back of the TextBridge CD-ROM
case and in the Help menu under About TextBridge.)
◆Your scanner make and model
◆A description of the steps that led up to the problem
◆If TextBridge generated an error message, a verbatim description
of the error message or its number
Internet and electronic mail addresses
You can also contact Technical Support and get information about
TextBridge on the Internet at the addresses in the following list:
◆TextBridge site: www.textbridge.com
The TextBridge Web site provides a link to Technical Support
with Frequently Asked Questions, technical information
bulletins, and a problem report form.
E-mail in the United States, Canada, or the Pacific Rim:
E-mail from European countries and the Middle East:
◆Technical Support: uk_support@xis.xerox.com
◆Upgrade information: xisuk@xis.xerox.com
Prefaceix
Telephone and fax numbers
Call one of the following telephone numbers or send a fax
describing the problem to one of the fax numbers.
In the United States, Canada, or the Pacific Rim:
☎ Telephone: 978–977–0764
Fax: 978–977–2434
From European countries and the Middle East:
Xerox Scansoft Ltd. in England:
☎ Telephone: +44 (0) 1923 209140
Fax: +44 (0) 1923 208446
xTextBridge Pro 98 User’s Guide
1
INTRODUCTION TO
TEXTBRIDGE
Welcome to ScanSoft’s TextBridge™ Pro 98, optical character
recognition (OCR) software for Microsoft Windows™ 95 and
Windows NT. (Hereinafter TextBridge Pro 98 will be referred to
as “TextBridge.”)
This chapter provides an introduction to TextBridge including:
◆Features and benefits
◆What comes with TextBridge
◆Scanners supported
◆System requirements
◆Installing TextBridge
◆Setting up TextBridge Instant Access
◆Uninstalling TextBridge
◆Input image file formats supported
◆Output text file formats supported
OCR is a technology that enables you to reproduce the paper
documents you use every day into fully editable text on your
computer. TextBridge even retains the layout of the original
document when possible.
TextBridge Pro 98 User’s Guide1–1
You can use TextBridge to convert printed documents from fax
machines, photocopiers, and dot matrix and laser printers to
electronic documents for your word processor or text
application as well as documents for some database, desktop
publishing, and spreadsheet software. TextBridge OCR can also
recognize page image files from scanners as well as fax
machines and other sources.
FEATURES AND BENEFITS
Using Xerox’s latest document recognition technology, DocuRT™,
TextBridge OCR produces a fully-editable electronic document
that retains the original document layout, complete with text and
pictures (Figure 1–1). TextBridge understands your original
document format, and the keeps the layout the same, including
columns, headers, footers, pictures and picture captions. This
feature is supported only if your text application supports
pictures and layout. For example, this feature is supported in
Microsoft Word and WordPerfect but not in Notepad.
1–2TextBridge Pro 98 User’s Guide
Original document
Recomposed document
in word processor
Figure 1–1. TextBridge document recomposition
TextBridge offers many productivity features. Whether you need
to capture a simple one-page letter, a magazine article, a
spreadsheet, or a long transcript, TextBridge can save you
valuable time and effort. In addition, TextBridge provides all the
capabilities that experienced OCR users have come to expect.
Introduction1–3
New Productivity features in TextBridge Pro 98
TextBridge offers these major features:
◆Improved OCR accuracy. Dramatically saves time and
eliminates retyping.
◆Instant Access™. You can start TextBridge within most
Windows text programs. After recognizing and converting the
page image to text, TextBridge then automatically pastes
recognized data (text and pictures) directly into the text
program’s open document.
recomposition to retain your original page layout. It reproduces
multiple columns, cell tables, and pictures in the same location as
they are in your original document.
• When you specify output to Microsoft Word™ or WordPerfect
format, TextBridge can retain the original document layout in
fully-editable form, even for pages containing tables, line art,
reverse video, drop caps, insets, and pictures. When you edit
the document, the original text flow is maintained.
• When you specify output to Microsoft Excel™ or Lotus 1-2-3
format, spreadsheets and cell tables retain their original
layout as cell tables not tabbed columns. When you edit the
table information, the lines move to fit exactly as you would
expect.
TextBridge supports formats for the the following applications
that retain page layout:
• Excel 3.0–5.0
• Excel Max 3.0–7.0
• Excel Office 97
®
1–4TextBridge Pro 98 User’s Guide
• HTML
• HTML Editor
• HTML Netscape
• Lotus 1-2-3
• Quatro Pro
• Word 6.0
• Word 7.0 in Office 95
• Word for Windows 2.x
• Word Office 97
• Word Perfect 6.1, 7.0, and 8.0
◆TextBridge wizard. An easy-to-use wizard guides you through
each step of the TextBridge process, including page type selection
and recomposition options.
◆Page type templates. TextBridge provides many predesigned
page type templates to make processing more efficient. Templates
automatically provide appropriate settings for the type of page
you want to process. For example, there is a magazine page type
and a letter page type that automatically activate settings for
improved results. Page types incorporate three page settings:
page size, page loayout, and print quality. You do not have to go
through a complicated process of determining and specifying
settings for common types of pages.
◆Built-in Proofreader™. After document recognition, you can use
TextBridge’s built-in proofreader to view and accept or correct
any words that TextBridge suspects may not be recognized
accurately.
◆Automatic zoning and zone editing. TextBridge automatically
zones your page is into text, picture, and table zones.
Introduction1–5
◆Zone editing. You can edit the automatic zones to further refine
the zoning. Use zone editing to increase the accuracy and
efficiency of page processing by reshaping zones and renumbering
them.
◆Adobe Acrobat PDF output. You can output the document in
Adobe Acrobat Portable Document Format (PDF), which can be
viewed on either a PC or Macintosh computer.
◆Dynamic OCR training. You can train OCR to improve
recognition accuracy as the job progresses. Use dynamic training
with difficult documents, such as faxes or multi-generation
photocopies. TextBridge enables you to interact with the OCR
process and view then accept or correct its automatic recognition
decisions. The software actually learns special symbols and
words.
◆Guided Tour. Multimedia introduction to TextBridge including
a guide to major features.
◆ToolTips and What’s This? Help. Instant context-sensitive
information about commands, dialog boxes, and buttons on the
interface.
Other TextBridge features
In addition to the features listed in the previous section,
TextBridge provides these other productivity features:
◆Windows 95 certification
◆Microsoft Office 97 certification
◆MMX support
1–6TextBridge Pro 98 User’s Guide
◆Broad scanner support. TextBridge supports most popular
desktop scanners. It provides many built-in Image and Scanner
Interface Standard (ISIS) drivers supporting a number of
scanners. It supports the TWAIN device interface standard. It
also supports the Text Enhancement Technology and Auto Area
Segmentation features of the EPSON® scanner family. It also
supports some Windows NT scanners.
◆Image processing. TextBridge accepts a wide range of images
from a variety of sources for processing. Specifically, the program
imports and recognizes online document images in BMP, Delrina,
PCX, DCX, TIFF, and XIF formats originating from fax modems
and other sources. For more information, see the “Supported
Input Image File Formats” section in this chapter.
◆OLE drag and drop. TextBridge supports Windows OLE
standard drag and drop operations. For example, you can drag an
image from an OLE-compliant program onto TextBridge.
◆Clipboard support. TextBridge can import and recognize
images from the Clipboard.
◆Deferred processing. TextBridge enables you to scan all pages
of a document to a TIFF file, then later open the image file for
document recognition.
◆Output text formats including HTML. TextBridge supports a
number of output text formats, including word processor, desktop
publishing, portable document, spreadsheet, HTML, and
database formats. Now you can process your text for publication
on the Web. For more information, see the “Supported Output
Text File Formats” section in this chapter.
◆Preview with manual zoning. TextBridge provides a set of
tools for previewing page images before processing them. You can
view a page before continuing with processing. You can manually
define areas of page images as zones to be processed and capture
only the text, tables, or pictures you want. You can also edit the
automatic zoning by adjusting the text, table, and picture zones.
Introduction1–7
◆Zone templates with re-usable data. After you create a set of
zones, TextBridge lets you save and reload the zone templates for
new jobs. In this way you can consistently process or ignore
specific areas on the same type of pages and save time.
◆Re-usable training data. After you interactively train OCR,
you can save the training data in a file. You can reload this
training file for similar documents of the same page type. Using
this training file assures the highest recognition accuracy without
having to repeat the training.
◆Custom dictionaries. To improve recognition accuracy further,
you can create specialized word lists (scientific terminology,
proper names, acronyms, and so on) within TextBridge or in
ASCII files and load them into TextBridge.
◆Two-sided document processing. If your scanner has a sheet
feeder, you can scan the fronts (odd sides) of the pages first, then
flip the stack and scan the reverse (even) sides. When scanning
and recognition are complete, TextBridge automatically collates
the text.
With these features, you can import virtually any paper
document or document image file to your computer. TextBridge
attains the highest degree of OCR accuracy and provides the
output in fully editable form in your favorite text program.
Characteristics of Documents TextBridge can recognize
TextBridge includes a number of advances developed by the
Xerox Desktop Document Systems (DDS) division and by the Palo
Alto Research Center (PARC), where modern computer interfaces
were invented.
Consequently, TextBridge provides the highly accurate OCR and
format retention results on the widest range of documents.
TextBridge can recognize:
◆Documents printed on typewriters, phototypesetters, and impact,
ink-jet, dot-matrix, and laser printers
1–8TextBridge Pro 98 User’s Guide
◆Photocopied, degraded, or dirty documents
◆Documents with single- or multiple-column layouts
◆Spreadsheets or cell tables
◆Paper documents with black and white, grayscale, or color
pictures including photos and line art
◆ Page image files with black and white pictures
NoteAfter processing with TextBridge, all pictures are output as black
and white or grayscale. However, TextBridge can recognize and
retain color and grayscale pictures in XIF files.
◆Online single- or multiple-page images from fax modems and
other sources
◆Hard-copy faxes
◆Documents with point sizes ranging from 5-point to 72-point type
in practically any typeface
◆TextBridge software in English, French, German, Italian, or
Spanish
◆Documents composed in English, French, German, Italian, or
Spanish
NoteTextBridge versions shipped in international markets can
recognize an even greater number of languages: Danish, Dutch,
Finnish, Norwegian, Portuguese, and Swedish.
Introduction1–9
WHAT COMES WITH TEXTBRIDGE
TextBridge comes with the following items:
◆One installation CD-ROM. The CD-ROM includes software
programs, scanner drivers, language packs, sample page image
files, release notes, Help, and User’s Guide in PDF format.
◆A printed User’s Guide.
◆A software registration card.
NoteBe sure to register electronically or complete and return the
software registration card. Registration entitles you to free
customer support and assures that you are kept up-to-date on
new software releases and other information related to
TextBridge and the ScanSoft family of software.
In the US, the mailing address is on the registration card. In the
UK, the mailing address for registration is:
XIS Support Department (PSC)
Willow Grange
Church Road
Watford
Hertfordshore
WD1 3QA
Check to be sure you have all the items listed above. If any item
is missing from your TextBridge package, call your authorized
ScanSoft dealer.
For information about contacting ScanSoft, refer to the Preface of
this manual or the Help system.
1–10TextBridge Pro 98 User’s Guide
SCANNERS SUPPORTED
◆Built-in ISIS drivers provided by Pixel Translations Inc.
◆The TWAIN standard, which lets TextBridge work with virtually
◆Text Enhancement Technology and Auto Area Segmentation
NoteInstall your scanner before you install TextBridge.
TextBridge works with many popular desktop scanners using:
any fully TWAIN-compliant device that provides a binary image
in a supported size and resolution.
features of the EPSON® scanner family.
The full list of scanners supported by TextBridge is always
growing. Check the online Release Notes and the TextBridge Web
site at www.textbridge.com to find the latest list of supported
scanners. If your scanner is not in this list, call ScanSoft or to
check further if your scanner is supported.
Scanners require a system-level driver or a TWAIN source driver,
which is provided by the scanner or interface card manufacturer.
Consult the scanner documentation for details about installing
your scanner, interface card, and driver.
After installing your scanner, test that the scanner is functioning.
Refer to the scanner manufacturer’s documentation to answer
any questions about the scanner.
NoteYour scanner must be working independently of TextBridge prior
to connecting it to TextBridge.
In general, it is recommended that you turn on your scanner
before you turn on your PC.
Next, install the TextBridge software.
Introduction1–11
SYSTEM REQUIREMENTS
To install and run TextBridge 98, your Windows-compatible PC
must be equipped with the following:
◆An Intel (or compatible) 80486 or Pentium™ microprocessor
◆VGA, SVGA, or multi-sync color monitor
◆A minimum of 16 megabytes of random access memory (RAM) for
Windows 95 and Windows NT
◆Microsoft Windows™ 95 or Windows NT 4.0
◆A hard disk with a minimum of 20 megabytes (20 MB) of free
space in which to install TextBridge. This enables installation of
all TextBridge software and one language pack. Please allow one
megabyte (1 MB) for each additional language pack you intend to
install.
INSTALLING TEXTBRIDGE
After you have installed your scanner and checked that it is
working, you are ready to install TextBridge software.
This section provides procedures to install TextBridge.
NoteIf you want TextBridge to run on both Windows 95 and NT with a
dual boot, install TextBridge twice.
Before you begin installation, exit from any open applications so
that only Windows is running. There should be no applications
listed in the task bar and no floating toolbars.
To install TextBridge for Windows 95/NT 4.0:
1–12TextBridge Pro 98 User’s Guide
Read the instructions
Click Next to proceed
1. Insert the TextBridge CD into your CD-ROM drive.
An autorun program on the CD-ROM launches the TextBridge
setup program automatically. (You can also use the Windows
Explorer to open the drive, and double-click the autorun.exe
program at the top level of the CD-ROM.)
The TextBridge 98 Setup dialog box appears (Figure 1–2).
Figure 1–2. TextBridge Pro 98 Setup dialog box
2. Read the information in the Setup dialog box, then click
Next.
3. Read the Software License Agreement (Figure 1–3), then
click Yes to proceed with installation.
Introduction1–13
Read the agreement
Click Yes to proceed
Figure 1–3. Software License Agreement dialog box
☞ If you click No because you do not accept the license
agreement, the TextBridge setup closes.
4. The Setup Installation Type dialog box appears
(Figure 1–4). Complete this screen, as follows.
1–14TextBridge Pro 98 User’s Guide
Read the instructions
Click Next to proceed
Figure 1–4. Setup Installation Type dialog box
•Choose one type of installation:
• Typical (Recommended for most users.)
• Compact (Used for minimum installation.)
• Custom (Used to select OCR language packs to install.)
•Accept the default destination directory, or Browse for a
different directory. (It is recommended that you install
TextBridge in the default directory.)
•Click Next to install the TextBridge files onto your
destination disk directory.
5. After the files are copied onto your system, the installation
of TextBridge is complete.
All of the TextBridge program is now installed.
Introduction1–15
Select your scanner
6. A Scanner Setup dialog box asks if you want to set up your
scanner. Select Yes (Figure 1–5).
Figure 1–5. Scanner Setup dialog box
Click No if you are not using a scanner or are not sure at this
time.
7. The Scanner Setup dialog box appears (Figure 1–6).
Complete this screen as follows:
Figure 1–6. Scanner Setup dialog box
• If you will not be using TextBridge with a scanner, select No
scanner, then click OK. If you want to use a scanner at a later
time, you can get this dialog box from the Select Scanner in
the File menu
1–16TextBridge Pro 98 User’s Guide
• Select your scanner, then click OK.
• If applicable, click Configure to further define your scanner
configuration. Refer to your scanner documentation for details
about scanner configuration settings.
For some scanners, a dialog box appears that lets you define
settings including Port Address, SCSI ID Number, Transfer
Mode, and Scanning Speed.
For other scanners, a dialog box appears with the following
message:
This scanner’s configuration is set using the
system-level driver.
When you are finished specifying scanner configuration settings,
click OK to save the new settings.
• Click OK in the Scanner Setup dialog box.
8. Complete the electronic registration.
Follow the instructions in the registration dialog box.
If your PC is not set up for electronic registration, please fill in
the registration card, and mail it.
9. The Setup Complete dialog box appears. Specify when you
want to restart your PC, then click OK.
Figure 1–8. Setup Complete dialog box
1–18TextBridge Pro 98 User’s Guide
Click OK to restart your
computer
☞ Restarting is necessary to complete TextBridge setup. It is
recommended that you restart immediately. However, if you
want to perform other activities before restarting, you can
click No.
Congratulations! TextBridge setup is now complete, and your
new software is installed on your PC.
SETTING UP TEXTBRIDGE INSTANT ACCESS
When you restart your PC, you can use the TextBridge Instant
Access Control Panel dialog box to set up Instant Access (Figure
1–9). To set up TextBridge Instant Access from your other
programs, use the following procedure:
1. On the Windows task bar, click Start.
2. Point to Programs, then point to the TextBridge Pro 98
folder.
3. Click Instant Access Control Panel.
The TextBridge Instant Access Control Panel dialog box appears.
Check one or more programs
Click OK
Figure 1–9. TextBridge Instant Access Control Panel
Introduction1–19
4. Check one or more programs in the list.
5. Click OK.
More information is available if you click the Help button in the
dialog box.
TextBridge will now be available in the File menu of the
program(s) you checked if they are installed on your PC.
UNINSTALLING TEXTBRIDGE
To restore your PC to the state it was in before you installed
TextBridge, use the following procedure:
1. Close all active applications, including TextBridge.
2. On the Windows task bar, click Start.
3. Point to Programs, then point to the TextBridge Pro 98
folder.
4. Click TextBridge Uninstall.
The TextBridge Uninstall dialog box appears.
5. Click Yes to continue the uninstall process.
TextBridge automatically uninstalls.
Click No to exit the uninstall process.
6. The Uninstall Complete dialog box appears. Click OK to
restart your computer.
With the above steps completed, TextBridge is completely
uninstalled from your PC.
1–20TextBridge Pro 98 User’s Guide
Loading...
+ 96 hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.