Business Objects and the Business Objects logo, BusinessObjects, Crystal Reports, Crystal
Xcelsius, Crystal Decisions, Intelligent Question, Desktop Intelligence, Crystal Enterprise,
Crystal Analysis, Web Intelligence, RapidMarts, and BusinessQuery are trademarks or
registered trademarks of Business Objects in the United States and/or other countries. All
other names mentioned herein may be trademarks of their respective owners.
Business Objects products in this release may contain redistributions of software licensed
from third-party contributors. Some of these individual components may also be available
under alternative licenses. A partial listing of third-party contributors that have requested or
permitted acknowledgments, as well as required notices, can be found at:
http://www.businessobjects.com/thirdparty
Business Objects owns the following U.S. patents, which may cover products that are offered
and licensed by Business Objects: 5,555,403; 6,247,008; 6,289,352; 6,490,593; 6,578,027;
6,768,986; 6,772,409; 6,831,668; 6,882,998 and 7,139,766.
April 26, 2007
If you find any problems with this documentation, please report them to Business Objects S.A.
in writing at documentation@businessobjects.com.
This guide provides an overview of the BusinessObjects™ Data Integrator
Platform and architecture. It also describes how to upgrade and install Data
Integrator.
With Data Integrator you can:
•Easily build and execute batch processing applications that create and
update a data warehouse.
•Combine batch capabilities with request-response processing logic and
message handling to support e-commerce and its data integration
requirements.
This chapter covers the following topics:
•Audience and assumptions
•Data Integrator product documentation
Audience and assumptions
This and other Data Integrator product documentation assumes that:
•You are an application developer, consultant, or database administrator
working on data extraction, data warehousing, or data integration.
•You understand your source data systems, RDBMS, business
intelligence, and e-business messaging concepts.
•You understand your organization’s data needs.
•You are familiar with SQL (Structured Query Language).
•You are familiar enough with Microsoft Windows or UNIX platforms to
effectively install Data Integrator.
Data Integrator product documentation
Data Integrator documentation is provided in PDF format on the Data
Integrator CD. You can read PDF files using the latest version of Adobe
Acrobat Reader (download instructions at the Adobe website).
After you install Data Integrator, you can view technical documentation from
many locations. To view documentation in PDF format, you can:
12Data Integrator Getting Started Guide
Introduction
Data Integrator product documentation
•If you accepted the default installation, select Start > Programs >
Business Objects > Data Integrator > Data Integrator Document ation
and select:
•Release Notes—Opens the Release Notes PDF. This document lists
supported and unsupported products, details migration
considerations, includes known and fixed bugs, and provides lastminute documentation corrections associated with the release.
•Release Summary—Opens the Release Summary PDF. this
document describes new Data Integrator features in the release.
•Technical Manuals—Opens the combined Technical Manuals PDF,
which contains Data Integrator core documentation and provides
searchable cross-book master Contents and master Index.
•Technical Manuals—Opens a “master” PDF document that has
been compiled so you can search across the Data Integrator
documentation suite
•Tutorial—Opens the Data Integrator Tutorial PDF, which you can
use for basic stand-alone training purposes
•Select one of the following from the Designer Help menu:
•Release Notes
•Release Summary
•Technical Manuals
•Technical Manuals
•Tutorial
Other links from the Designer’s Help menu include:
•DIZone—Opens a browser window to the DI Zone, an online resource for
the Data Integrator user community)
•Knowledge Base—Opens a browser window to Business Objects’
Technical Support Knowledge Exchange forum (access requires
registration)
1
Data Integrator Getting Started Guide13
Introduction
1
Data Integrator product documentation
You can also access Data Integrator documentation from the Start Page that
opens automatically when you open the Designer. To open the Designer,
choose Start > Programs > Business Objects > Data Integrator > Data Integrator Designer:
You can also view and download PDF documentation, including Data
Integrator documentation for previous releases (including Release
Summaries and Release Notes), by visiting the Business Objects
documentation Web site at http://support.businessobjects.com/
documentation/.
You can also open Help, using one of the following methods:
•Choose Contents from the Designer Help menu.
•Click objects in the object library or workspace and press F1.
Online Help opens to the subject you selected.
Use OnlineHelp links and tool bar to navigate.
14Data Integrator Getting Started Guide
Introduction
Data Integrator product documentation
The following technical documentation is currently available:
•Data Integrator Release Summary— Provides the most up-to-date
information about the product including information that was not available
when the other documents were published.
•Data Integrator Release Summary— Provides summarized information
highlighting all the newest features in the release.
•Data Integrator Technical Manuals includes the following books with a
master Table of Contents and master Index for powerful cross-book
search capabilities:
•Data Integrator Getting S ta rted Guide — Introduces the features and
benefits of using Data Integrator. This guide explains Data Integrator
architecture and how to install the product including system
requirements and pre-installation preparation that will help you
understand dependencies between Data Integrator and your
computing environment.
•Data Integrator Designer Guide — Provides conceptual and
procedural information for designing, developing, and implementing
applications using the Data Integrator Designer.
Provides conceptual and procedural information for scheduling and
monitoring batch jobs as well as configuring and administering realtime jobs. Explains the Data Integrator Administrator features.
•Data Integrator Reference Guide — Provides detailed information
about Data Integrator Designer objects, data types and expressions,
transforms, functions, and user exits.
•Data Integrator Advanced Development and Migration Guide —
Discusses how to migrate Data Integrator applications through
development phases and includes multi-user development concepts
and procedures.
information about improving the performance of your Data Integrator
environment and the Designer options you can use to reduce job
run-times by measuring and tuning performance. Includes
documentation on bulk loading and parallel execution.
•Data Integrator Supplement for J.D. Edwards — Provides
information about using Data Integrator to extract data from J.D.
Edwards World and J.D. Edwards OneWorld.
•Data Integrator Supplement for Oracle Applications — Provides
information about using Data Integrator to extract data from Oracle
Applications.
1
Data Integrator Getting Started Guide15
Introduction
1
Suggested reading path
•Data Integrator Supplement for PeopleSoft — Provides information
about using Data Integrator to extract data from PeopleSoft ERP and
HRMS software.
•Data Integrator Supplement for SAP — Provides information about
using Data Integrator to extract data from SAP R/3 and SAP BW.
This supplement includes conceptual, procedural, and reference
information.
•Data Integrator Supplement for Siebel — Provides information about
using Data Integrator to extract data from Siebel applications.
If you install the documentation with Data Integrator, you can also go to Start
> Programs > Business Objects > Data Integrator > Data Integrator
Documentation > Tutorial to view:
•Data Integrator Core Tutorial — Provides exercises to help you learn how
to extract, transform, and load data with the Data Integrator Designer.
Exercises include using design logic to extract data from SAP R/3 and
non-SAP sources.
Suggested reading path
Begin by reading the Data Integrator Getting St arted Guide which includes an
introduction to functionality and architecture as well as installation
instructions.
Understand how to use Data Integrator Designer by reading the Data
Integrator Designer Guide and doing the exercises in the Data Integrator
Core Tutorial. Use the Data Integrator Reference Guide to answer specific
questions about objects and window options in the Designer.
The Data Integrator Management Console: Administrator Guide provides
information about how to use the Administrator which is a browser-based
graphical user interface used to schedule and monitor batch jobs and to
configure real-time jobs and adapter interfaces available with Data Integrator.
Use the Data Integrator Performance Optimization Guide for tips about using
system and Designer options related to performance optimization. These
include measuring and tuning techniques, bulk loading, and parallel
processing objects within a data flow.
To install and configure Data Integrator’s pre-packaged adapters, read the
individual adapter guides on the adapter product CDs.
16Data Integrator Getting Started Guide
Data Integrator Getting Started Guide
Introducing Data Integrator
chapter
Introducing Data Integrator
2
About this chapter
About this chapter
This chapter introduces Data Integrator and explains its place in the Business
Objects product suite. Topics include:
•Data Integrator and the Business Objects Product Suite
•Data Integrator product benefits
•Data Integrator interfaces
Data Integrator and the Business Objects
Product Suite
The Business Objects product suite delivers extreme insight through
specialized end-user tools on a single, trusted business intelligence platform.
This entire platform is supported by BusinessObjects™ Data Integrator. On
top of Data Integrator, Business Objects layers the most reliable, scalable,
flexible, and manageable BI platform which supports the industry’s best
integrated end-user interfaces: reporting, query and analysis, and
performance management dashboards, scorecards, and applications.
True data integration blends batch extraction, transformation, and loading
(ETL) technology with real-time bi-directional data flow across multiple
applications for the extended enterprise.
By building a relational datastore and intelligently blending direct real-time
and batch data-access methods to access data from enterprise resource
planning (ERP) systems and other sources, Business Objects has created a
powerful, high-performance data integration product that allows you to fully
leverage your ERP and enterprise application infrastructure for multiple uses.
Business Objects provides a batch and real-time data integration system to
drive today’s new generation of analytic and supply-chain management
applications. Using the highly scalable data integration solution provided by
Business Objects, your enterprise can maintain a real-time, on-line dialogue
with customers, suppliers, employees, and partners, providing them with the
critical information they need for transactions and business analysis.
18Data Integrator Getting Started Guide
Data Integrator product benefits
Use Data Integrator to develop enterprise data integration for batch and realtime uses. With Data Integrator:
•You can share data and metadata with BusinessObjects Enterprise
platform.
•You can create a single infrastructure for batch and real-time data
movement to enable faster and lower cost implementation.
•Your enterprise can manage data as a corporate asset independent of
any single system. Integrate data across many systems and reuse that
data for many purposes.
•You have the option of using pre-packaged data solutions for fast
deployment and quick ROI. These solutions extract historical and daily
data from operational systems and cache this data in open relational
databases.
Data Integrator customizes and manages data access and uniquely combines
industry-leading, patent-pending technologies for delivering data to analytic,
supply-chain management, customer relationship management, and Web
applications.
Introducing Data Integrator
Data Integrator product benefits
2
Unification with the platform
Data Integrator provides several points of unification with BusinessObjects
Enterprise platform:
•Get end-to-end data lineage and impact analysis
•Create the semantic layer (universe) and manage change within the ETL
design environment
Business Objects deeply integrates the entire ETL process with the business
intelligence platform so you benefit from:
•Easy metadata management
•Simplified and unified administration
•Lifecycle management
•Trusted information
Data Integrator Getting Started Guide19
Introducing Data Integrator
2
Data Integrator associated products
Ease of use and high productivity
Data Integrator combines both batch and real-time data movement and
management to provide a single data integration platform for information
management from any information source, for any information use.
Using Data Integrator, you can:
•Stage data in an operational datastore, data warehouse, or data mart.
•Update staged data in batch or real-time modes.
•Create a single graphical development environment for developing,
testing, and deploying the entire data integration platform.
•Manage a single metadata repository to capture the relationships
between different extraction and access methods and provide integrated
lineage and impact analysis.
High availability and performance
Data Integrator’s high-performance engine and proven data movement and
management capabilities include:
•Scalable, multi-instance data-movement for fast execution
•Load balancing
•Changed-data capture
•Parallel processing
Data Integrator associated products
Choose from several Business Objects product options to further support and
enhance the power of your Data Integrator product.
Composer
Even before you start a data warehouse implementation, Business Objects
knows that you need to collect and organize critical data from information
users to “compose” a blueprint from which you can start to build your Data
Integrator jobs.
The BusinessObjects ™Composer is a stand-alone, web-based application
for designing extraction, transformation, and loading (ETL) projects.
20Data Integrator Getting Started Guide
Developing higher-level designs for your ETL jobs can make the
implementation process more efficient and improve the quality of resulting
jobs. Composer complements other ETL tools, providing a platform for
creating your design even before you start to build your data warehouse. Use
Composer to identify sources of data, implement transformations, and
document your designs.
For more information on BusinessObjects™ Composer, contact your
Business Objects sales representative.
Metadata Manager
Business Objects™ Metadata Manager provides an integrated view of
metadata and its multiple relationships for an complete Business Intelligence
project spanning some or all of the Business Objects stack of products. Use
Metadata Manager to:
•View metadata about Business Objects Reports, Documents, and data
sources from a single repository.
•Analyze lineage to determine data sources of Business Objects
Documents and Reports.
•Analyze the impact of changing a source table, column, element, or field
on existing Business Objects Documents and Reports.
•Track different versions (changes) to each object over time.
•View operational metadata (such as the number of rows processed and
CPU utilization) as historical data with a datetime.
•View metadata in different languages.
For more information on Business Objects™ Metadata Manager , contact your
Business Objects sales representative.
Introducing Data Integrator
Data Integrator interfaces
2
Data Integrator interfaces
Data Integrator provides many types of interface components. Interfaces
include:
•Data-level read/write interfaces using SQL:
Attunity
BusinessObjects Data Federator
DB2/UDB
DB2/UDB via DB2Connect to MVS
Data Integrator Getting Started Guide21
Introducing Data Integrator
2
Data Integrator interfaces
DB2/UDB via DB2Connect to AS/400
Informix
MS SQL Server
MySQL
Netezza
Oracle
ODBC to generic databases
Sybase ASE
Sybase IQ
Teradata
•Application-level interfaces (read/write and batch/real-time unless
otherwise specified):
J.D. Edwards One World or World (batch)
Oracle Application (batch)
PeopleSoft interface (batch)
Salesforce.com
SAP R/3 ABAP (batch, read only)
SAP R/3 Hierarchy
SAP R/3 IDoc
SAP R/3 RFC/BAPI
SAP BW (batch)
Siebel Application (batch)
•Technology read/write real-time interfaces:
HTTP/HTTPS
Java Message Service (JMS)
SNMP
Web Services
•Files read/write connectivity interfaces:
ASCII
COBOL (read only)
Excel (read only)
XML
22Data Integrator Getting Started Guide
Introducing Data Integrator
Data Integrator interfaces
You can use the Data Integrator Interface Development Kit to develop
adapters that read from and/or write to other applications.
In addition to the interfaces listed above, the Data Integrator Nested
Relational Data Model (NRDM) allows you to apply the full power of SQL
transforms to manipulate, process, and enrich hierarchical business
documents. For more information, see Chapter, “Nested Data,” in the Data
Integrator Designer Guidethe.
To learn more about which products and components are supported by your
version of Data Integrator, see your Data Integrator Release Notes and
Supported Platform documents.
2
Data Integrator Getting Started Guide23
Introducing Data Integrator
2
Data Integrator interfaces
24Data Integrator Getting Started Guide
Data Integrator Getting Started Guide
Data Integrator Architecture
chapter
Data Integrator Architecture
3
About this chapter
About this chapter
This chapter describes Data Integrator components and their distribution on
your network.
This chapter contains the following topics:
•Standard Data Integrator components
•Optional Data Integrator components
•Data Integrator management tools
•Data Integrator operating system platforms
•Data Integrator distributed architecture
The Data Integrator architecture is layered to allow data integration to occur
over a variety of open, industry-standard APIs for optimal data and metadata
management.
Standard Data Integrator components
Standard Data Integrator components include:
•Data Integrator Designer
•Data Integrator repository
•Data Integrator Job Server
•Data Integrator engine
•Data Integrator Access Server
•Data Integrator Administrator
•Data Integrator Metadata Reports applications
•Data Integrator Web Server
•Data Integrator Service
•Data Integrator SNMP Agent
•Data Integrator Adapter SDK
The following diagram summarizes the relationships among Data Integrator
components.
26Data Integrator Getting Started Guide
Data Integrator Architecture
Standard Data Integrator components
3
Data Integrator Designer
The Designer is a development tool with an easy-to-use graphical user
interface. It enables developers to define data management applications that
consist of data mappings, transformations, and control logic.
Use the Designer to create applications containing work flows (job execution
definitions) and data flows (data transformation definitions).
To use the Designer, create objects, then drag, drop, and configure them by
selecting icons in flow diagrams, table layouts, and nested worksp ace pages.
The objects in the Designer represent metadata. The Designer interface
allows you to manage metadata stored in a Data Integrator repository. From
the Designer, you can also trigger the Data Integrator Job Server to run your
jobs for initial application testing.
Data Integrator Getting Started Guide27
Data Integrator Architecture
3
Standard Data Integrator components
Data Integrator repository
The Data Integrator repository is a set of tables that hold user-created and
predefined system objects, source and target metadata, and transformation
rules. Set up repositories on an open client/server platform to facilitate
sharing metadata with other enterprise tools. Store each repository on an
existing RDBMS.
Each repository is associated with one or more Data Integrator Job Servers
which run the jobs you create. There are two types of repositories:
•A local repository is used by an application designer to store definitions of
Data Integrator objects (like projects, jobs, work flows, and data flows)
and source/target metadata.
•A central repository is an optional component that can be used to support
multi-user development. The central repository provides a shared object
library allowing developers to check objects in and out of their local
repositories.
Data Integrator Job Server
The Data Integrator Job Server starts the data movement engine that
integrates data from multiple heterogeneous sources, performs complex data
transformations, and manages extractions and transactions from ERP
systems and other sources. The Data Integrator Job Server can move data in
either batch or real-time mode and uses distributed query optimization, multithreading, in-memory caching, in-memory data transformations, and parallel
processing to deliver high data throughput and scalability.
While designing a job, you can run it from the Designer which tells the Job
Server to run the job. The Job Server gets the job from its associated repository,
then starts a Data Integrator engine to process the job. In your production
environment, the Job Server runs jobs triggered by a scheduler or by a realtime service managed by the Data Integrator Access Server. In production
environments, you can balance job loads by creating a Job Server Group
(multiple Job Servers) which executes jobs according to overall system load.
Data Integrator engine
When Data Integrator jobs are executed, the Job Server starts Data Integrator
engine processes to perform data extraction, transformation, and movement.
Data Integrator engine processes use parallel processing and in-memory
data transformations to deliver high data throughput and scalability.
28Data Integrator Getting Started Guide
Data Integrator Access Server
The Access Server is a real-time, request-reply message broker that collects
message requests, routes them to a real-time service, and delivers a
message reply within a user-specified time frame. The Access Server queues
messages and sends them to the next available real-time service across any
number of computing resources. This approach provides automatic scalability
because the Access Server can initiate additional real-time services on
additional computing resources if traffic for a given real-time service is high.
You can configure multiple Access Servers.
Data Integrator Administrator
The Administrator provides browser-based administration of Data Integrator
resources including:
•Scheduling, monitoring, and executing batch jobs
•Configuring, starting, and stopping real-time services
•Configuring Job Server, Access Server, and repository usage
•Configuring and managing adapters
•Managing users
•Publishing batch jobs and real-time services via Web services
Data Integrator Architecture
Standard Data Integrator components
3
Data Integrator Metadata Reports applications
The Metadata Reports applications provides browser-based analysis and
reporting capabilities on metadata that is associated with:
•your Data Integrator jobs
•other Business Objects applications associated with Data Integrator
Metadata Reports provide four applications for exploring your metadata:
•Impact and lineage analysis
•Operational dashboards
•Auto documentation
•Data validation
Data Integrator Getting Started Guide29
Data Integrator Architecture
3
Standard Data Integrator components
Impact and Lineage Analysis reports
Impact and Lineage Analysis reports include:
•Datastore Analysis — For each datastore connection, view overview,
table, function, and hierarchy reports. Data Integrator users can
determine:
•What data sources populate their tables
•What target tables their tables populate
•Whether one or more of the following Business Objects reports uses
data from their tables:
•Business Views
•Crystal Reports
•Universes
•Web Intelligence documents
•Desktop Intelligence documents
•Universe analysis — View Universe, class, and object lineage. Universe
users can determine what data sources populate their Universes and
what reports use their Universes.
•Business View analysis — View the dat a sources for Business Views in
the Central Management Server (CMS). You can view business element
and business field lineage reports for each Business View. Crystal
Business View users can determine what data sources populate their
Business Views and what reports use their views.
•Report analysis — View data sources for reports in the Central
Management Server (CMS). You can view table and column lineage
reports for each Crystal Report and Web Intelligence Document
managed by CMS. Report writers can determine what data sources
populate their reports.nic
•Dependency analysis — Search for specific objects in your repository
and understand how those objects impact or are impacted by other Data
Integrator or Business Objects Universe objects and reports. Metadata
search results provide links back into associated reports.
To view impact and lineage analysis for Business Objects applications, you
must configure the Metadata Integrator. To learn more about Metadata
Integrator, see “Installing and Configuring the Metadata Integrator” on
page 91.
30Data Integrator Getting Started Guide
Loading...
+ 124 hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.