Welcome to the
The software is a component of the SAP BusinessObjects information management solutions and
allows you to extract and integrate data for analytical reporting and e-business.
Exercises in this tutorial introduce concepts and techniques to extract, transform, and load batch data
from flat-file and relational database sources for use in a data warehouse. Additionally, you can use
the software for real-time data extraction and integration. Use this tutorial to gain practical experience
using software components including the Designer, repositories, and Job Servers.
SAP BusinessObjects information management solutions also provide a number of Rapid Marts
packages, which are predefined data models with built-in jobs for use with business intelligence (BI)
and online analytical processing (OLAP) tools. Contact your sales representative for more information
about Rapid Marts.
Tutorial
. This tutorial introduces core features of SAP BusinessObjects Data Services.
1.1 Audience and assumptions
This tutorial assumes that:
•You are an application developer or database administrator working on data extraction, data
warehousing, data integration, or data quality.
•You understand your source data systems, DBMS, business intelligence, and e-business messaging
concepts.
•You understand your organization's data needs.
•You are familiar with SQL (Structured Query Language).
•You are familiar with Microsoft Windows.
1.2 SAP BusinessObjects information resources
A global network of SAP BusinessObjects technology experts provides customer support, education,
and consulting to ensure maximum information management benefit to your business.
Useful addresses at a glance:
2010-12-029
Introduction
ContentAddress
Customer Support, Consulting, and Education
services
http://service.sap.com/
SAP BusinessObjects Data Services Community
http://www.sdn.sap.com/irj/sdn/ds
Forums on SCN (SAP Community Network )
http://forums.sdn.sap.com/forum.jspa?foru
mID=305
Blueprints
http://www.sdn.sap.com/irj/boc/blueprints
Information about SAP Business User Support
programs, as well as links to technical articles,
downloads, and online forums. Consulting services
can provide you with information about how SAP
BusinessObjects can help maximize your information management investment. Education services
can provide information about training options and
modules. From traditional classroom learning to
targeted e-learning seminars, SAP BusinessObjects
can offer a training package to suit your learning
needs and preferred learning style.
Get online and timely information about SAP BusinessObjects Data Services, including tips and tricks,
additional downloads, samples, and much more.
All content is to and from the community, so feel
free to join in and contact us if you have a submission.
Search the SAP BusinessObjects forums on the
SAP Community Network to learn from other SAP
BusinessObjects Data Services users and start
posting questions or share your knowledge with the
community.
Blueprints for you to download and modify to fit your
needs. Each blueprint contains the necessary SAP
BusinessObjects Data Services project, jobs, data
flows, file formats, sample data, template tables,
and custom functions to run the data flows in your
environment with only a few modifications.
http://help.sap.com/businessobjects/
Supported Platforms (Product Availability Matrix)
https://service.sap.com/PAM
1.3 Tutorial objectives
SAP BusinessObjects product documentation.Product documentation
Get information about supported platforms for SAP
BusinessObjects Data Services.
Use the search function to search for Data Services.
Click the link for the version of Data Services you
are searching for.
2010-12-0210
Introduction
The intent of this tutorial is to introduce core Designer functionality.
After completing this tutorial you should be able to:
•Describe the process for extracting, transforming, and loading data using SAP BusinessObjects
Data Services
•Identify Data Services objects
•Define Data Services objects to:
•Extract flat-file, XML, and relational data from your sources
•Transform the data to suit your needs
•Load the data to your targets
•Use Data Services features and functions to:
•Recover from run-time errors
•Capture changed data
•Verify and improve the quality of your source data
•Run a real-time job
•View and print metadata reports
•Examine data throughout a job using the debugger
•Set up a multiuser development environment
1.4 Tutorial prerequisites
This section provides a high-level description of the steps you need to complete before you begin the
tutorial exercises.
1.4.1 Preparation for this tutorial
Read the sections on logging in to the Designer and the Designer user interface in the
BusinessObjects Data Services Designer Guide
including terms and concepts relevant to this tutorial.
SAP
to get an overview of the Designer user interface
This tutorial also provides a high-level summary in the next section,
Product Overview
.
2010-12-0211
Introduction
1.4.2 Environment required
To use this tutorial, you must have Data Services running on a supported version of Windows XP or
Windows Server 2003 and a supported RDBMS (such as Oracle, IBM DB2, Microsoft SQL Server, or
Sybase ASE).
You can install Data Services product components (Designer, Administrator, Job Server, Access Server)
on a single computer or distribute them across multiple computers. In the simplest case, all components
in the following diagram can reside on the same computer.
1.4.3 Tutorial setup
Ensure you have access to a RDBMS; contact your system administrator for assistance.
To set up your computer for this tutorial you must do the following tasks:
•Create repository, source, and target databases on an existing RDBMS
•Install SAP BusinessObjects Data Services
•Run the provided SQL scripts to create sample source and target tables
The following sections describe each of these tasks.
1.4.3.1 Create repository, source, and target databases on an existing RDBMS
2010-12-0212
Introduction
1.
Log in to your RDBMS.
2.
(Oracle only). Optionally create a service name alias.
Set the protocol to TCP/IP and enter a service name; for example, training.sap. This can act as your
connection name.
3.
Create three databases—for your repository, source operational data store (ODS), and target. For
each, you must create a user account and password.
The recommended values used in the tutorial SQL scripts are:
TargetSourceRepository
targetodsrepoUser name
targetodsrepoPassword
4.
Grant access privileges for the user account. For example for Oracle, grant CONNECT and
RESOURCE roles.
5.
Make a note of the connection names, database versions, user names and passwords in the following
table. You will be asked to refer to this information throughout the tutorial.
TargetSourceRepositoryValue
Database connection name (Oracle) OR
Database server name AND
Database name (MS-SQL Server)
Database version
User name
Password
1.4.3.2 Install a Central Management Server (CMS)
SAP BusinessObjects Data Services requires a Central Managment Server (CMS) provided by SAP
BusinessObjects Enterprise or Information platform services 4.0.
For detailed information about system requirements, configuration, and installation for Information
platform services 4.0, see the “Installing Information platform services 4.0” section of the
Guide for Windows
.
Installation
For detailed information about system requirements, configuration, and installation for SAP
BusinessObjects Enterprise, see the
SAP BusinessObjects Enterprise Installation Guide
.
2010-12-0213
Introduction
Note:
During installation, make a note of the administrator user name and password for the SAP
BusinessObjects Enterprise or Information platform services 4.0 system. You will be asked to enter it
to complete the setup of the tutorial.
1.4.3.2.1 To log in to the Central Management Console
To configure user accounts and define Data Services repositories, log in to the Central Management
Console (CMC).
1.
Navigate to http://<hostname>:8080/BOE/CMC/, where <hostname> is the name of the
machine where you installed SAP BusinessObjects Enterprise or Information platform services 4.0.
2.
Enter the username, password, and authentication type for your CMS user.
3.
Click Log On.
The Central Management Console main screen is displayed.
1.4.3.3 Install SAP BusinessObjects Data Services
For detailed information about system requirements, configuration, and installing on Windows or UNIX,
See the
Be prepared to enter the following information when installing the software:
•Your Windows domain and user name
•Your Windows password
•Your Windows computer name and host ID
•Product keycode
•Connection information for the local repository and Job Server
When you install the software, it configures a Windows service for the Job Server. To verify that the
service is enabled, open the Services Control Panel and ensure that all Data Services services are
configured for a Status of Started and Startup Type Automatic.
The default installation creates the following entries in the Start > Programs > SAP BusinessObjectsData Services XI 4.0 menu:
Data ServicesLocale Selector
Installation Guide for WindowsorInstallation Guide for UNIX
FunctionCommand
Opens the Designer.Data ServicesDesigner
Allows you to specify the language, territory, and
code page to use for the repository connection
for Designer and to process job data
.
2010-12-0214
Introduction
Data ServicesManagement Console
FunctionCommand
Opens a launch page for the Data Services web
applications including the Administrator and
metadata and data quality reporting tools.
Data ServicesRepository Manager
Data ServicesServer Manager
Opens a dialog box that you can use to update
repository connection information.
Opens a dialog box that you can use to configure
Job Servers and Access Servers.
Enter the connection information for the local repository database that you created.
4.
Type repo for both User and Password.
5.
Click Create.
After creating the repository, you need to define a Job Server and associate the repository with it. You
can also optionally create an Access Server if you want to use web-based batch job administration.
1.4.3.3.2 To define a job server and associate your repository
1.
Open the Server Manager.
2.
In the Job Server tab of the Server Manager window, click Configuration Editor.
3.
In the "Job Server Configuration Editor" window, click Add to add a Job Server.
4.
In the "Job Server Properties" window:
a. Enter a unique name in Job Server name.
b. For Job Server port, enter a port number that is not used by another process on the computer.
If you are unsure of which port number to use, increment the default port number.
c. You do not need to complete the other job server properties to run the exercises in this Tutorial.
5.
Under Associated Repositories, enter the local repository to associate with this Job Server. Every
Job Server must be associated with at least one local repository.
a. Click Add to associate a new local or profiler repository with this Job Server.
b. Under Repository information enter the connection information for the local repository database
that you created.
c. Type repo for both Username and Password.
d. Select the Default repository check box if this is the default repository for this Job Server. You
cannot specify more than one default repository.
e. Click Apply to save your entries. You should see <database_server>_repo_repo in the list
of Associated Repositories.
2010-12-0215
Introduction
6.
Click OK to close the "Job Server Properties" window.
7.
Click OK to close the "Job Server Configuration Editor" window.
8.
Click Close and Restart in the Server Manager.
9.
Click OK to confirm that you want to restart the Data Services Service.
1.4.3.3.3 To create a new Data Services user account
Before you can log in to the Designer, you need to create a user account on the Central Management
Server (CMS).
1.
Log in to the Central Management Console (CMC) using the Administrator account you created
during installation.
2.
Click Users and Groups.
The user management screen is displayed.
3.
Click Manage > New > New User.
The "New User" screen is displayed.
4.
Enter user details for the new user account:
ValueField
EnterpriseAuthentication Type
tutorial_userAccount Name
Tutorial UserFull Name
User created for the Data Services tutorial.Description
tutorial_passPassword
CheckedPassword never expires
UncheckedUser must change password at next logon
5.
Click Create & Close.
The user account is created and the "New User" screen is closed.
6.
Add your user to the necessary Data Services user groups:
a. Click User List.
b. Select tutorial_user in the list of users.
c. Choose Actions > Join Group.
The "Join Group" screen is displayed.
d. Select all the Data Services groups and click >.
Caution:
This simplifies the process for the purposes of the tutorial. In a production environment, you
should plan user access more carefully. For more information about user security, see the
Administrator's Guide
.
The Data Services groups are moved to the Destination Groups area.
2010-12-0216
Introduction
e. Click OK.
The "Join Group" screen is closed.
7.
Click Log Off to exit the Central Management Console.
Related Topics
• To log in to the Central Management Console
1.4.3.3.4 To configure the local repository in the CMC
Before you can grant repository access to your user, you need to configure the repository in the Central
Management Console (CMC).
1.
Log in to the Central Management Console using the tutorial user you created.
2.
Click Data Services.
The "Data Services" management screen is displayed.
3.
Click Manage > Configure Repository.
The "Add Data Services Repository" screen is displayed.
4.
Enter a name for the repository.
For example, Tutorial Repository.
5.
Enter the connection information for the database you created for the local repository.
6.
Click Test Connection.
A dialog appears indicating whether or not the connection to the repository database was successful.
Click OK. If the connection failed, verify your database connection information and re-test the
connection.
7.
Click Save.
The "Add Data Services Repository" screen closes.
8.
Click the Repositories folder.
The list of configured repositories is displayed. Verify that the new repository is shown.
9.
Click Log Off to exit the Central Management Console.
Related Topics
• To log in to the Central Management Console
1.4.3.4 Run the provided SQL scripts to create sample source and target tables
2010-12-0217
Introduction
Data Services installation includes a batch file (CreateTables_databasetype.bat) for each
supported RDBMS. The batch files run SQL scripts that create and populate tables on your source
database and create the target schema on the target database.
1.
Using Windows Explorer, locate the CreateTables batch file for your RDBMS in your Data Services
installation directory in <LINK_DIR>\Tutorial Files\Scripts.
2.
Open the appropriate script file and edit the pertinent connection information (and user names and
passwords if you are not using ods/ods and target/target).
The Oracle batch file contains commands of the form:
For Microsoft SQL Server 2008, use CreateTables_MSSQL2005.bat.
The output files provide logs that you can examine for success or error messages.
3.
Double-click on the batch file to run the SQL scripts.
4.
Use an RDBMS query tool to check your source ODS database.
The following tables should exist on your source database after you run the script. These tables
should include a few rows of sample data.
Table name in databaseDescriptive name
ods_customerCustomer
ods_materialMaterial
ods_salesorderSales Order Header
ods_salesitemSales Order Line Item
ods_deliverySales Delivery
ods_employeeEmployee
ods_regionRegion
5.
Use an RDBMS query tool to check your target data warehouse.
The following tables should exist on your target database after you run the script.
Table name in databaseDescriptive name
salesorg_dimSales Org Dimension
cust_dimCustomer Dimension
mtrl_dimMaterial Dimension
time_dimTime Dimension
2010-12-0218
Introduction
1.5 Tutorial structure
The goal of the tutorial exercises is to demonstrate SAP BusinessObjects Data Services features using
a simplified data model. The model is a sales data warehouse with a star schema that contains one
fact table and some dimension tables.
Table name in databaseDescriptive name
employee_dimEmployee Dimension
sales_factSales Fact
status_tableRecovery Status
CDC_timeCDC Status
Sections build on jobs you create and skills learned in previous sections. You must complete each
exercise to begin the next.
Note:
The screens in this guide are for illustrative purposes. On some screens, the available options depend
on the database type and version in the environment.
This tutorial is organized as follows.
Product Overview
Defining Source and Target Metadata
introduces the basic architecture and the user interface for Data Services.
introduces working with the Designer. Use the Designer to define
a datastore and a file format, then import metadata to the object library. After completing this section,
you will have completed the preliminary work required to define data movement specifications for flat-file
data.
2010-12-0219
Introduction
Populating the SalesOrg Dimension from a Flat File
source and target tables. The exercise populates the sales organization dimension table from flat-file
data.
Populating the Time Dimension Using a Transform
creates a data flow for populating the time dimension table.
Populating the Customer Dimension from a Relational Table
tables. This exercise defines a job that populates the customer dimension table.
Populating the Material Dimension from an XML File
This exercise defines a job that populates the material dimension table.
Populating the Sales Fact Table from Multiple Relational Tables
tables and introduces joins and the lookup function. The exercise populates the sales fact table.
Changed-Data Capture
variables, parameters, functions, and scripts.
Data Assessment
exercise uses profile statistics, the validation transform, and the audit data flow feature.
Recovery Mechanisms
Multiuser Development
environment.
Extracting SAP Application Data
sources.
introduces a basic approach to changed-data capture. The exercise uses
introduces features to ensure and improve the validity of your source data. The
presents techniques for recovering from incomplete data loads.
presents the use of a central repository for setting up a multiuser development
provides optional exercises on extracting data from SAP application
introduces basic data flows, query transforms, and
introduces Data Services functions. This exercise
introduces data extraction from relational
introduces data extraction from nested sources.
continues data extraction from relational
Running a Real-time Job in Test Mode
mode.
1.6 Exiting the tutorial
You can exit the tutorial at any point after creating a sample project (see Adding a new project).
1.6.1 To exit the tutorial
1.
From the Project menu, click Exit.
If any work has not been saved, you are prompted to save your work.
2.
Click Yes or No.
provides optional exercises on running a real-time job in test
2010-12-0220
Introduction
1.6.2 To resume the tutorial
1.
Log in to the Designer and select the repository in which you saved your work.
The Designer window opens.
2.
From the Project menu, click Open.
3.
Click the name of the project you want to work with, then click Open.
The Designer window opens with the project and the objects within it displayed in the project area.
2010-12-0221
Introduction
2010-12-0222
Product Overview
Product Overview
This section provides an overview of SAP BusinessObjects Data Services. It introduces the product
architecture and the Designer.
2.1 Product features
SAP BusinessObjects Data Services combines industry-leading data quality and integration into one
platform. With Data Services, your organization can transform and improve data anywhere. You can
have a single environment for development, runtime, management, security and data connectivity.
One of the fundamental capabilities of Data Services is extracting, transforming, and loading (ETL) data
from heterogeneous sources into a target database or data warehouse. You create applications (jobs)
that specify data mappings and transformations by using the Designer.
Use any type of data, including structured or unstructured data from databases or flat files to process
and cleanse and remove duplicate entries. You can create and deliver projects more quickly with a
single user interface and performance improvement with parallelization and grid computing.
Data Services RealTime interfaces provide additional support for real-time data movement and access.
Data Services RealTime reacts immediately to messages as they are sent, performing predefined
operations with message content. Data Services RealTime components provide services to web
applications and other client applications.
Data Services features
•Instant traceability with impact analysis and data lineage capabilities that include the data quality
process
•Data validation with dashboards and process auditing
•Work flow design with exception handling (Try/Catch) and Recovery features
•Multi-user support (check-in/check-out) and versioning via a central repository
•Administration tool with scheduling capabilities and monitoring/dashboards
•Transform management for defining best practices
•Comprehensive administration and reporting tools
•Scalable scripting language with a rich set of built-in functions
•Interoperability and flexibility with Web services-based applications
•High performance parallel transformations and grid computing
•Debugging and built-in profiling and viewing data
•Broad source and target support
2010-12-0223
Product Overview
•applications (for example, SAP)
•databases with bulk loading and CDC changes data capture
For details about all the features in Data Services, see the
2.2 Product components
The Data Services product consists of several components including:
•Designer
The Designer allows you to create, test, and execute jobs that populate a data warehouse. It is a
development tool with a unique graphical user interface. It enables developers to create objects,
then drag, drop, and configure them by selecting icons in a source-to-target flow diagram. It allows
you to define data mappings, transformations, and control logic. Use the Designer to create
applications specifying work flows (job execution definitions) and data flows (data transformation
definitions).
•Job Server
The Job Server is an application that launches the Data Services processing engine and serves as
an interface to the engine and other components in the Data Services suite.
•Engine
The Data Services engine executes individual jobs defined in the application you create using the
Designer. When you start your application, the Data Services Job Server launches enough engines
to effectively accomplish the defined tasks.
Reference Guide.
•Repository
The repository is a database that stores Designer predefined system objects and user-defined
objects including source and target metadata and transformation rules. In addition to the local
repository used by the Designer and Job Server, you can optionally establish a central repository
for object sharing and version control.
The Designer handles all repository transactions. Direct manipulation of the repository is unnecessary
except for:
•Setup before installing Data Services
You must create space for a repository within your RDBMS before installing Data Services.
•Security administration
Data Services uses your security at the network and RDBMS levels.
•Backup and recovery
2010-12-0224
Product Overview
•Access Server
•Administrator
You can export your repository to a file. Additionally, you should regularly back up the database
where the repository is stored.
The Access Server passes messages between web applications and the Data Services Job Server
and engines. It provides a reliable and scalable interface for request-response processing.
The Web Administrator provides browser-based administration of Data Services resources, including:
•Scheduling, monitoring, and executing batch jobs
•Configuring, starting, and stopping real-time services
•Configuring Job Server, Access Server, and repository usage
•Configuring and managing adapters
•Managing users
•Publishing batch jobs and real-time services via Web services
The following diagram illustrates Data Services product components and relationships.
2.3 Using the product
You use Data Services to design, produce, and run data movement applications.
2010-12-0225
Product Overview
Using the Designer, you can build work flows and data flows that cleanse your data and specify data
extraction, transformation, and loading processes. In Data Services RealTime, you have the added
capability to build real-time data flows that support e-business transactions.
You create jobs to contain, organize, and run your flows. You create projects to organize the jobs.
Refine and build on your design until you have created a well-tested, production-quality application. In
Data Services, you can set applications to run in test mode or on a specific schedule. Using Data
Services RealTime, you can run applications in real time so they immediately respond to web-based
client requests.
2.4 System configurations
You can configure SAP BusinessObjects Data Services in various ways. The following diagram illustrates
one possible system configuration.
When integrating Data Services into your existing environment, consider:
•The servers shown in the diagram can be separate physical computers, or they can be installed on
a single computer.
•For peak performance, install and create the Data Services local repository on either the same
computer as the Data Services Job Server or on the same computer as the target data warehouse.
In either of the previous configurations, the computer should be on the same LAN segment as the
rest of the Data Services components.
2010-12-0226
Product Overview
As shown in the diagram, most Data Services components—the Designer, Job Server, and Access
Server—can run on the same Windows system, or you can install the Job Server on a UNIX system
running Hewlett Packard HP-UX, Sun Solaris, or IBM AIX.
2.4.1 Windows implementation
You can configure a Windows system as either a server or a workstation. A large-memory, multiprocessor
system is ideal because the multithreading, pipelining, and parallel work flow execution features in Data
Services take full advantage of such a system.
You can create your target data warehouse on a database server that may or may not be a separate
physical computer.
You can use a shared disk or FTP to transfer data between your source system and the Data Services
Job Server.
2.4.2 UNIX implementation
You can install the Data Services Job Server on a UNIX system. You can also configure the Job Server
to start automatically when you restart the computer.
2.5 The Designer window
The following illustration shows the key areas of the Designer window.
2010-12-0227
Product Overview
The key areas of the Data Services application window are:
1.
Project area — Contains the current project (and the job(s) and other objects within it) available to
you at a given time. In Data Services, all entities you create, modify, or work with are objects.
2.
Workspace — The area of the application window in which you define, display, and modify objects.
3.
Local object library — Provides access to local repository objects including built-in system objects,
such as transforms and transform configurations, and the objects you build and save, such as jobs
and data flows.
4.
Tool palette — Buttons on the tool palette enable you to add new objects to the workspace.
2.6 SAP BusinessObjects Data Services objects
In SAP BusinessObjects Data Services, all entities you add, define, modify, or work with are objects.
Objects have:
•Options that control the object. For example, to set up a connection to a database, defining the
database name would be an option for the connection.
•Properties that describe the object. For example, the name and creation date. Attributes are properties
used to locate and organize objects.
2010-12-0228
Product Overview
•Classes that determine how you create and retrieve the object. You can copy reusable objects from
The following diagram shows transform objects in the Data Services object library.
the object library. You cannot copy single-use objects.
When you widen the object library, the name of each object is visible next to its icon. To resize the
object library area, click and drag its border until you see the text you want, then release.
2.6.1 Object hierarchy
The following illustration shows the hierarchical relationships for the key object types within Data
Services.
2010-12-0229
Product Overview
In the repository, the Designer groups objects hierarchically from a project, to jobs, to optional work
flows, to data flows. In jobs:
•Work flows define a sequence of processing steps. Work flows and conditionals are optional. A
conditional contains work flows, and you can embed a work flow within another work flow.
•Data flows transform data from source(s) to target(s). You can embed a data flow within a work flow
or within another data flow.
2010-12-0230
Loading...
+ 140 hidden pages
You need points to download manuals.
1 point = 1 manual.
You can buy points or you can get point for every manual you upload.