U s e r G u i d e - Informatica · 2020-01-04 · Preface This I n f o r m a t i c a F a s t C l o...

Informatica® Fast Clone11.0

User Guide

Informatica Fast Clone User Guide11.0July 2018

© Copyright Informatica LLC 2011, 2018

This software and documentation are provided only under a separate license agreement containing restrictions on use and disclosure. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying, recording or otherwise) without prior consent of Informatica LLC.

U.S. GOVERNMENT RIGHTS Programs, software, databases, and related documentation and technical data delivered to U.S. Government customers are "commercial computer software" or "commercial technical data" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, the use, duplication, disclosure, modification, and adaptation is subject to the restrictions and license terms set forth in the applicable Government contract, and, to the extent applicable by the terms of the Government contract, the additional rights set forth in FAR 52.227-19, Commercial Computer Software License.

Informatica and the Informatica logo are trademarks or registered trademarks of Informatica LLC in the United States and many jurisdictions throughout the world. A current list of Informatica trademarks is available on the web at https://www.informatica.com/trademarks.html. Other company and product names may be trade names or trademarks of their respective owners.

Portions of this software and/or documentation are subject to copyright held by third parties. Required third party notices are included with the product.

The information in this documentation is subject to change without notice. If you find any problems in this documentation, report them to us at [email protected].

Informatica products are warranted according to the terms and conditions of the agreements under which they are provided. INFORMATICA PROVIDES THE INFORMATION IN THIS DOCUMENT "AS IS" WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING WITHOUT ANY WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND ANY WARRANTY OR CONDITION OF NON-INFRINGEMENT.

Publication Date: 2018-08-07

Table of Contents

Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6Informatica Resources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

Informatica Network. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

Informatica Knowledge Base. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

Informatica Documentation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Informatica Product Availability Matrixes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Informatica Velocity. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Informatica Marketplace. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Informatica Global Customer Support. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Chapter 1: Fast Clone Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8Product Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

Fast Clone Usage Scenarios. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

Fast Clone Targets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

Fast Clone Components. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

Fast Clone Console. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

Fast Clone Executable. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

Fast Clone Server. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

DataStreamer. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

Data Unload Methods. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

Supported Datatypes for Oracle Sources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

Fast Clone Deployment Topologies. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

Running Fast Clone on the Oracle Database Server. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

Running Fast Clone on a Standalone Computer. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

Running Multiple Fast Clone Instances with the Fast Clone Server. . . . . . . . . . . . . . . . . . . 16

Integration with Informatica Data Replication. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

Chapter 2: Configuring and Using the Fast Clone Server. . . . . . . . . . . . . . . . . . . . . 19Fast Clone Server Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

Fast Clone Server Configuration File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

Installing and Uninstalling the Fast Clone Server on Windows. . . . . . . . . . . . . . . . . . . . . . . . . 22

Starting the Fast Clone Server. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

Stopping the Fast Clone Server. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

Chapter 3: Creating Cloning Configuration Files in the Fast Clone Console. . . 23Configuration File Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

Starting the Fast Clone Console. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

Toolbar Buttons. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

Task Flow: Creating a Configuration File in the Fast Clone Console. . . . . . . . . . . . . . . . . . . . . . 25

Defining a Source. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

Table of Contents 3

Configuring a Connection to an Oracle ASM Instance. . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

Selecting Source Tables and Materialized Views for Unload Processing. . . . . . . . . . . . . . . . . . . 30

Defining SQL Queries for Unload Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

Customizing Unload Processing for Table Partitions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

Customizing Unload Processing for Table Columns. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

Excluding Table Columns from Unload Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

Filtering Column Data with a WHERE Clause. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

Specifying a Custom Format for a Source Column. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36

Changing the Order of Columns in Output Data Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

Defining a Target. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

Defining Amazon Redshift Targets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

Defining ASCII Flat Files as Targets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

Defining Hadoop Targets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

Configuring Runtime Settings. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

Specifying Output File Format Settings for the Loader. . . . . . . . . . . . . . . . . . . . . . . . . . . 44

Configuring Parallel Processing for Direct Path Unload Jobs . . . . . . . . . . . . . . . . . . . . . . . 50

Specifying Fast Clone File Names and Locations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

Configuring Miscellaneous Conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54

Tuning Conventional Path Unload Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

Configuring Fast Clone Server Tasks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60

Configuring Integration with Informatica Data Replication . . . . . . . . . . . . . . . . . . . . . . . . 60

Configuring Load Settings for Targets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

Configuring Reporting and SNMP Traps. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69

Saving a Cloning Configuration File Locally. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

Chapter 4: Unloading Data from the Source Database. . . . . . . . . . . . . . . . . . . . . . . . 72Unload Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

Character Set of the Unloaded Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

Unloading Data from Views. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

Unloading Data from TDE-Encrypted Columns and Tablespaces. . . . . . . . . . . . . . . . . . . . . . . . 74

Running Data Unload Jobs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75

Unloading Data to Named Pipes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75

Scenarios That Cause Cloning Processes to Remain Pending. . . . . . . . . . . . . . . . . . . . . . 76

Unloading Data to Named Pipes Created by Fast Clone. . . . . . . . . . . . . . . . . . . . . . . . . . 76

Unloading Data to Manually Created Named Pipes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77

Unloading Data to Files of Fixed-Length Record Format. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79

Generating the Data Files in XML Format. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79

Chapter 5: Loading Data to a Target. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81Load Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81

Configuration Considerations for Targets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

Generating Target Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86

Generating Target Tables Based on Source Table Schema. . . . . . . . . . . . . . . . . . . . . . . . 86

4 Table of Contents

Enabling the Extended Procedure for Generating Oracle Target Tables. . . . . . . . . . . . . . . . . 88

Validating the Target Schema. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

Disabling or Dropping Target Constraints. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

Transmitting the Output Files to the Target. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89

Loading Data to the Target Database. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89

Enabling or Creating Target Constraints. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90

Chapter 6: Remote Configuration Management. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91Remote Configuration Management Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

Enabling Remote Configuration Management. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

Opening a Configuration File on a Remote System from the Fast Clone Console. . . . . . . . . . . . . 92

Saving a Configuration File to a Remote System. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93

Chapter 7: Fast Clone Command Line Interface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95Command Line Interface Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

Syntax. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

Command Line Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96

Source Database Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96

Source Tables, Columns, and SQL Queries. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

Target Database Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98

Runtime Command Line Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99

Command Line Example. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

Appendix A: Troubleshooting. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114

Appendix B: Fast Clone Configuration File Parameters. . . . . . . . . . . . . . . . . . . . . . 116Configuration File Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

Source Database Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

Source Table Parameter. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118

SQL Query Parameter. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119

Source Column Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119

WHERE Condition Parameter. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120

Custom Column Format Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121

Target Database Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123

Runtime Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

Character Replacement Parameter. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147

Reporting Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148

Transparent Data Encryption Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149

Appendix C: Glossary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151

Index. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154

Table of Contents 5

PrefaceThis Informatica Fast Clone User Guide describes how to use the Informatica Fast Clone user interfaces to clone bulk data from an Oracle source to heterogeneous targets.

This guide is intended for users who need to configure, run, and monitor data cloning jobs, including DBAs, application developers, testing engineers, and system administrators.

This guide discusses how to perform the following tasks:

• Use the Fast Clone Console to configure, administer, monitor, and manage data unload jobs.

• Configure and use the Fast Clone Server in a distributed data-cloning topology.

• Run the Fast Clone executable from the Fast Clone Console and the command line.

• Load the unloaded data to the target database.

Informatica Resources

Informatica NetworkInformatica Network hosts Informatica Global Customer Support, the Informatica Knowledge Base, and other product resources. To access Informatica Network, visit https://network.informatica.com.

As a member, you can:

• Access all of your Informatica resources in one place.

• Search the Knowledge Base for product resources, including documentation, FAQs, and best practices.

• View product availability information.

• Review your support cases.

• Find your local Informatica User Group Network and collaborate with your peers.

Informatica Knowledge BaseUse the Informatica Knowledge Base to search Informatica Network for product resources such as documentation, how-to articles, best practices, and PAMs.

To access the Knowledge Base, visit https://kb.informatica.com. If you have questions, comments, or ideas about the Knowledge Base, contact the Informatica Knowledge Base team at [email protected].

6

HTTPS://NETWORK.INFORMATICA.COM/

http://kb.informatica.com

mailto:[email protected]

Informatica DocumentationTo get the latest documentation for your product, browse the Informatica Knowledge Base at https://kb.informatica.com/_layouts/ProductDocumentation/Page/ProductDocumentSearch.aspx.

If you have questions, comments, or ideas about this documentation, contact the Informatica Documentation team through email at [email protected].

Informatica Product Availability MatrixesProduct Availability Matrixes (PAMs) indicate the versions of operating systems, databases, and other types of data sources and targets that a product release supports. If you are an Informatica Network member, you can access PAMs at https://network.informatica.com/community/informatica-network/product-availability-matrices.

Informatica VelocityInformatica Velocity is a collection of tips and best practices developed by Informatica Professional Services. Developed from the real-world experience of hundreds of data management projects, Informatica Velocity represents the collective knowledge of our consultants who have worked with organizations from around the world to plan, develop, deploy, and maintain successful data management solutions.

If you are an Informatica Network member, you can access Informatica Velocity resources at http://velocity.informatica.com.

If you have questions, comments, or ideas about Informatica Velocity, contact Informatica Professional Services at [email protected].

Informatica MarketplaceThe Informatica Marketplace is a forum where you can find solutions that augment, extend, or enhance your Informatica implementations. By leveraging any of the hundreds of solutions from Informatica developers and partners, you can improve your productivity and speed up time to implementation on your projects. You can access Informatica Marketplace at https://marketplace.informatica.com.

Informatica Global Customer SupportYou can contact a Global Support Center by telephone or through Online Support on Informatica Network.

To find your local Informatica Global Customer Support telephone number, visit the Informatica website at the following link: http://www.informatica.com/us/services-and-training/support-services/global-support-centers.

If you are an Informatica Network member, you can use Online Support at http://network.informatica.com.

Preface 7

https://kb.informatica.com/_layouts/ProductDocumentation/Page/ProductDocumentSearch.aspx


https://network.informatica.com/community/informatica-network/product-availability-matrices

http://velocity.informatica.com


https://marketplace.informatica.com

http://www.informatica.com/us/services-and-training/support-services/global-support-centers/

http://network.informatica.com

C h a p t e r 1

Fast Clone OverviewThis chapter includes the following topics:

• Product Overview, 8

• Fast Clone Usage Scenarios, 9

• Fast Clone Targets, 9

• Fast Clone Components, 10

• Data Unload Methods, 11

• Supported Datatypes for Oracle Sources, 13

• Fast Clone Deployment Topologies, 15

• Integration with Informatica Data Replication, 18

Product OverviewFast Clone is a high-performance data cloning tool for unloading Oracle data and moving that data quickly and efficiently to other databases and warehouse appliances in a heterogeneous environment.

You can use Fast Clone in many scenarios. For example, use it to migrate data from test to production systems, clone data to another type of system or database, materialize targets for change data replication, or create a copy of your data for development or backup purposes.

Fast Clone runs on most Linux, UNIX, and Windows operating systems. It can move data from an Oracle source system to a broad range of target relational database systems and warehouse appliances on different types of Linux, UNIX, and Windows operating systems. You can also move data from an Oracle source system to a Hadoop Distributed File System (HDFS).

Fast Clone provides the following unload methods:

• Direct path unload. Reads data directly from Oracle data files, without using the Oracle Call Interface (OCI). This method is much faster than the conventional path unload method. However, you cannot use the direct path unload method to run unload jobs from a system that is remote from the Oracle source, perform SQL JOIN operations on two or more tables, or unload data from views or cluster tables.

• Conventional path unload. Uses the OCI to retrieve metadata and data from the Oracle source. This method is slower than the direct path unload method, but it does not have the limitations of the direct path unload method.

You can configure Fast Clone to unload data in ASCII, EBCDIC, COMP-3, or Teradata binary format. Fast Clone prepares the data for input to the native load utility of the target database.

8

Optionally, for Greenplum, Netezza, Teradata, and Vertica targets, Fast Clone provides the DataStreamer add-on component for even better performance. DataStreamer sends the unloaded data directly to Teradata Parallel Transporter (TPT), Greenplum Gpfdist, Netezza external tables, or Vertica targets. As a result, DataStreamer transfers data faster and reduces I/O and storage requirements.

Fast Clone includes the following additional features:

• Filters the Oracle tables and table partitions to be unloaded.

• Lets you define SQL queries, such as table joins for unloading data.

• Generates scripts for creating the target tables and for loading data.

• Provides a built-in compression feature that can quickly compress data during unload processing.

• Supports Oracle Automatic Storage Management (ASM), Oracle Cluster File Systems (OCFS), chained rows, fixed-length rows, and most datatypes including LOB datatypes.

• Supports different output formats including fixed length record format, EBCDIC, COMP-3, and Teradata binary format.

• Provides reporting and monitoring capabilities and can send alerts, such as SNMP traps.

• Provides integration with the Informatica Data Replication product to prepare targets for change data replication.

Fast Clone Usage ScenariosYou can use Fast Clone in many scenarios.

For example, use Fast Clone to perform the following tasks:

• Migrate data from an Oracle production system to another platform, without increasing overhead on the production system.

• Clone Oracle production data to create testing and development environments.

• Quickly create a backup of Oracle production data.

• Export data from Oracle tables in portable text format.

• Perform initial materialization of targets for Informatica Data Replication change data replication jobs.

• Stream data to Amazon Redshift, Greenplum, Netezza, Teradata, and Vertica data warehouse targets.

• Clone Oracle data to HDFS.

Fast Clone TargetsFast Clone unloads bulk data from Oracle sources. Fast Clone loads data to many types of relational databases and data warehouse appliances.

Fast Clone supports the following target types:

• Amazon Redshift

• Cloudera

• Flat files

Fast Clone Usage Scenarios 9

• Greenplum

• Hive

• Hortonworks

• Microsoft SQL Server

• Netezza

• Oracle

• Teradata

• Vertica

Fast Clone ComponentsInformatica Fast Clone is composed of multiple components, which can run as separate executables on different systems.

Fast Clone includes the following components:

• Fast Clone Console

• Fast Clone Server

• Fast Clone executable

• DataStreamer

Fast Clone ConsoleThe Fast Clone Console is the graphical user interface from which you configure and manage data-cloning jobs. From the Fast Clone Console, you can generate a cloning configuration file and run data-cloning jobs on a local or remote system. The cloning configuration file name is unload.ini by default. The Fast Clone Console runs on Linux, UNIX, and Windows. You can run it on the Oracle source system or on a standalone system. To start the Fast Clone Console, run gui.cmd on Windows or gui.sh on Linux or UNIX.

Fast Clone ExecutableThe Fast Clone executable, FastReader.exe on Windows or FastReader on Linux or UNIX, runs data-cloning jobs. You can start the Fast Clone executable from the command line or from the Fast Clone Console. If you run the Fast Clone executable from the command line, you can manually enter parameters to control unload processing.

For more information, see Chapter 7, “Fast Clone Command Line Interface” on page 95.

Fast Clone ServerThe Fast Clone Server is an optional add-on component that you can purchase to enable network communication across systems in a distributed Fast Clone topology. These systems can include the Fast Clone Console and Fast Clone instances on source and target database systems.

The Fast Clone Server runs as a Windows service or as a daemon on Linux or UNIX.

Use the Fast Clone Server to initiate unloads of Oracle data and metadata on a remote system or to transmit the output files to the target system. For example, use the Fast Clone Server to retrieve Oracle data that was

10 Chapter 1: Fast Clone Overview

unloaded by scheduled Fast Clone unload operations on separate Oracle systems and then make that data available to another Fast Clone instance for loading to the target database.

DataStreamerThe DataStreamer component is an add-on component that you can purchase for Amazon Redshift, Greenplum, Netezza, Teradata, and Vertica targets. For Greenplum, Netezza, and Teradata targets, DataStreamer is an optional component to load data faster. For Amazon Redshift targets, DataStreamer is a required component that is used transparently and always enabled.

With DataStreamer, you must use the direct path unload method. Depending on the target type, DataStreamer streams the unloaded Oracle data to the target in one of the following ways:

• For Amazon Redshift targets, DataStreamer sends the unloaded data to the Amazon Simple Storage Service (Amazon S3). After the source data is in Amazon S3 storage, Fast Clone issues a copy command that copies the data to the Amazon Redshift target tables.If you plan to run unload jobs on a Windows system, you must install the PostgreSQL ODBC driver on the system. If you plan to run unload jobs on a Linux and UNIX system, use the DataDirect ODBC driver for PostgreSQL that Fast Clone provides.

• For Greenplum targets, DataStreamer sends the unloaded data directly to the Greenplum parallel file distribution server (gpfdist) for loading to the target.

• For Netezza targets, DataStreamer writes the unloaded data to the named pipes that represent the Netezza external tables. The Netezza ODBC driver reads the data from these pipes and loads the data to the Netezza target tables.To use the Netezza DataStreamer, you must install the Netezza ODBC driver on the system where you plan to run unload jobs.

• For Teradata targets, you must install the Teradata Parallel Transporter (TPT) libraries on the system where you plan to run unload jobs. DataStreamer sends unloaded source data directly to a TPT operator (Update Operator, Load Operator, or Stream Operator) to load data to the target.

• For Vertica targets, DataStreamer uses the COPY command on the server side or the LCOPY command on the client side to send the unloaded data directly to Vertica targets.

Note: If you use DataStreamer to load Amazon Redshift, Greenplum, Netezza, Teradata, or Vertica targets, set the suppress_trailing_nullcols parameter value to false. You can set this parameter in the configuration file or in the Fast Clone Console on the Runtime Settings tab > Format Settings view. On the Format Settings view, clear the Suppress trailing null columns option. For more information, see “Runtime Parameters” on page 126.

Data Unload MethodsFast Clone supports conventional path and direct path unload methods.

The direct path unload method reads data directly from Oracle data files and requires the Oracle user to have special user permissions, such as SELECT FROM DICTIONARY. This method provides high-performance unloads of data without increasing overhead on the source system. With this method, you can use the DataStreamer component to additionally enhance performance. By default, Fast Clone does not lock the source tables to unload data. However, you can change this behavior to perform a consistent read of the source data.

Note: Fast Clone cannot read data from Oracle data files that are on a standalone system. Fast Clone must connect to a running Oracle instance to retrieve metadata for unload processing.

Data Unload Methods 11

The conventional path unload method uses the OCI to retrieve metadata and data from the Oracle source. This method usually provides an acceptable and efficient level of performance but is slower than the direct path unload method. With the conventional path unload method, you can run Fast Clone on a system that is remote from the source database server without any special permissions.

The following table describes the differences between the direct path unload and conventional path unload methods:

Feature Direct Path Unload Conventional Path Unload

Reads data directly from the Oracle source files

Yes No

Performs high performance unloads Yes. Provides very high-performance unloads, up to 20 times faster than the native database exports.

Yes. When Fast Clone unloads data locally, performance is good but CPU consumption can be twice that of the direct path unload method. When Fast Clone unloads data remotely, performance might be slower than that of unload on a local system.

Supports remote execution of Fast Clone

Yes, only if the Fast Clone Server is used.

Yes

Supports SQL queries that perform joins between tables

No Yes

Works with DataStreamer Yes No

Unloads data from Oracle Exadata Database Machine

No Yes. Fast Clone must use one thread per table to unload data from Oracle Exadata.

Unloads data from views and cluster tables

No Yes

Performs parallel processing of data unloads using multiple threads

Yes Yes. For partitioned tables, Fast Clone uses one thread per table partition or subpartition. For tables that are not partitioned, Fast Clone distributes work across threads based on the Oracle ROWID value, provided that unload processing is local.

Integrates with Informatica Data Replication

Yes Yes


Supported Datatypes for Oracle SourcesFast Clone supports most Oracle datatypes for bulk data movement.

The following table identifies the datatypes that Fast Clone supports and does not support for Oracle source databases:

Datatype Supported? Comments

ANY types No -

BINARY_DOUBLE Yes When unloading data, Fast Clone might express very large or small numbers that have many digits to the left or right of the decimal point in scientific notation with a maximum precision of eight digits. Loss of data accuracy might occur.

BINARY_FLOAT Yes When unloading data, Fast Clone might express very large or small numbers that have many digits to the left or right of the decimal point in scientific notation with a maximum precision of eight digits. Loss of data accuracy might occur.

BFILE No -

BLOB Yes For Oracle 11g and 12c sources, Fast Clone uses the direct path unload method by default to unload data from SecureFile BLOB columns. To use the conventional path unload method for SecureFile BLOBs, set the read_securefile_by_direct configuration file parameter to false or clear the Read SECUREFILE by direct option on the Runtime Settings tab > Miscellaneous Conditions view in the Fast Clone Console.

CHAR Yes -

CLOB Yes For Oracle 11g and 12c sources, Fast Clone uses the direct path unload method by default to unload data from SecureFile CLOB columns. To use the conventional path unload method for SecureFile CLOBs, set the read_securefile_by_direct configuration file parameter to false or clear the Read SECUREFILE by direct option on the Runtime Settings tab > Miscellaneous Conditions view in the Fast Clone Console.

DATE Yes -

FLOAT Yes -

Expression Filter Type No -

INTERVAL DAY TO SECOND No -

INTERVAL YEAR TO MONTH No -

LONG Yes -

LONG RAW Yes -

Media types No -

Supported Datatypes for Oracle Sources 13


MLSLABEL No -

NCHAR Yes -

NCLOB Yes For Oracle 11g and 12c sources, Fast Clone uses the direct path unload method by default to unload data from SecureFile NCLOB columns. To use the conventional path unload method for SecureFile NCLOBs, set the read_securefile_by_direct configuration file parameter to false or clear the Read SECUREFILE by direct option on the Runtime Settings tab > Miscellaneous Conditions view in the Fast Clone Console.

NUMBER Yes Maximum 128 characters

NVARCHAR2 Yes For Oracle 12c sources, Fast Clone uses the direct path unload method by default to unload data from extended NVARCHAR2 columns. To use the conventional path unload method for the extended NVARCHAR2 datatype, set the read_securefile_by_direct configuration file parameter to false or clear the Read SECUREFILE by direct option on the Runtime Settings tab > Miscellaneous Conditions view in the Fast Clone Console.

RAW Yes For Oracle 12c sources, Fast Clone uses the direct path unload method by default to unload data from extended RAW columns. To use the conventional path unload method for the extended RAW datatype, set the read_securefile_by_direct configuration file parameter to false or clear the Read SECUREFILE by direct option on the Runtime Settings tab > Miscellaneous Conditions view in the Fast Clone Console.

REF No -

ROWID Yes Supported only with the conventional path unload method

SDO_GEOMETRY Yes Supported only for Oracle targets

TIMESTAMP Yes -

TIMESTAMP WITH LOCAL TIMEZONE

Yes -

TIMESTAMP WITH TIMEZONE Yes -

URI types No -

UROWID Yes Supported only with the conventional path unload method

User-defined types No -

VARCHAR Yes -



VARCHAR2 Yes For Oracle 12c sources, Fast Clone uses the direct path unload method by default to unload data from extended VARCHAR2 columns. To use the conventional path unload method for the extended VARCHAR2 datatype, set the read_securefile_by_direct configuration file parameter to false or clear the Read SECUREFILE by direct option on the Runtime Settings tab > Miscellaneous Conditions view in the Fast Clone Console.

XML types Yes The Fast Clone Console displays these rows as CLOB.

Fast Clone Deployment TopologiesYou can use Fast Clone in different topologies to clone data. You can deploy Fast Clone components on a single system and run data unload jobs locally or on different systems in a distributed environment.

In a distributed deployment topology, run multiple Fast Clone Server instances to initiate unload requests on a remote system or to transmit the output files to the target system.

Review the sample network topologies to learn where to install Fast Clone components under alternative scenarios.

Running Fast Clone on the Oracle Database ServerTypically, you run Fast Clone on the system where the Oracle source database runs. In this topology, you can use the direct path unload method for faster bulk data movement.

The following image shows a typical topology in which Fast Clone runs on the source database server:

Fast Clone components, such as the Fast Clone Console and Fast Clone executable, run on the source database system. From the Fast Clone Console, you can create cloning configuration files and run data unload jobs. Fast Clone unloads data to output files on the source system.

After you copy the output files to the target system, you can run the load script on the target system to load data. Fast Clone uses the target database load utility to load data.

Alternatively, you can install the target database load utility on the source system and run the load script on the source system. If you load data from pipes, install the target load utility on the source system.

Fast Clone Deployment Topologies 15

Running Fast Clone on a Standalone ComputerIf you cannot install Fast Clone on the Oracle source system because of system or security restrictions, you can deploy Fast Clone on a standalone system. In this topology, Fast Clone uses the conventional path unload method to unload Oracle data from the remote source database.

The following image shows a deployment topology in which Fast Clone runs on a standalone system:

Fast Clone components, such as the Fast Clone Console and Fast Clone executable, run on the standalone system. From the Fast Clone Console, you can create cloning configuration files and run data unload jobs locally on the Fast Clone system. Fast Clone uses the Oracle Call Interface (OCI) to read the Oracle source data from the remote source system. Fast Clone unloads data to output files.

After you copy the output files to the target system, you can run the load script on the target system to load data. Fast Clone uses the target database load utility to load data.

Alternatively, you can install the target database load utility on the system where you run the Fast Clone executable and then run the load script on this system. If you load data from pipes, install the target load utility on the system where you run the Fast Clone executable.

Running Multiple Fast Clone Instances with the Fast Clone ServerYou can deploy multiple Fast Clone instances on different systems in a distributed environment. In this topology, you must use the Fast Clone Server to communicate with remote Fast Clone instances.

You can run the Fast Clone Server on the source database system, on the target database system, or on both the source and target systems.

When you run the Fast Clone Server on the source system, the Fast Clone Server accepts unload requests from the Fast Clone Console and runs data unload jobs. Run the Fast Clone Server on the source system if you installed the Fast Clone Console on a standalone system and want to run data unload jobs that use the direct path unload method from the Fast Clone Console.

When you run the Fast Clone Server on the target system, the Fast Clone Server accepts the output files from a remote Fast Clone instance. You do not need to copy the output files manually to the target. Run the Fast Clone Server on the target system if you run scheduled tasks for cloning data.


Running a Fast Clone Server on the Source System

The following image shows a deployment topology in which the Fast Clone Server runs on the source database system:

In this topology, the Fast Clone Server and Fast Clone executable run on the source system. The Fast Clone Console runs on a standalone system. From the Fast Clone Console, you can create cloning configuration files and initiate data unload jobs that run under the control of the Fast Clone Server. You can use either the direct path unload method or conventional path unload method. Fast Clone produces output files on the source system.

After you copy the output files to the target system, you can run the load script on the target system to load data. Fast Clone uses the target database load utility to load data. Alternatively, you can install the target database load utility on the source system and run the load script on the source system. Install the target load utility on the source system if you load data from pipes.

Running a Fast Clone Server on Both the Source and Target Systems

You can also run another Fast Clone Server instance on the target system to accept output files sent from the Fast Clone instance on the source system.

The following image shows a deployment topology that uses two Fast Clone Server instances for distributed data movement:

Fast Clone Deployment Topologies 17

In this topology, you do not have to manually copy the output files to the target.

Integration with Informatica Data ReplicationIf you have a license for Informatica Data Replication, you can use Fast Clone to perform an initial load of Data Replication targets.

Data Replication replicates transactional data between heterogeneous databases and platforms while maintaining transactional integrity and data consistency. Data Replication performs low-latency batched replication or continuous replication of change data and metadata.

Fast Clone can use a Data Replication configuration file to perform a high-speed initial load of Oracle data to the target systems. Fast Clone supports all of the targets that Informatica Data Replication supports.


C h a p t e r 2

Configuring and Using the Fast Clone Server

This chapter includes the following topics:

• Fast Clone Server Overview, 19

• Fast Clone Server Configuration File, 19

• Installing and Uninstalling the Fast Clone Server on Windows, 22

• Starting the Fast Clone Server, 22

• Stopping the Fast Clone Server, 22

Fast Clone Server OverviewThe Fast Clone Server enables network communication across systems in a distributed Fast Clone topology. These systems can include the Fast Clone Console and Fast Clone instances on source and target database systems. Use the Fast Clone Server to initiate unloads of Oracle data and metadata or to transmit the output files to the target database system.

The Fast Clone Server configuration file contains parameters that control how the Fast Clone Server runs. The Fast Clone Server reads this file when you start the server. If you edit parameters in this file, restart the Fast Clone Server for the changes to take effect.

When you start the Fast Clone Server from the command line, enter the SERVER_CONFIG parameter to specify the server configuration file.

Fast Clone Server Configuration FileFast Clone provides a predefined server configuration file, server.ini, in the top-level Fast Clone installation directory. You can edit this file to override the default Fast Clone Server settings.

The Fast Clone Server configuration file contains the following parameters to control how the Fast Clone Server runs.

19

Parameter Descriptions

allow_local_output_directories=list_of_directories

Identifies the directories on the Fast Clone Server system to which Fast Clone can write output files. You can include the question mark (?) and asterisk (*) wildcards to define a mask that matches multiple directories. When you define the cloning configuration file, unload.ini, ensure that the path and directory name that you specify in the Output directory field matches one of directories in this list.

Default value: Asterisk (*)

allow_remote_extraction_hosts=list_of_hosts

Identifies the hosts from which the Fast Clone Server can receive the cloning configuration files for remote unload processing. Enter the IP addresses or host names, separated by a semicolon. You can include the question mark (?) and asterisk (*) wildcards to define a mask for the hosts. Enter only the asterisk (*) wildcard to configure the Fast Clone Server to receive unload configuration files from any remote host.

For example:

allow_remote_extraction_hosts=84.109.19.100; 84.109.19.101; 84.109.19.102Default value: Asterisk (*)

allow_remote_extraction_requests={true|false}

Indicates whether the Fast Clone Server can receive unload requests from remote systems that have a Fast Clone instance. Valid values are:

• true. The Fast Clone Server can receive unload requests from a remote Fast Clone instance and run the unload jobs. For example, if you run the Fast Clone Console on a standalone system, the Fast Clone Server accepts unload requests and associated configuration files from the Fast Clone Console and runs the unload jobs.

• false. The Fast Clone Server does not accept unload requests from a remote Fast Clone instance. Enter this value for security purposes. This value ensures that the Fast Clone Server does not accept unload requests from a remote system that is not your data-cloning topology.

Default value: true

allow_recieve_remote_files={true|false}

Indicates whether the Fast Clone Server can receive output files from a remote Fast Clone system. Valid values are:

• true. The Fast Clone Server can receive output data files from another system for load processing. Use this option for a Fast Clone Server that you run on the target database system to receive output files from the source database system.

• false. If you run data load jobs on the system where Fast Clone produces the output files, or if you manually transmit the output files to the target system, enter this value for security purposes. This value ensures that the Fast Clone Server does not accept output files from a remote system that is not in your data-cloning topology.

You can specify the allow_recieve_remote_files_hosts parameter to identify the host systems from which the Fast Clone Server can receive output files.

Default value: false

allow_recieve_remote_files_hosts=list_of_hosts

Identifies the hosts from which the Fast Clone Server can receive output files. Enter the IP addresses or host names, separated by a semicolon. You can include the question mark (?) and asterisk (*) wildcards

20 Chapter 2: Configuring and Using the Fast Clone Server

to define a mask. Enter only the asterisk (*) wildcard to enable the Fast Clone Server to receive output files from any remote host.

For example:

allow_recieve_remote_files_hosts=84.109.19.100; 84.109.19.101; 84.109.19.102Default value: Asterisk (*)

bind_ip=list_of_IP_addresses

Specifies the IP addresses of the remote Fast Clone instances for which the Fast Clone Server listens for incoming requests. Enter 0.0.0.0 for all IPv4 addresses or [::] for all IPv6 addresses.

Default value: 0.0.0.0

listener_port=port_number

Specifies the port on which the Fast Clone Server listens for incoming requests. Valid values are from 1 through 65535.

Default value: 5521

log={true|false}

Indicates whether the Fast Clone Server writes informational and error messages to a log file. Options are:

• true. The Fast Clone Server creates a log file in the directory that you specify in the logdir parameter.

• false. The Fast Clone Server does not use a log file.


logdir=directory_path

If you set the log parameter to true, specifies the directory where the Fast Clone Server creates the message log file.

Default value:

• C:\\temp on Windows

• /tmp on Linux and UNIX

log_level=n

If you set the log parameter to true, specifies the verbose level for the Fast Clone Server logged output. Larger values provide more detailed output. Valid values are from 0 through 4.

Default value: 0

max_connections=n

Specifies the maximum number of Fast Clone client connections to the Fast Clone Server. Valid values are from 1 through 3000.

Default value: 150

Fast Clone Server Configuration File 21

Installing and Uninstalling the Fast Clone Server on Windows

On Windows, you must install the Fast Clone Server service before you can start it. Later, you can uninstall the Fast Clone Server service to upgrade Fast Clone or deploy the Fast Clone instance to another system.

• To install the Fast Clone Server service, run the install_server.cmd script.Alternatively, run the following commands from the command line to install the Fast Clone Server service:

cd Fast_Clone_installation_directoryFastReader.exe RUN_AS_SERVICE=true SERVER_CONFIG=server.ini -i

• To uninstall the Fast Clone Server service, stop the Fast Clone Server and then run the uninstall_server.cmd script.Alternatively, run the following commands from the command line to uninstall the Fast Clone Server service:

cd Fast_Clone_installation_directoryFastReader.exe RUN_AS_SERVICE=true SERVER_CONFIG=server.ini -r

Starting the Fast Clone ServerTypically, you run Fast Clone Server instances on the source system and on the target system. A Fast Clone Server runs as a service on Windows or as a daemon on Linux or UNIX.

To start the server, perform one of the following actions:

• On Windows, after you install the Fast Clone Server service, run the start_server.cmd script to start the Fast Clone Server service.Alternatively, run the following commands from the command line to start the Fast Clone Server service:

cd Fast_Clone_installation_directoryFastReader.exe RUN_AS_SERVICE=true SERVER_CONFIG=server.ini –s

• On Linux or UNIX, run the server.sh script to start the Fast Clone Server daemon.

Stopping the Fast Clone ServerYou might need to stop the Fast Clone Server service or daemon to upgrade Fast Clone or perform other administrative tasks.

• On Windows, run the stop_server.cmd script to stop the Fast Clone Server.Alternatively, run the following commands from the command line to stop the Fast Clone Server:

cd Fast_Clone_installation_directoryFastReader.exe RUN_AS_SERVICE=true SERVER_CONFIG=server.ini –k

After you stop the Fast Clone Server service, you can uninstall it, if necessary.

• On Linux or UNIX, run the following command from the command line to stop the Fast Clone Server:

kill -9 fast_clone_server_process_id

22 Chapter 2: Configuring and Using the Fast Clone Server

C h a p t e r 3

Creating Cloning Configuration Files in the Fast Clone Console


• Configuration File Overview, 23

• Starting the Fast Clone Console, 24

• Task Flow: Creating a Configuration File in the Fast Clone Console, 25

• Defining a Source, 26

• Selecting Source Tables and Materialized Views for Unload Processing, 30

• Defining SQL Queries for Unload Processing, 31

• Customizing Unload Processing for Table Partitions, 33

• Customizing Unload Processing for Table Columns, 34

• Defining a Target, 38

• Configuring Runtime Settings, 43

• Configuring Reporting and SNMP Traps, 69

• Saving a Cloning Configuration File Locally, 70

Configuration File OverviewIn the Fast Clone Console, you can create and manage cloning configuration files. Fast Clone uses the configuration files to unload source data, generate load scripts, enable integration with Informatica Data Replication, and configure reporting and SNMP traps for monitoring.

A configuration file is a text file. Informatica recommends that you use the Fast Clone Console to create and manage configuration files to reduce the risk of error. Alternatively, you can create and edit configuration files in a text editor.

By default, the Fast Clone Console saves a configuration file as unload.ini to the FastClone_installation directory in a local file system. You can optionally save the configuration file under another name.

In the Fast Clone Console, you can open a configuration file to edit unload settings or use it to run a data unload job. If you open the configuration file to run data unload job, the Fast Clone Console overrides the unload.ini file in the FastClone_installation directory if it already exists.

23

If you use the Fast Clone Server to run data unload jobs on a remote system, you can enable remote configuration management in the Fast Clone Console. You can then open and save the configuration files on the remote system from the Fast Clone Console using an FTP server.

A configuration file is not required for data cloning if you run Fast Clone from the command line. However, you must then manually specify all of the parameters at the command line for the Fast Clone executable.

Related Topics:• “Fast Clone Command Line Interface” on page 95

• “Fast Clone Configuration File Parameters” on page 116

Starting the Fast Clone ConsoleYou can run the Fast Clone Console on the source or target system or on a standalone system.

Before you start the Fast Clone Console the first time, verify that the following system requirements are met:

• The Java Runtime Environment (JRE) 1.8 is installed and the JRE_HOME environment variable points to the JRE base directory.

Note: For Hadoop and Hive Targets, the Fast Clone executable requires 64-bit JRE.

• The JRE_HOME directory is in the PATH environment variable.

• On Linux or UNIX, an X Window environment is configured.

For more information about prerequisites for the Fast Clone Console, see the Informatica Fast Clone Installation Guide.

To start the Fast Clone Console, run one of the following scripts that are located in the FastClone_installation directory:

• On Windows, run gui.cmd.

• On Linux or UNIX, run gui.sh.

Toolbar ButtonsInstead of choosing commands from the menu, you can click buttons on the toolbar to quickly initiate common actions.

The following buttons appear on the toolbar:

Button Description

Exits the Fast Clone Console.

Opens an existing configuration file.

Opens an existing configuration file from a remote system. This button appears only if remote configuration management is enabled.

Generates a configuration file based on your entries and saves it to a local file system, or saves an updated configuration file.

24 Chapter 3: Creating Cloning Configuration Files in the Fast Clone Console

Button Description

Transmits a configuration file to a remote system. This button appears only if remote configuration management is enabled.

Refreshes schemas for the source and target databases.

Starts a Fast Clone data unload job for the open configuration file.

Re-enables or creates target constraints that were previously disabled or dropped.

Disables or drops target constraints to avoid constraint violations during data loading.

Validates the target schema.

Generates the data files in XML format.

Generates SQL CREATE TABLE scripts to create target tables based on the source table schema.

Manages user-defined SQL queries for data unload jobs.

Task Flow: Creating a Configuration File in the Fast Clone Console

Perform the following tasks to create a configuration file for a data-cloning job in the Fast Clone Console:

1. Define the source database. See “Defining a Source” on page 26.

2. Select the source tables from which to unload data. See “Selecting Source Tables and Materialized Views for Unload Processing” on page 30.

3. Optional. Define SQL queries, such as table joins for unloading data. See “Defining SQL Queries for Unload Processing” on page 31.

4. Optional. If you unload data from partitioned tables, exclude selected partitions from the data unload job. See “Customizing Unload Processing for Table Partitions” on page 33.

5. Optional. Exclude selected columns from the data unload job. See “Customizing Unload Processing for Table Columns” on page 34.

6. Define the target database. See “Defining a Target” on page 38.

7. Configure runtime settings. See “Configuring Runtime Settings” on page 43.

8. Optional. Configure reporting and SNMP monitoring and traps. See “Configuring Reporting and SNMP Traps” on page 69.

9. Save the configuration file. See “Saving a Cloning Configuration File Locally” on page 70.

Task Flow: Creating a Configuration File in the Fast Clone Console 25

Defining a SourceFor Fast Clone to connect to the Oracle source database to unload data, you must enter the source database connection information. The Fast Clone Console saves this information to the configuration file. When you run an unload job, Fast Clone uses this information to generate a connection string for Oracle Net Services.

1. Click the Source DB tab > Source Database view.

The following image shows this view:

If you successfully connected to the Oracle source previously, Fast Clone displays the saved connection information.

2. Enter the connection information as needed.

The following table describes the connection fields:

Field Description

Hostname The host name or IP address of the system where the source database runs. The default value is the local host name.

Port The port number that Fast Clone uses to connect to the source database. The default port number is 1521.

Username A user name that has the privileges required to unload data from the source database. For more information about required user privileges, see Informatica Fast Clone Installation Guide.


Field Description

Password A valid password for the specified user.

Instance The Oracle instance name. If you leave this field blank, Fast Clone uses the ORACLE_SID or TWO_TASK environment variable to determine the Oracle instance name.

SSL Select this option to configure Fast Clone to use the TCP/IP protocol with the Secure Sockets Layer (SSL), also called the TCPS protocol, to connect to the Oracle source.Important: If you use TCPS connections between the Fast Clone Console and remote Oracle databases, you must install the 64-bit Oracle Instant Client on the Fast Clone Console system.

Remote Configuration Management

Select this option to enable remote configuration management. For more information, see Chapter 6, “Remote Configuration Management” on page 91.

RAC connect support

Select this option if you unload data from an Oracle database in a RAC environment. To connect to the Oracle RAC, the Fast Clone Console uses the instance name that is specified in the Instance field and the Fast Clone executable uses a service name that is specified in the Service field.

OS Authentication in Fast Clone

Select this option to configure Fast Clone to use operating system authentication to connect to the Oracle database.

Connect Direct in Informatica Fast Clone Console

Select this option to configure the Fast Clone executable to use the Bequeath protocol to connect to the Oracle source.

Use custom url Select this option to configure the Fast Clone executable and Fast Clone Console to use different connection settings. In this case, Fast Clone uses the following connection settings to connect to the source database:- The Fast Clone executable uses the connection information that you provide under

Connection Details to generate a connection string.- The Fast Clone Console uses the connection string that you enter in the JDBC custom

URL field.

JDBC custom URL If you select the Use custom url option, specifies a connection URL for the JDBC driver that the Fast Clone Console uses to connect to an Oracle source. This URL has the following format:

jdbc:oracle:thin@[host][:port]:SIDIf the Console uses an SSL connection to an Oracle source, the URL has the following format:

jdbc:oracle:oci@[host][:port]:SIDIf you enter an SSL connection URL, clear the SSL option.Important: JDBC custom connection URLs always use the values that are specified in the Username and Password fields. Ensure that any JDBC custom connection URL that you specify does not include a user name and a password.

Service For an Oracle database in a RAC environment, specifies the Oracle service name that the Fast Clone executable uses to connect to the source database.

Defining a Source 27

3. Click Connect to connect to the source.

If the connection is successful, the Source Tables tab opens. If the connection is not successful, the Fast Clone Console reports an error.

Tip: To edit connection information, click Disconnect, edit connection information, and then click Connect again.

Configuring a Connection to an Oracle ASM InstanceIf you use Oracle Automatic Storage Management (ASM) to manage storage for Oracle source data, you must enter connection settings for the ASM instance.

1. Click the Source DB tab > ASM Settings view.


2. Select the Define and use ASM instance option.

All of the fields on the ASM Settings view become available.

3. Enter connection information for the ASM instance.

Note: If your Oracle source is in a RAC with multiple ASM instances, enter connection information for any one of the running Oracle ASM instances.


The following table describes the fields on the ASM Settings view:

Field Description

ASM instance hostname

The host name or IP address of the system with the ASM instance. The default value is the local host name.

ASM instance port The port number that is used to connect to the ASM instance. The default value is 1521.

ASM sys username

The user name of the default Oracle ASM SYS user that was created at Oracle installation, or another user that you define. This user must have SYSDBA privileges.

ASM sys password

A password for the ASM SYS user.

ASM instance A service name for the ASM instance. The default value is +ASM.

SSL Select this option to configure Fast Clone to use the TCP/IP protocol with the Secure Sockets Layer (SSL), also called the TCPS protocol, to connect to the ASM instance.Important: If you use TCPS connections between the Fast Clone Console and remote Oracle databases, you must install the Oracle Instant Client on the Fast Clone Console system.

Note: If Fast Clone reports an ORA-12528 error when connecting to an ASM instance, add the ASM instance to the Oracle listener.ora configuration file and restart the Oracle listener process.

Defining a Source 29

Selecting Source Tables and Materialized Views for Unload Processing

On the Source Tables tab, select one or more source tables or materialized views from which to unload data and specify the unload method for these tables or materialized views.

1. Click the Source Tables tab.

The following image shows this tab:

2. Under Owner, select a schema owner name in one of the following ways:

• Select the schema owner from the list.

• Enter the first few letters of the schema owner name. If a matching owner name exists in the source database, the Fast Clone Console completes the name.

Note: The owner name is the same as the schema name.

The Tables box lists all of the tables and materialized views with the specified schema owner name. The Support Direct column indicates whether the table or materialized view supports the direct path unload method.

Tip: To view only the tables and materialized views that support the direct path unload method, click Show Direct Supported. To view all of the tables and materialized views in the schema again, click Show All.


3. To select a subset of source tables or materialized views for unload processing and set the unload method for each one, complete the following steps:

a. Select the Unload check box for each table or materialized view, or click Select All and then clear the Unload check box for any table or materialized view that you do not want to unload data from.

Note: The Select None button clears all selections to exclude all of the tables and materialized views from unload processing.

b. To use the direct path unload method for a selected table or materialized view, ensure that the Perform Direct Unload check box is selected for the table or materialized view.

By default, the Fast Clone Console selects the direct path unload method for all of the source tables and materialized views for which Fast Clone supports the direct path unload method.

c. To use the conventional path unload method for a selected table or materialized view, clear the Perform Direct Unload check box for the table or materialized view.

To clear the Perform Direct Unload check box for all of the tables and materialized views, click Make None Direct. If you configure Fast Clone to unload data from a remote source database server, click Make None Direct to use the conventional path method.

4. To select all of the tables and materialized views for the specified schema owner for unload processing, complete the following steps:

a. Select the Unload data from entire schema for selected owner option.

b. In the Unload type list, select an option for the unload method to use for the tables and materialized views in the schema. Options are:

• Let Informatica Fast Clone decide during unload. Fast Clone uses the direct path unload method for the tables and materialized views that support this method. Fast Clone uses the conventional path unload method for the tables and materialized views that do not support the direct path unload method.

• Perform direct unload for all tables. Fast Clone uses the direct path unload method for all of the tables and materialized views. Do not use this option if the source schema includes one or more tables or materialized views that do not support the direct path unload method.

• Perform conventional unload for all tables (none direct). Fast Clone uses the conventional path unload method for all of the tables and materialized views. Use this option if you need to unload data from a remote source database server.

Defining SQL Queries for Unload ProcessingYou can define SQL query statements that are processed when the data unload jobs run. For example, create a query that performs table joins or selects data from a view.

Fast Clone uses the conventional path unload method for unload jobs that use configuration files with user-defined SQL queries. During unload processing, Fast Clone generates an output file that contains the query

Defining SQL Queries for Unload Processing 31

results for each query that you define. Fast Clone uses the query name that you specify in the configuration file as the output file name.

1. Click Data > Add Conventional SQL on the menu bar, or click the Add conventional path SQL queries button on the toolbar.

The menu command and toolbar button are not available until you connect to the source database.

The Edit SQL Queries dialog box appears, as shown in the following image:

Note: This dialog box lists SQL queries that you previously defined. To edit or delete an existing SQL query, click Edit or Delete in the query row.

2. To add an SQL query, click Add.

The New Query dialog box appears, as shown in the following image:

3. In the Query Name field, enter a unique query name.

Fast Clone uses this value to name the query output file.

4. In the text box, type the query, or copy and paste it from an SQL editor.

5. Click Validate to validate the query.

Note: The Fast Clone Console does not execute the query when validating it.

6. If the validation is successful, click Save to save the query.

The query appears in the SQL query list.


Related Topics:• “Unloading Data from Views” on page 74

Customizing Unload Processing for Table PartitionsFor partitioned source tables, Fast Clone selects all partitions by default for unload processing. You can exclude certain partitions. Also, for the partitions that you select, you can write the unloaded data to separate output files.

1. Click the Source Table Partitions tab.


2. Under Table, select a partitioned source table for which to exclude some partitions from unload processing.

The table partitions appear in the Table Partitions box.

3. To exclude a table partition, clear the check box in the partition row.

Tip: If you want to exclude most table partitions, click Select None to clear all partitions and then select the few that you want to include. Click Select All to include all of the partitions again.

4. To write the source data that is unloaded from each selected table partition to a separate output data file, select the table and then select the Export table by partition option.

Note: Verify that the Export table by partition check box is selected for each partitioned table that you select.

Customizing Unload Processing for Table Partitions 33

Customizing Unload Processing for Table ColumnsOn the Source Columns tab, you can exclude source table columns from unload processing, define WHERE clauses for filtering data, customize date and timestamp formats, control numeric precision and padding, and change the order of columns in the output files.

Excluding Table Columns from Unload ProcessingBy default, Fast Clone unloads data from all of the columns in a selected source table. You can optionally exclude particular columns from unload processing, such as those with unsupported datatypes.

1. Click the Source Columns tab.


2. In the Table list, select a source table for which to exclude columns from unload processing.

All of the table columns appear in the Table Columns box.

3. To exclude a column, clear the Unload option in the column row.

Important: Ensure that you exclude all of the columns that have unsupported datatypes. For more information about supported Oracle datatypes, see “Supported Datatypes for Oracle Sources” on page 13.

Tip: If you want to exclude most of the columns, click Select None to clear all columns and then select the few columns that you want to include. Click Select All to include all of the columns again.


Filtering Column Data with a WHERE ClauseTo filter source data for an unload job, you can define a WHERE clause for one or more source columns. The Fast Clone Console provides a convenient tool for composing WHERE clauses.


2. In the Table list, select the table for which you want to define a WHERE clause for filtering column data.

All of the table columns appear in the Table Columns box.

3. Right-click a column row for which to add a condtition in a WHERE clause and click the button in the Column Condition column.

The Condition definition for table.'column_name' dialog box appears, as shown in the following image:

4. In the Operator list, select one of the following comparison operators: =, <>, <, >, >=, <=, IS NULL, IS NOT NULL, LIKE, or BETWEEN.

The operators that are available depend on the column datatype.

5. In the Value field, enter a value to use with the operator.

If the operator requires one value, leave the second field blank. For example, with the greater than (>) operator, you could enter a single numeric value of 1.

6. If you want to add another condition for the same column, select the OR check box. Then define another condition for the current column.

Note: If you clear the OR check box, the Fast Clone Console removes any conditions that follow the first one.

7. If you want to add a condition for another column, select one of the following options in the Operator for next column list to determine if the conditions for one or both columns must be true to filter data. Options are:

• OR. The condition for one or more columns in the WHERE clause must be true.

• AND. The conditions for all of columns in the WHERE clause must be true.

Then define the conditions for the next column.

8. Click OK.

9. On the Source Columns tab, verify the WHERE clause that you defined for the table columns by clicking Show Where.

The Fast Clone Console includes all of the conditions that you defined for the table columns to the WHERE clause.

Tip: To remove the WHERE clauses, click Reset Where.

Customizing Unload Processing for Table Columns 35

Specifying a Custom Format for a Source ColumnYou can customize the date and timestamp formats that Fast Clone uses to unload data from source DATE and TIMESTAMP columns. You can also change the precision for NUMBER columns, add padding, and replace null values with another value.

1. On the Source Columns tab, select a table for which to customize the column format.

All of the table columns appear in the Table Columns area.

2. Click a source column row for which to customize formatting, and then click the button in the Custom Formatting column.

The Set custom formatting for Column:'column_name' dialog box appears, as shown in the following image:

The fields that appear in this dialog box vary depending on the column datatype.

3. If the selected column is a DATE or TIMESTAMP column, optionally define a custom date or timestamp format that overrides the format that is specified on the Runtime Settings tab > Format Settings view. To define a custom date or timestamp format, complete the following steps:

a. Click the button next to the Date format or Timestamp format field.

Alternatively, on the menu bar, click Data > Manage Custom Formats and select Custom Date Formats or Custom Timestamp Formats.

The Edit Custom Formatting for Date or Edit Custom Formatting for Timestamp dialog box appears, as shown in the following image:

b. Click Add to add a custom date or timestamp format.

Tip: To edit a previously defined format, click Edit on the custom format row. To delete previously defined format, click Delete on the custom format row.


c. Enter a custom date or timestamp format and click Save.

Note: The length of a custom format must not exceed 99 characters.

The Set custom formatting for Column:'column_name' dialog box appears. To use the new custom format, select it in the Date format or Timestamp format field. These values override formats that you defined on the Runtime Settings tab > Format Settings view.

4. Enter other custom format settings for the selected column.

The following table describes the other fields that might be available depending on the column datatype:

Field Description

Padding The amount of padding to use for the column data.

Number precision The precision to use for numeric data. This field is available only for columns with the NUMBER datatype.This precision overrides the precision that you defined in the Truncate numeric precision integer field on the Runtime Settings tab > Format Settings view.

Use text when NULL A value that replaces null values in the column.Note: If you specify this value, to generate valid output files, you must clear the Suppress trailing null columns option on the Runtime Settings tab > Format Settings view.

Tip: To remove all custom format settings, click Clear.

5. Click OK.

Changing the Order of Columns in Output Data FilesBy default, Fast Clone unloads columns in the order that they were created in the source tables. Optionally, you can change the order in which Fast Clone writes column values to output data files.


2. In the Table list, select the table for which you want to change the order of columns.

3. In the Table Columns list, select a column that you want to reposition in the output data file.

4. Move the column up or down by clicking the Up column or Down column button.

Customizing Unload Processing for Table Columns 37

Defining a TargetTo define a target, enter connection information and set a few additional options. The Fast Clone Console saves this information to the configuration file and uses it to generate load scripts. When you run the Fast Clone executable, it uses the connection information to generate a connection string for the target.

After you enter the target connection information, you can connect to the target database to generate or validate the target schema and to manage target constraints that are used for loading data.

1. Click the Target DB tab.

The following image shows the Target DB tab for an Oracle target:

2. Select the Database option to make the connection fields available.

This option must be selected for Fast Clone to unload data into output files and generate the load scripts for the target database.

3. Enter connection information for the target in one of the following ways:

• Manually enter connection information for the target on the Target DB tab.

• Click From SourceDB to use the source connection information for the target.


The following table describes the connection fields on the Target DB tab:

Field Description Target Types

Database Type Select one of the following target types:- AMAZON REDSHIFT- CLOUDERA- GREENPLUM- HIVE- HORTONWORKS- MS SQL SERVER- NETEZZA- ORACLE- TERADATA- VERTICAThe default value is ORACLE.

All

Version The version of the selected target type. - Cloudera- Hortonworks- Hive

Hadoop version For Hive targets, the version of Hadoop for the Hive database.

Hive

Hostname The host name or IP address of the system where the target instance runs. The default value is the local host name.

All

Port The port number that Fast Clone uses to connect to the target. The default value depends on the target type.

All

Username A user ID that has the privileges that Fast Clone requires to write data to the target.Note: For Hive targets, this field name is Hadoop username.

All

Password A valid password for the specified user. - Greenplum- Microsoft SQL

Server- Netezza- Oracle- Teradata- Vertica

Instance An Amazon Redshift, Oracle or Teradata instance name.

- Amazon Redshift

- Oracle- Teradata

Defining a Target 39


Database The target name.To select the target name from the Database list, you must first connect to the target.To manually enter the database name without connecting to the target, select the Allow entering owner as free text instead of retrieving list from database option.Note: Hive, Netezza, and Vertica targets do not have the option to manually enter a database name.

- Greenplum- Hive- Microsoft SQL

Server- Netezza- Teradata- Vertica

Location For Hive targets, the location of flat files that store table data on the Hadoop Distributed File System (HDFS) for a selected Hive database. The Fast Clone Console gets this location from the Hive target.

Hive

Owner The target owner name.To select the owner name from the Owner list, you must first connect to the target.To manually enter the owner name without connecting to the target, select the Allow entering owner as free text instead of retrieving list from database option.

Oracle

Path The directory in a distributed file system for the output files. Click Select to browse to the directory, or enter the path to the directory.Important: The user name that you specify for connecting to the target must have write permissions on this directory.

- Cloudera- Hortonworks

SSL Select this option to configure Fast Clone to use the TCP/IP protocol with the Secure Sockets Layer (SSL), also called the TCPS protocol, to connect to a target.Important: If you use TCPS connections between the Fast Clone Console and remote Oracle databases, you must install the Oracle Instant Client on the Fast Clone Console system.

- Amazon Redshift

- Greenplum- Oracle

Schema A Greenplum schema name.To select the owner name from the Schema list, you must first connect to the target.To manually enter the schema name without connecting to the target, select the Allow entering owner as free text instead of retrieving list from database option.

Greenplum

OS Authentication in Informatica Fast Clone Console

Select this option to configure Fast Clone to connect to a target by using the operating system user.

- Microsoft SQL Server

- Oracle

Connect Direct in Informatica Fast Clone Console

Select this option to connect to the target directly without using a database instance.

- Microsoft SQL Server

- Oracle



Allow entering owner as free text instead of retrieving list from database

If you do not connect to the target to enter a database, owner, or schema name from a list, select this option to manually enter this information.

- Cloudera- Greenplum- Hortonworks- Microsoft SQL

Server- Oracle- Teradata

RAC connect support Select this option if you load data to an Oracle database in a RAC environment. To connect to the Oracle RAC, the Fast Clone Console uses an instance name that is specified in the Instance field and the Fast Clone executable uses a service name that is specified in the Service field.

Oracle in a RAC environment

Use custom url Select this option to configure the Fast Clone executable and Fast Clone Console to use different connection settings. In this case, Fast Clone uses the following connection settings to connect to the target:- The Fast Clone executable uses the connection

information that you provide under Connection Details to generate a connection string.

- The Fast Clone Console uses the connection string that you enter in the JDBC custom URL field.

All

JDBC custom URL If you selected the Use custom url option, specifies a connection URL for the JDBC driver that the Fast Clone Console uses to connect to an Oracle source. This URL has the following format:

jdbc:oracle:thin@[host][:port]:SIDFor the SSL connection to an Oracle target, use the following format:

jdbc:oracle:oci@[host][:port]:SIDIf you enter an SSL connection URL, clear the SSL option.Important: JDBC custom connection URLs always use the values that are specified in the Username and Password fields. Ensure that any JDBC custom connection URL that you specify does not include a user name and a password.

All

Service An Oracle service name that is defined in a SERVICE_NAME environment variable.

Oracle in a RAC environment

Note: If you do not enter the target connection information, the Fast Clone Console tries to use values that you entered for the source database on the Source DB tab.

4. Click Connect.

Verify that the target connection is successful.

If you did not manually enter the database, owner, and schema, which are required for some target types, you must enter this information at this point.

Note: Fast Clone does not require you to connect to the target to save the target connection information to the configuration file.

Defining a Target 41

Defining Amazon Redshift TargetsYou can configure Fast Clone to unload source data to Amazon Redshift targets. Fast Clone does not generate a load script for the target load utility. A load script is not needed because Fast Clone connects to and writes source data to the Amazon Simple Storage Service (Amazon S3).

1. On the Target DB tab, select AMAZON REDSHIFT in the Database Type list.

2. Enter the host, port number, database, user ID, and password to use for connecting to the Amazon Redshift database.

3. On the Runtime Settings tab > Amazon S3 Load Settings view, enter Amazon S3 information in the following fields:

• S3 file size (KB)

• S3 bucket name

• S3 key ID

• S3 path

• S3 secret key

Note: You must complete all of these fields.

Related Topics:• “Amazon S3 Load Settings” on page 62

Defining ASCII Flat Files as TargetsYou can configure Fast Clone to unload data into ASCII flat files. In this case, Fast Clone does not generate a load script for the target database load utility.

1. Click Target DB tab.

2. Select the No Target, only FLAT FILE or PIPE as output option.

When you run data unload jobs, Fast Clone unloads data into ASCII flat files in the output directory.

Defining Hadoop TargetsYou can configure Fast Clone to unload source data to output files on a Cloudera or Hortonworks distribution of Hadoop or to a Hive data warehouse target. With these Hadoop target types, Fast Clone does not generate a load script for a target load utility.

1. On the Target DB tab, select CLOUDERA, HIVE, or HORTONWORKS in the Database Type list.

2. Enter the version, host name, port number, and user name for connecting to the target.

3. To specify the directory in the distributed file system for the Cloudera or Hortonworks output files, click Select and then either browse to the directory or enter the path to the directory in the Path field.

The user name that you specify for connecting to the Hadoop target must have write permissions on this directory.

4. Click Connect.

The Fast Clone Console connects to the target.

Important: The Fast Clone Console requires the 64-bit Java Runtime Environment (JRE) 1.8 to connect to these Hadoop targets.


5. For Hive targets, select a database in the Database list.

The Fast Clone Console uses the definition of the selected database to determine the location of flat files that store table data on the HDFS.

Before you unload source data to a Hive target, enter the field delimiter that the Hive target uses in data files in the Column separator field on the Runtime Settings tab > Format Settings view. By default, Hive data warehouses use the ASCII start of heading (SOH) character as the field delimiter.

Configuring Runtime SettingsOn the Runtime Settings tab, you can configure runtime settings for data-cloning jobs.

Fast Clone provides the following types of runtime settings:

• Formatting options for output files

• Rules for replacing character sequences in the source data that the target load utility does not support with other character sequences

• Separator and delimiter characters in the data to be loaded

• Use of parallel processing for direct path unload and conventional path unload processing

• Names or naming criteria for the configuration file, data files, unload scripts, and load scripts

• Path to the output directory

• Criteria for splitting large output files into chunks

• Options for generating loader input only or for unloading data without generating loader input

• Miscellaneous options, such as those for performing dirty reads, writing data to pipes, encrypting passwords in the configuration file, and compressing data during unload processing

• Remote Fast Clone Server options

• Load settings for the target load utility

• Settings for integration with Informatica Data Replication

The Fast Clone Console saves these settings to the configuration file, except for the configuration file name and the path to the unload script.

Configuring Runtime Settings 43

Specifying Output File Format Settings for the LoaderYou can configure format settings for the output files to conform with the requirements of your target load utility. The load utility can then accurately load data to the target database.

1. Click the Runtime Settings tab > Format Settings view.


2. Enter information in any of the fields, or accept the default values.


The following table describes the fields in the Format Settings view:

Field Description Default

Column separator A character or character sequence that Fast Clone uses to separate columns in the output data files. Specify a character or character sequence that does not appear in the source data. Otherwise, the target load utility cannot correctly parse the data files. For more information, see “Selecting Column and Row Separator and Enclosure Character to Use for Data Loading” on page 49.Fast Clone allows you to enter column separators that use the \xhh escape sequence in hexadecimal notation.Also, you can use the following non-printable ASCII characters:- \a (alert)- \b (backspace)- \n (new line)- \r (carriage return)- \t (horizontal tab)- \v (vertical tab)If you use non-printable characters as column separators, ensure that the target load utility supports these separators. For more information, see the target database documentation.

- ASCII start of heading (SOH) for Hive targets

- Semicolon (;) for other targets

Record separator A character or character sequence that Fast Clone uses to separate table rows in the output data files. Specify a character or character sequence that does not appear in the source data. Otherwise, the target load utility cannot correctly parse the data files. For more information, see “Selecting Column and Row Separator and Enclosure Character to Use for Data Loading” on page 49.Fast Clone allows you to enter row separators that use the \xhh escape sequence in hexadecimal notation.Also, you can use the following non-printable ASCII characters:- \a (alert)- \b (backspace)- \n (new line)- \r (carriage return)- \t (horizontal tab)- \v (vertical tab)If you use non-printable characters in row separators, ensure that the target load utility supports these separators. For more information, see target database documentation.

\n

Enclosed by A character or character sequence that Fast Clone uses to enclose character data in the output data files. Specify a character or character sequence that does not appear in the source data. Otherwise, the target load utility cannot correctly parse the data files.

Double-quotation mark (“)

Enclose numbers Indicates whether to enclose numeric data with the delimiter value that you specified in the Enclosed by field.

Not selected



Enclose dates Indicates whether to enclose DATE and TIMESTAMP data with the delimiter value that you specified in the Enclosed by field.

Not selected

Escape enclosure An escape character that Fast Clone inserts before the value that is specified in the Enclosed by field whenever that value appears in data in text columns. If you enter an escape character, Fast Clone truncates the Enclosed by value to a single character.

Backslash (\)

Replace new line with space Indicates whether to replace the line break character (\n) with the space character in unloaded data. Select this option if you use the line break character as the Record Separator value.

Not selected

Replace any char with Click this button to define rules for replacing particular character sequences in the source data with another character sequence. For more information, see “Defining Rules for Replacing Characters in Data” on page 48.

-

Pad with blanks (fixed length)

Select this option to unload data to output data files that use fixed-length record format. If the number of bytes of field data is less than the field length, Fast Clone pads the field on the right with the space characters.

Not selected

Convert CLOBs from UTF16 into UTF8

Select this option to convert character data in source CLOB columns from UTF-16 encoding to UTF-8 encoding when loading the data to output data files. If you do not use UTF-16 encoding for source character data, clear this option to avoid the overhead of character set conversion.

Not selected

Convert numbers into packed decimal COMP3

Indicates whether to write numeric data to the output files as COMP-3 values. Select this option if you need to read the output file with COBOL or another language that has built-in COMP-3 support.

Not selected

COMP3 implied decimal The COMP-3 implied decimal value. 0

Convert output ASCII into EBCDIC

Select this option if you need the output to be in EBCDIC format.

Not selected

Truncate numeric precision integer

The number of significant digits that Fast Clone uses to round down the fractional part of numeric values in the output. Valid values are from 1 to 36.You can override this value for individual source numeric columns. For more information, see “Specifying a Custom Format for a Source Column” on page 36.

No default



Unload binary into separate files

Indicates whether to write data from BLOB, CLOB, NCLOB, LONG, and LONG RAW source columns to separate files. In the data files, Fast Clone includes the file name instead of the source value.

- Not selected for Netezza targets that use DataStreamer and Hive targets

- Selected for other targets

Trailing column separator Indicates whether to append a column separator after the last column in each table row in the output data files.Note: For Oracle targets, if you unload data from binary columns to separate files, Fast Clone does not append the trailing column separator after the last column if the datatype of that column is BLOB, CLOB, or XMLTYPE.

Selected

Suppress trailing null columns

Indicates whether to omit the last column value in the output data file if the column contains a null value.Note: If you use DataStreamer to load Amazon Redshift, Greenplum, Netezza, Teradata, or Vertica targets, clear this option.

Selected

Timestamp format The format to use for timestamp values from TIMESTAMP source columns. For more information about valid TIMESTAMP formats, see the Oracle documentation.You can override this format for individual TIMESTAMP columns. For more information, see “Specifying a Custom Format for a Source Column” on page 36.

YYYY-MM-DD HH24:MI:SS.FF6

Date format The format to use for date values from DATE source columns. For more information about valid DATE formats, see the Oracle documentation.You can override this format for individual DATE columns. For more information, see “Specifying a Custom Format for a Source Column” on page 36.

YYYY-MM-DD HH24:MI:SS

Set local time zone Select this option to unload values from columns that have the TIMESTAMP WITH LOCAL TIMEZONE datatype. Then enter your local time zone offset.

Not selected


Defining Rules for Replacing Characters in DataYou can define a rule to replace a character in the source data with another character. If the source data includes characters that the target load utility does not support, define a replacement rule for each unsupported character.

1. On the Runtime Settings tab > Format Settings view, click Replace any char with.

Alternatively, click Data > Manage replacements on fly on the menu bar.

The Replace characters dialog box appears:

2. Click Add to add a replacement rule.

The New Character Replacement dialog box appears:

3. In the Replace from field, enter the character that you want to replace. In the Replace with field, enter the new character. Then click Save.

Tip: To remove all occurrences of the Replace from character from the incoming source data, leave the Replace with field empty.

In character replacement rules, you can use hexadecimal notation. You must use hexadecimal notation to replace a backslash (\) character. Also, you can use the following non-printable ASCII characters:

• \a (alert)

• \b (backspace)

• \n (new line)

• \r (carriage return)

• \t (horizontal tab)

• \v (vertical tab)

Restriction: You cannot use multibyte characters in character replacement rules.

4. In the Replace characters dialog box, click Ok to save all of the replacement rules.

Tip: To edit a replacement rule, click Edit next to the rule. To delete a replacement rule, click Delete next to the rule.


Selecting Column and Row Separator and Enclosure Character to Use for Data LoadingTo perform accurate data load to a target database, select a valid column and row separator and enclosure character that does not appear in the source data. The Fast Clone Console provides a convenient validation tool that you can use to select valid values for column and row separator and enclosure character using sampling.

The validation tool checks table data in the sample row set and reports an error if a control character sequence appears in the source data.

Validation of the source data is a time-consuming operation that is comparable to the data unload operation. Use the validation tool for small source tables. For large tables, select a row and column separator and enclosure character sequence that includes multiple characters and is unlikely to appear in the source data.

1. Click the Runtime Settings tab > Format Settings view.

2. Start the validation tool.

• For the column separator, click the button to the right of the Column separator field.

• For the row separator, click the button to the right of the Record separator field.

• For the enclosure character, click the button to the right of the Enclosed by field.

The dialog box of the validation tool appears, as shown in the following image:

3. In the list, select a source table that you want to validate.

4. In the Select Rows Count field, enter number of rows in the source table to validate.

The default value is 100 rows.

5. Click Validate and review the validation results.

If the Fast Clone Console reports a validation error, either select another value for the row or column separator or enclosure character, or define a replacement rule for the specified character value.


Configuring Parallel Processing for Direct Path Unload JobsWhen cloning large amounts of data with the direct path unload method, you can improve performance by configuring parallel processing on multiple threads.

With the direct path unload method, Fast Clone uses two threads per core or CPU to unload data from an Oracle source. If the source system has multiple CPUs, Fast Clone can use parallel processing to unload data from source tables.

1. Click the Runtime Settings tab > Parallel Execution view.


2. Enter information in the following fields:

Threads number

The maximum number of tables or partitions from which data can be unloaded in parallel. The default value is 1.

Threads per segment

The maximum number of threads that can be used to unload data from a single table or partition. If you use multiple threads for a single table, CPU usage might increase. Usually, multiple threads are used to unload large tables. The default value is 1.

To get the total number of unload threads, multiply these field values. For the direct path unload method, calculate the appropriate number of concurrent threads as follows:

number of CPUs or cores x 2If I/O bandwidth or the number of allocated CPUs is restricted, enter lower field values that are compatible with the appropriate concurrency for the system.

If you need to unload data from many tables of similar size, set Threads number to the appropriate concurrency and set Threads per segment to 1.


If you need to unload data from a few large tables, increase the Threads per segment value and set Threads number to the number that you calculate as follows:

(number of CPUs or cores x 2)÷ number of threads per table

Specifying Fast Clone File Names and LocationsOptionally, you can override the default configuration file name, output directory, load script name, unload script location, output file naming pattern and extension, and control file naming pattern. You can also set options that control how the output files are generated and whether Fast Clone generates the load script.

1. Click the Runtime Settings tab > File Locations view.


2. Enter information in any of the fields.


The following table describes the fields in the File Locations view:

Field Description

Config file name The path and file name for the cloning configuration file. The path can be the full path or a path that is relative to the FastClone_installation directory. The Fast Clone Console generates this file when you save the configuration settings. When you run data unload jobs, the Fast Clone Console uses the specified configuration file. The default name is FastClone_installation\unload.ini.

Output directory The full path to the directory to which Fast Clone writes the generated output files. The output directory can be on a local file system or on a network mapped share, such as NFS or Samba. If you do not specify an output directory, Fast Clone writes the output files to the target_type_output subdirectory in the current working directory.Tip: Specify a different directory for each type of target database.

Output data files extension

A file name extension for the output data files. For each unloaded source table, Fast Clone generates a data file that has a file name in the format table_name.extension. The default extension is dat.

Load script base name

The load script name.By default, Fast Clone uses the source schema name as the load script name. To use another name, clear the set default option and enter a script name.On Windows, Fast Clone generates load script files with the .cmd extension. On Linux and UNIX, Fast Clone generates load script files with the .sh extension.

Naming pattern for data files

Specifies the naming pattern that Fast Clone uses to create the base name of the output data files.In the naming pattern, you can include the following variables:- %&schema&% - Represents a default schema name for the user that Fast Clone uses to

connect to the source database.- %&source_schema&% - Represents a source schema name.- %&target_schema&% - Represents a target schema name.- %&table&% - Represents a table name.- %&table_counter&% - Represents a sequence number of a table.- %&partition&% - Represents a table partition name. This partition variable is required if

you specify partitions in the [SOURCE_TABLES] or [SOURCE_INDIRECT_TABLES] sections of the configuration file.

- %&counter&% - Represents a sequence number of a data file for a table. This counter variable is required if Fast Clone splits the output data files based on the mb_per_file and split_files_by_row_count parameters.

- %&yyyy&% - Represents a year.- %&mm&% - Represents a month.- %&dd&% - Represents a day.If you do not specify the naming pattern for output data files, Fast Clone uses the table name as the name for output data files.


Field Description

Naming pattern for control files

The naming pattern that Fast Clone uses to generate names for control files, log files, and SQL scripts.In the naming pattern, you can include the following variables:- %&schema&% - Represents a default schema name for the user that Fast Clone uses to

connect to the source database.- %&source_schema&% - Represents a source schema name.- %&target_schema&% - Represents a target schema name.- %&table&% - Represents a table name.- %&table_counter&% - Represents a sequence number of a table.- %&partition&% - Represents a partition name.

The partition variable is required if you specify partitions in the [SOURCE_TABLES] or [SOURCE_INDIRECT_TABLES] sections of the configuration file.

- %&counter&% - Represents the sequence number of a control file for a table. This variable value is always 1.

- %&yyyy&% - Represents a year.- %&mm&% - Represents a month.- %&dd&% - Represents a day.If you do not specify the naming pattern for control files, Fast Clone uses the table name as the name for control files, log files, and SQL scripts.

Informatica Fast Clone Console start script

The path and file name for the unload script that runs data unload jobs. The path can be the full path or a path that is relative to the FastClone_installation directory. By default, Fast Clone uses the FastClone_installation\unload.cmd script on Windows and the FastClone_installation\unload.sh script on Linux and UNIX.

Generate loader input only, do not unload data

Select this option to generate the load script but not unload data from the source. Use this option to prepare load scripts when unloading data to pipes.By default, this option is not selected.Note: Fast Clone does not generate load scripts and control files for flat file, Hadoop, and Hive targets.

Unload data only, do not generate loader input

After you generate the load script, select this option to unload data to pipes without regenerating the load script.By default, this option is not selected.

Split output files into chunks, By file size, in megabytes

Select this option to split output into multiple data files based on output chunk size. To specify the maximum amount of output in each file, in megabytes, either select a value from the list or type an appropriate value. If you select the default value of NONE, Fast Clone unloads data to a single data file for each table.

Split output files into chunks, By number of rows

If you use the direct path unload method, select this option to split output into multiple data files based on a fixed number of rows that each file can contain. Then enter the fixed number of rows.Note: For parallel unload processing, splitting output into multiple data files based on a number of rows is slower than splitting output based on a chunk size. This performance degradation occurs because of locks between the parallel threads.


Configuring Miscellaneous ConditionsOptionally, you can set miscellaneous runtime parameters for data cloning. These parameters provide some important capabilities, such as writing output to pipes, encrypting passwords, compressing data, and configuring advanced settings for the Fast Clone executable.

1. Click the Runtime Settings tab > Miscellaneous Conditions view.


2. Enter information in any of the fields or accept the default values.


The following table describes the fields in the Miscellaneous Conditions view:


Dirty read If you use the direct path unload method, clear this option to lock the source tables for unload processing. When the tables are locked, Fast Clone can perform a consistent read.By default, this option is selected and Fast Clone does not lock the source tables during unload processing. Fast Clone bypasses this Oracle consistency mechanism. However, Fast Clone preserves block-level consistency because block I/O operations are atomic.

Selected

Consistent lock on table set If you use the direct path unload method and cleared the Dirty read option, select this option to lock all of the selected source tables in a table set before unloading the source data and to unlock these tables after unloading data from all of them. By default, this option is not selected and Fast Clone holds a lock on each source table only while unloading data from the table.

Not selected

Export by partitions when all selected

If you unload data from partitioned source tables, indicates whether to write the data from each table partition to a separate output data file.On the Source Table Partitions tab, you can override this setting for individual source tables.Note: To generate an output data file for each table partition, select this option. Then, on the Source Table Partitions tab, select each partitioned source table and verify that the Export table by partition check box is selected.

Not selected

Create output as pipes instead files

Select this option to write output to a pipe instead of to a file system. Use this parameter for integration with other systems or scripts. Fast Clone creates a pipe file for each table prior to unloading table data.Note: If you created a pipe file manually before unloading data, clear this option to have Fast Clone use the existing pipe file.

Not selected

Use SID instead of service name

Select this option to use the Oracle SID instead of the SERVICE_NAME when you cannot connect to the source instance by using the instance name.

Not selected

Skip unreadable blocks If you use the direct path unload method, select this option to skip blocks that Fast Clone cannot read from Oracle data files and continue unload processing.By default, this option is not selected. When Fast Clone encounters a block it cannot read, it ends with an error and stops unload processing.

Not selected

Print column names If you unload data to an ASCII flat file, select this option to have Fast Clone write column names in the first row of the output files.By default, this option is not selected and Fast Clone does not write column names in the first row.

Not selected



Encrypt database passwords Indicates whether to encrypt database passwords in configuration files. Fast Clone applies the XOR operator to a password value by using the encryption key. Fast Clone then encrypts the resulting value by using the Base64 encryption schema.By default, this option is selected and database passwords are encrypted. Clear this option if you do not want to use encrypted passwords.

Selected

Extract long raw as HEX For Oracle targets, select this option to unload BLOB and LONG RAW source data in hexadecimal format. For other target types, clear this option to avoid generating incorrect control files.

Not selected

Direct select from sys Indicates whether to use SYS-owned dictionary tables, such as sys.obj$ and sys.tab$, to get Oracle extent boundaries. By default, Fast Clone uses SYS-owned dictionary tables to get extent boundaries. If you clear this option, Fast Clone uses the dba_extents and dba_segments views to get extent boundaries.Note: For Oracle sources that use locally managed tablespaces, selecting data from the dba_extents view is slow. If the dictionary is partially analyzed and the optimizer mode is not RULE, selecting data from the dba_extent view might be very slow. Informatica recommends that you grant SELECT ANY DICTIONARY permissions to the user and use the default value for this option.

Selected

Read SECUREFILE by direct For Oracle 11g and 12c sources, indicates whether to use the direct path unload method to unload data from SecureFile LOB columns. For Oracle 12c sources, indicates whether to use the direct path unload method to unload data from extended NVARCHAR2, RAW, and VARCHAR2 columns. Clear this option to use the conventional path unload method to unload data from SecureFile LOB and extended datatype columns.

Selected

Character Set If you unload data to a Teradata target by using the direct path unload method, indicates whether to convert the source character data to UTF-8. The default value is NONE, which causes Fast Clone to preserve the source database encoding.

NONE

Read OCFS Direct For Oracle RAC sources on an Oracle Cluster File System (OCFS) on Linux or UNIX, indicates whether Fast Clone uses the direct I/O mode to read Oracle data files. The direct I/O mode minimizes the effects of the cache on reading data from Oracle data files. Select this option if the Oracle data files are located on an OCFS.

Not selected



Write OCFS Direct For Oracle RAC sources on an OCFS on Linux or UNIX, indicates whether Fast Clone uses the direct I/O mode to write the output files. The direct I/O mode minimizes the effects of the cache on writing the output files. Select this option if you configured Fast Clone to generate the output files on an OCFS.

Not selected

Compress data at unload time Indicates whether to compress the output data files during unload processing to reduce the file size.

Not selected

Data compression If you selected Compress data at unload time, select the compression method that Fast Clone uses to compress data. Options are:- GZIP. Provides a better compression ratio but is slower.- QZIP. Provides a lower compression ratio but is faster.

GZIP

GZIP compression level If you selected GZIP as the data compression method, enter the compression level. Valid values are 1 to 9. Higher values provide faster compression with a lower compression ratio. Lower values provide slower compression with a higher compression ratio.

1

Compress network traffic Select this option to compress network traffic between source and target servers when creating Fast Clone files on a remote server and using a Fast Clone Server.

Selected

Create external table script For Oracle targets, select this option to generate an SQL script for creating Oracle external tables based on the output data files.

Not selected

Create external table log file For Oracle targets, if you selected the Create external table script option, indicate whether to use a log file. If you select this option, the external table utility logs messages to this file when accessing data in the data file.

Not selected

External table name includes partition

For Oracle targets, if you selected the Create external table script option, indicate whether the Oracle external table names include partition names in the output SQL script. Select this option to append partition names to external table names.

Not selected

Show progress in increments The number of rows that determines the frequency at which Fast Clone updates progress information when a data unload job is running. Each time Fast Clone finishes processing the specified number of rows, Fast Clone updates progress information.

100000



Perform checkpoint every The number of tables that determines the frequency with which Fast Clone performs checkpoints on the source database. Each time Fast Clone completes unloading the specified number of tables, Fast Clone performs a checkpoint on the source.Frequent checkpoints can increase the overhead of unload processing on the source system and might degrade unload performance.With the default value of 0, Fast Clone performs a checkpoint only prior to starting the data unload job.

0

Limit result set to Select this option to specify the maximum number of rows that Fast Clone can unload from each table. Valid values are -1 to 1073741823.By default, this option is not selected and Fast Clone uses the value of -1 to unload all of the rows from each table.

Not selected

Tuning Conventional Path Unload ProcessingIf you use the conventional path unload method, you can configure parameters for parallel unload processing and fetching LOB data to improve performance.

1. Click the Runtime Settings tab > Conventional Path view.



2. To configure parallel processing, complete the following fields:

Field Description

Threads number for unload statements

The maximum number of tables or partitions from which data can be unloaded in parallel. The default value is 1.

Threads per segment for unload statements

The maximum number of threads that Fast Clone can use to unload data from a single table or partition. If you use multiple threads for a single table, CPU usage might increase. Usually, multiple threads are used to unload data from large tables. The default value is 1.

Use direct file access to fetch extent map

Select this option to have Fast Clone try to read extent information for parallel unload processing directly from a data file. By default, this option is selected.

With the conventional path unload method, Fast Clone uses one thread per core or CPU to unload data from an Oracle source. If the system has multiple CPUs, Fast Clone can perform parallel unload processing for the source tables.

To determine the total number of unload threads to use, multiply the number of tables from which data is unloaded in parallel by the number of threads for each table. For conventional path unload, the appropriate number of concurrent threads is equivalent to the number of cores or CPUs. If I/O bandwidth or the number of allocated CPUs is restricted, you might need to use lower the value.

If you need to unload data from many tables of similar size, set Threads number for unload statements to the appropriate concurrency and set Threads per segment for unload statements to 1.

If you need to unload data from a few large tables, increase the Threads per segment for unload statements value and set Threads number for unload statements to the number that you calculate as follows:

number of cores or CPUs ÷ number of threads per table3. To configure settings for fetching LOB data, complete the fields described in the following table:

Field Description

Fetch LOBs as single chunk

Select this option to unload a single chunk of LOB data that has the size specified in the Fetch LOBs as single chunk, in KB field. By default, this option is not selected and Fast Clone fetches all LOB data in multiple chunks.

Fetch chunk size for LOBs, in KB

To unload all of the LOB data in chunks of a consistent size, enter the size, in kilobytes, to use for each chunk. Default is 128 KB.

Fetch LOBs as a single chunk, in KB

If you selected Fetch LOBs as a single chunk, enter the maximum size, in kilobytes, of the single chunk.

Fetch array size The number of rows in an array that Fast Clone unloads from the database during each round trip for conventional path unload processing. Default is 500.


Configuring Fast Clone Server TasksIn a distributed Fast Clone environment, you can configure a Fast Clone Server to accept output files from another Fast Clone system or to run unload jobs that are initiated on a standalone Fast Clone Console system.

1. Click the Runtime Settings > Remote Server view.


2. To configure a Fast Clone Server on a target system to accept output files from another Fast Clone system, complete the following steps:

a. Ensure that the Fast Clone Server is running on the target system.

b. Select the Send output to remote Informatica Fast Clone Server option.

c. In the adjacent text box, enter the host name or IP address of the Fast Clone Server system followed by the port number. Use the format host:port.

3. To configure a Fast Clone Server on the source system to accept unload requests from a remote Fast Clone Console system and run unload jobs on the source system, complete the following steps:

a. Ensure that the Fast Clone Server is running on the source system.

b. Enter connection information for the Fast Clone Server system in the Initiate unload on remote Fast Clone Server field. Enter the host name or IP address followed by the port number. Use the format host:port.

Related Topics:• “Transmitting the Output Files to the Target” on page 89

Configuring Integration with Informatica Data ReplicationIf you have the Informatica Data Replication product for change data replication, you can enable Fast Clone to use the Data Replication configuration file to perform an initial load of a data replication target. In the Fast Clone Console, enable integration with Data Replication and specify the Data Replication XML configuration file.

Ensure that the mapped source columns in the Data Replication configuration file have datatypes that Fast Clone supports for unload processing.

To integrate Fast Clone with Data Replication, the Fast Clone Console requires source and target connection information. Fast Clone uses this information to generate a minimal .ini configuration file for the data-cloning job. When you run the Fast Clone cloning job, the Fast Clone executable reads the connection information


from the generated cloning configuration file and all of the other settings from the Data Replication XML configuration file.

After Fast Clone loads data to a target, it updates the SCN value in the Data Replication XML configuration file. When you use this configuration file for change data replication, Data Replication uses the updated SCN to replicate the changes that occur after the initial load of the target.

Note: Fast Clone does not support Data Replication configurations that include virtual columns.

1. On the Source DB tab, enter connection information for the source database and click Connect to connect to the source.

2. On the Target DB tab, enter connection information for the target database.

If you want to validate the target connection information, connect to the target database. However, Fast Clone does not require that you connect to the target at this time.

3. Click the Runtime Settings tab > Informatica Data Replication Integration view.


4. To integrate Fast Clone with Data Replication, complete the fields described in the following table:


Integrate Fast Clone with Data Replication

Select this option to enable Fast Clone to use a Data Replication configuration file for a data-cloning job.

Not selected

Data Replication configuration file

The path to the Data Replication XML configuration file relative to the DBSYNC_HOME directory. You can click the Browse button to the right of the field to browse to the XML configuration file.Ensure that the DBSYNC_HOME environment variable is defined.

No default



Data Replication schema xsd file

The path to the schema file, config.xsd, for the Data Replication configuration file relative to the DBSYNC_HOME directory. The XML structure of the Data Replication configuration file must be consistent with the schema in this schema file.

No default

Wait until all connected users exit

Indicates whether Fast Clone waits for all of the user sessions on the mapped source tables to end before beginning unload processing. If you clear this option, Fast Clone ends the active user sessions on the mapped tables during unload processing.

Selected

Update SCN after completion in the Data Replication configuration file

Indicates whether Fast Clone updates the SCN value in the Data Replication configuration file after loading data to the target to set a start point for change data replication. When you run a replication job with the configuration, Data Replication replicates only the changes that have SCN values larger than the one that Fast Clone set in the configuration file.

Selected

Data unload method The unload method that Fast Clone uses to unload data with the specified Data Replication configuration file. Options are:- Direct. Use the direct path unload method.- Conventional path. Use the conventional path unload method.

Direct

Configuring Load Settings for TargetsOn the Runtime Settings tab > Load Settings view, you can configure load settings that are specific to the target type.

The Load Settings view is displayed for most target types. It appears after you define the target on the Target DB tab.

Amazon S3 Load SettingsOn the Runtime Settings tab > Amazon S3 Load Settings view, configure load settings for Amazon Redshift targets.

The following table describes the fields in this view:

Field Description Parameter Name Default

S3 bucket name Specifies the name of the Amazon S3 bucket that stores the temporary files with the data to be loaded to the Amazon Redshift target.

s3_bucket No default

S3 path Specifies the name of the directory in the Amazon S3 bucket where Fast Clone creates the temporary files that store source data.

s3_path No default

S3 key ID Specifies an AWS access key ID that Fast Clone must use to access the Amazon S3 account resources including the bucket where Fast Clone creates temporary files.

s3_key_id No default



S3 secret key Specifies the secret access key for the access key ID that is specified in the S3 key ID field. The access key ID must have the authority to access the Amazon S3 account resources.

s3_secret_key No default

S3 file size (KB) Specifies the maximum size of the temporary files, in kilobytes, that Fast Clone creates in the Amazon S3 bucket before loading the source data to Amazon Redshift.Default value: 5120 KB

s3_file_size 5120 KB

Related Topics:• “Defining Amazon Redshift Targets” on page 42

Greenplum Load SettingsOn the Runtime Settings tab > Greenplum Load Settings view, configure load settings for Greenplum targets.



Greenplum loader home When you use output files or pipes to load data, enter the path to the Greenplum loader root directory that Fast Clone uses to locate the gpload utility and generate valid load scripts.

greenplum_loader_home No default

Enable GreenPlum gpfdist Direct Data Stream

Indicates whether to use DataStreamer to load data. Select this option to stream data directly to a Greenplum target with the gpfdist utility.

direct_data_stream Not selected

Skip writing the output data into files

If you use DataStreamer, Fast Clone skips writing unloaded data to output files or pipes.If you do not use DataStreamer and need to troubleshoot unload jobs, clear this parameter to write the data to output files.

skip_all_out_file_ops Depends on DataStreamer use

Max errors If you use DataStreamer, enter the maximum number of errors that can occur when streaming data to a Greenplum target. If the number of errors exceeds this maximum value, DataStreamer ends with an error.

stream_maxerrors 100000

Remotely accessed IP or hostname

If you use DataStreamer, enter the host name or IP address of the system where you run the Fast Clone instance that starts the gpfdist utility.

greenplum_gpfdist_current_hostname

Local host name



Listen on specific interface

If you use DataStreamer, enter the IP addresses on which the gpfdist utility listens for incoming requests. Use 0.0.0.0 to listen to all request interfaces.

greenplum_gpfdist_listen_hostname

No default

Listening port If you use DataStreamer, enter the listener port for the gpfdist utility. A Greenplum target uses this port to submit requests to the gpfdist utility.

greenplum_gpfdist_listen_port

8000

Microsoft SQL Server Load SettingsOn the Runtime Settings > SQLServer Load Settings view, you can configure load settings for Microsoft SQL Server targets.



Batch size Specifies the number of rows in a batch that is passed to the osql utility for loading data to the target. The osql utility copies each batch to the target server as one transaction.Fast Clone uses this value as the BATCHSIZE option value in the BULK INSERT statement for the osql utility.

mssql_batchsize 10000

Max errors Specifies the maximum number of syntax errors that can appear in the output data files before the osql utility cancels the BULK INSERT operation. The osql utility ignores all of the rows that it cannot parse and counts an error for each row.Fast Clone uses this value as the MAXERRORS option value in the BULK INSERT statement for the osql utility.

mssql_maxerrors 10000

Column collations

The name of an SQL Server collation that Fast Clone uses to store character and Unicode data in output data files. Fast Clone uses this value to correctly specify a column collation in the format file that maps the fields in the data file to table columns. Fast Clone uses the specified collation for all of the table columns.Fast Clone uses this value as the FORMATFILE option value in the BULK INSERT statement for the osql utility.

textfile_charset_collation SQL_Latin1_General_Cp437_BIN


Netezza Load SettingsOn the Runtime Settings > Netezza Load Settings view, you can configure load settings for Netezza targets.



Enable Netezza Direct Data Stream

Indicates whether to use DataStreamer to load data to a Netezza target. Select this option to stream data directly to the Netezza target by using the Netezza external tables that are represented by the named pipes.

direct_data_stream Not selected

Pipe directory path

If you use DataStreamer to load data to a Netezza target, specifies the directory in which Fast Clone creates named pipes that represent Netezza external tables.On Windows, DataStreamer ignores this parameter because the named pipes are created in the named pipe directory that is mounted under the special path \\.\pipe\.

netezza_pipe_directory The output directory

Error log directory name

If you use DataStreamer to load data to a Netezza target, specifies the directory in which DataStreamer creates the following Netezza log files:- table_name.schema_name.nzlog. This log file

includes load statistics and diagnostic messages that the Netezza ODBC driver issues when loading data to the target tables from the corresponding external tables.

- table_name.schema_name.nzbad. This log file includes the external table rows that DataStreamer could not load to the corresponding target table.

netezza_error_log_directory

The output directory

Max errors If you use DataStreamer, the maximum number of errors that can occur when streaming data to a Netezza target. If the number of errors exceeds this maximum, DataStreamer ends with an error.

stream_maxerrors 100000



Quoted value If you use DataStreamer to load data to a Netezza target, specifies the value of the QuotedValue option for the Netezza external tables. DataStreamer requires this option to build correct Insert statements when loading data to the target tables from the corresponding external tables. Options are:- single. Indicates that the data values in the

external tables are enclosed with single quotation marks.

- double. Indicates that the data values in the external tables are enclosed with double quotation marks.

- no. Indicates that the data values in the external tables are not quoted.

Ensure that the quotation character that you specify for the load operation matches the quotation character that you specify for the unload operation. For unload jobs, Fast Clone uses the quotation character that you specify in the Enclosed by field on the Runtime Settings tab > Format Settings view.

netezza_quoted_value no

Control character and carriage returns

If you use DataStreamer to load data to a Netezza target, specifies the value of the EscapeChar option for the Netezza external tables. This option indicates whether the control characters, such as delimiter and backslash characters, are escaped in the data fields of the external tables. The Netezza external tables use a backslash as an escape character.

netezza_control_chars_enabled

Selected

Oracle Load SettingOn the Runtime Settings > Oracle Load Settings view, you can configure a load setting for Oracle targets.

The following table describes the field in this view:


Load data to same partition as source

Select this option to load data from source partitions to Oracle target partitions of the same name. This information is used by the SQL*Loader utility.

Not selected


Teradata Load SettingsOn the Runtime Settings tab > Teradata Load Settings view, you can configure data load settings for Teradata targets.



Teradata Load Utilities in Indicator mode

If you selected the Binary length prefix option, indicates whether to set the indicator bits in the binary output for column values that contain NULL data.Note: This option must be selected for the Use Teradata Parallel Transporter (TPT) and Enable Teradata Direct Data Stream options to be available.

Selected

Use Teradata Parallel Transporter (TPT)

Indicates whether Fast Clone uses Teradata Parallel Transporter (TPT) to load source data to the target database. If you select the Enable Teradata Direct Data Stream option to use DataStreamer, this check box is automatically selected and cannot be edited.

Selected

Loader to be used The Teradata load utility that the Fast Clone generated load script calls to load data to the Teradata target. Options are:- Teradata FastLoad. Use the Teradata FastLoad utility, or use the

TPT Load Operator if the Use Teradata Parallel Transportor (TPT) check box is selected.

- Teradata MultiLoad. Use the Teradata MultiLoad utility, or use the TPT Update Operator if the Use Teradata Parallel Transportor (TPT) check box is selected.

- Teradata Parallel Data Pump. Use the TPT Stream Operator. This option is available only when the Use Teradata Parallel Transporter (TPT) check box is selected

Teradata FastLoad

Load Sessions The number of sessions that the load utility opens to load source data to each target table.

2

Binary length prefix Indicates whether to unload source data in binary format. Each binary-format record includes a 2-byte field that indicates the total length of the record, the optional bytes for null values, and the data. If you select the Enable Teradata Direct Data Stream option to use DataStreamer, this check box is automatically selected and cannot be edited.

Not selected

Binary length prefix endianness

If you selected the Binary length prefix option, specify the endianness of the binary output. This parameter determines the format of the integer that specifies the length of records and fields in the binary files. Options are: LITTLE ENDIAN and BIG ENDIAN.

LITTLE ENDIAN

Enable Teradata Direct Data Stream

Select this option to have DataStreamer call TPT to load source data to the target database. DataStreamer uses the TPT Load Operator, Update Operator, or Stream Operator to load data, depending on the option that is selected in the Loader to be used list.

Not selected

Drop and rebuild tables in Teradata prior to load

Select this option if you want DataStreamer to drop and re-create the Teradata tables before loading source data into the tables.

Not selected



Skip writing the output data into files

If you use DataStreamer, Fast Clone skips writing unloaded data to output files or pipes.If you do not use DataStreamer and need to troubleshoot unload jobs, clear this parameter to write the data to output files.

Depends on DataStreamer use

Max errors If you use DataStreamer, the maximum number of errors that can occur when streaming data to a Teradata target. If the number of errors exceeds this maximum, DataStreamer ends with an error.

100000

Vertica Load SettingsOn the Runtime Settings tab > Vertica Load Settings view, you can configure data load settings for Vertica targets.



Loader to be used Indicates the Vertica command to be used to load data to the Vertica target. Options are:- Client side. Fast Clone uses the LCOPY command to load data from

the data files that are located on the target database server.- Server side. Fast Clone uses the COPY command to load data from

the data files that are located on the system where you run the target load utility.

Client side

Binary length prefix (native)

Controls whether the data in Fast Clone data files is loaded into Vertica targets in NATIVE VARCHAR or text format.If you select this option, Fast Clone adds the NATIVE VARCHAR keyword to the COPY or LCOPY command in the output SQL load script. The database server converts CHAR or VARCHAR fields in the data files to Vertica datatypes when loading data.By default, this option is not selected and Fast Clone loads data in text format with delimiters.

Not selected

Binary length prefix endianness

The endianness of the data in the data files. Options are LITTLE ENDIAN and BIG ENDIAN.This field is available only when the Binary length prefix option is selected.

LITTLE ENDIAN

Enable DataStream Indicates whether to use DataStreamer to load data. Select this option to stream data directly to a Vertica target with the LCOPY command on the client side or the COPY command on the server side.

Not selected

Max errors If you use DataStreamer, the maximum number of errors that can occur when streaming data to a Vertica target. If the number of errors exceeds this maximum, DataStreamer ends with an error.

100000

The Vertica COPY and LCOPY commands have a 65 KB limit on the length of NATIVE and NATIVE VARCHAR data. If Vertica encounters a value larger than 65 KB, it rejects the row, even if the ENFORCELENGTH option for the COPY or LCOPY command is disabled.


Configuring Reporting and SNMP TrapsFast Clone can optionally generate reports for unload jobs in XML and use the Simple Network Management Protocol (SNMP) to send asynchronous notifications, or traps, to an SNMP manager.

The reports specify the number of rows that were unloaded and the number of rows that failed to be unloaded, if any, for each source table. Fast Clone generates the XML report files in the output directory at the completion of unload processing.

If you configure Fast Clone to send SNMP traps, you can monitor the SNMP traps by using PowerSNMP Free Manager or a similar product.

1. Click the Monitoring tab. The following image shows this view:

2. In the Job name field, enter a name that will be used as the report file name base and included in the SNMP trap text.

You can override this name for reports.

3. To configure Fast Clone to generate an unload run report, complete the following steps:

a. Select the Create run report option.

b. To override the report name based on the Job name value, select the Manually specify report name option. Then enter a report name.

4. To configure Fast Clone to send SNMP traps to an SNMP manager, complete the following steps:

a. Select the Send Snmp traps option.

Configuring Reporting and SNMP Traps 69

b. Complete the fields described in the following table:

Field Description

Report on start Select this option to send SNMP traps to a SNMP manager when the unload job starts.By default, this option is not selected.

Report on completion

Select this option to send SNMP traps to a SNMP manager when the unload job starts.By default, this option is not selected.

Monitor IP address Enter the IP address of the SNMP manager.No default is provided.

Snmp community Enter a name to use for the Fast Clone SNMP community.The default name is wisdomforce.

Snmp enterprise oid Enter the Fast Clone enterprise object identifier (OID) in SNMP.The default value is 1.3.6.1.2.1.1.1.2.0.1.

Snmp trap oid Enter the Fast Clone SNMP trap OID.The default value is 1.3.6.1.2.1.1.1.2.0.2.

Snmp oid start If you selected Report on start, enter the Fast Clone SNMP trap detail OID for traps that are sent when the unload job starts.The default value is 1.3.6.1.2.1.1.1.2.0.3.

Snmp oid end If you selected Report on completion, enter the Fast Clone SNMP trap detail OID for traps that are sent when the unload job completes.The default value is 1.3.6.1.2.1.1.1.2.0.4.

Saving a Cloning Configuration File LocallyAfter you finish configuring the data-cloning job, save the configuration file.

1. To save the Fast Clone configuration file on the local computer, perform one of the following actions:

• On the menu bar, click File > Save.

• On the toolbar, click the Generate and save configuration file button. The Fast Clone Console generates the configuration file in the current working directory.

Note: To change the directory, specify an Output directory value on the Runtime Settings tab. For more information, see “Configuring Runtime Settings” on page 43.

2. If you selected source columns that are encrypted with Oracle Transparent Data Encyrption (TDE) when creating the configuration file, perform the following steps to open the Oracle wallet:

a. Click Specify PKCS#12 wallet and open the wallet file.

b. Enter a valid password in the PKCS#12 password field.

c. Click Open.


If the direct path unload method is enabled, the Fast Clone Console opens the wallet file, reads the master key IDs and values from the file, and then writes the key IDs and values to the configuration file.

Saving a Cloning Configuration File Locally 71

C h a p t e r 4

Unloading Data from the Source Database


• Unload Overview, 72

• Character Set of the Unloaded Data, 73

• Unloading Data from Views, 74

• Unloading Data from TDE-Encrypted Columns and Tablespaces, 74

• Running Data Unload Jobs, 75

• Unloading Data to Named Pipes, 75

• Unloading Data to Files of Fixed-Length Record Format, 79

• Generating the Data Files in XML Format, 79

Unload OverviewFast Clone can unload data from regular tables, compressed tables, partitioned tables, views, materialized views, index-organized tables, and encrypted tables and tablespaces. For all target types, you can configure Fast Clone to unload data into output files or pipes.

For Greenplum, Netezza, and Teradata targets, if you use the direct path unload method and the DataStreamer add-on component, you can configure Fast Clone to stream data to the target database. Streaming data avoids the use of intermediate storage for output data files.

Fast Clone provides two methods of unloading data from an Oracle source: direct path unload and conventional path unload. The direct path unload method is much faster because it reads Oracle data files directly. It also works with the DataStreamer component. The conventional path unload method uses the standard Oracle Call Level Interface (OCI) to read data and metadata. With the conventional path method, you can unload data over the network. Also, Fast Clone can read data from additional types of Oracle objects, such as views and cluster tables.

You can run a data unload job from the Fast Clone Console or command line. If you use the Fast Clone Console, the Console runs the unload script that is specified in the runtime settings to start the Fast Clone executable. By default, the unload script name is unload.cmd on Windows and unload.sh on Linux and UNIX. The Fast Clone executable uses the cloning configuration file to unload data. If you use the command line, you can start the Fast Clone executable directly from the command line or run the unload script. When you enter the Fast Clone executable or unload script at the command line, you can specify the unload parameters

72

either by using the config parameter to point to the cloning configuration file or by entering the parameters at the command line.

Character Set of the Unloaded DataFor the direct path unload method, Fast Clone does not convert source character data from the source character set to another character set. For the conventional path unload method, the NLS_LANG environment variable determines the character set of the unloaded data.

For the direct path unload method, Fast Clone unloads data in the character set that the Oracle database uses to store the source data. For example, if Oracle stores the data in UTF-8 encoding, Fast Clone unloads data in UTF-8. To view the character set of the source database, use the following command:

SELECT parameter,value FROM NLS_DATABASE_PARAMETERSwhere parameter in ('NLS_CHARACTERSET','NLS_NCHAR_CHARACTERSET');

For the conventional path unload method, you can define the NLS_LANG environment variable on the system where you run unload jobs to unload data with the character encoding of the target database. Use the following syntax, depending on the system type:

• On Linux, run the following commands from the Linux console:

NLS_LANG=language_territory.charsetexport NLS_LANG

• On UNIX, run the following command from the UNIX console:

setenv NLS_LANG language_territory.charset• On Windows, run the following command from the command prompt:

SET NLS_LANG=language_territory.charsetWhen you run Fast Clone unload processing, Oracle uses the NLS_LANG environment variable to convert the character set and provide the source data in the specified encoding.

For both the direct and conventional path unload methods, you can configure Fast Clone to convert character data in CLOB, NCLOB, NCHAR, and NVARCHAR2 columns from UTF-16 to UTF-8 encoding. Depending on the source datatypes, set the following parameters to true in the cloning configuration file to convert the source character data to UTF-8 encoding:

• convert_utf16_clobs_to_utf8

• convert_utf16_nclobs_to_utf8

• convert_utf16_nchars_to_utf8

Character Set of the Unloaded Data 73

Unloading Data from ViewsYou can configure Fast Clone to unload data from Oracle views by using the conventional path unload method.

You cannot select views for a data unload job in the Fast Clone Console because source views are not displayed on the Source Tables tab. To unload data from views, use one of the following methods:

• Open the configuration file in a text editor and manually add views to the [SOURCE_INDIRECT_TABLES] section. Use the following syntax:

TABLEid=view_nameTo unload specific columns from a view, add a statement for each view on a separate row in the [SOURCE_INDIRECT_TABLES_COLUMNS_LIST] section. In each view statement, list the columns to unload, using a comma (,) as the separator. Use the following format:

TABLEid_COLUMN_LIST=column_name_1,column_name_2,...For example, the following statements unload data from three columns of the DEMO_VIEW view:

[SOURCE_INDIRECT_TABLES]TABLE1=DEMO_VIEW

[SOURCE_INDIRECT_TABLES_COLUMNS_LIST]TABLE1_COLUMN_LIST=COLUMN1,COLUMN2,COLUMN3

• Define an SQL query that selects data from the view. In the query statement, use the full view name, including the schema name. You can define an SQL query for a configuration file in the Fast Clone Console.

Note: To unload data from Oracle materialized views, select the materialized views on the Source Tables tab in the Fast Clone Console. You can use the direct path unload method because Oracle stores materialized view data in the data files.

Related Topics:• “Defining SQL Queries for Unload Processing” on page 31

Unloading Data from TDE-Encrypted Columns and Tablespaces

Fast Clone supports Oracle sources that use Oracle Transparent Data Encryption (TDE) encryption of columns and tablespaces.

Note: Fast Clone supports encrypted tablespaces for Oracle 11.2.0.2 and later.

Fast Clone supports TDE with all Oracle encryption algorithms, including 3DES and AES with 128-bit, 192-bit, and 256-bit key lengths. Fast Clone also supports TDE with HMAC integrity checking and SALT enhanced security.

To unload data from encrypted columns or tablespaces, perform one of the following steps:

• If you use the direct path unload method, Fast Clone uses the master key IDs and values in the [MKEY_ID] and [MKEY_VALUES] sections of the cloning configuration file to unload data from encrypted tables. The Fast Clone Console prompts you to open the Oracle wallet file when you save the configuration file. The Fast Clone Console then reads the master key IDs and values and writes them to the [MKEY_ID] and [MKEY_VALUES] sections.

74 Chapter 4: Unloading Data from the Source Database

• If you use the conventional path unload method, ensure that the Oracle wallet is open in the database before you start the unload job. Use the following command:

ALTER SYSTEM SET ENCRYPTION WALLET OPEN IDENTIFIED BY "password";

Running Data Unload JobsYou can run a data unload job from the Fast Clone Console or the command line.

• To start a data unload job from the Fast Clone Console, perform one of the following actions:

- On the menu bar, click Data > Unload data with Informatica Fast Clone Console.

- On the toolbar, click the Start Informatica Fast Clone Console as specified in Runtime settings button.

The Fast Clone Console runs the unload script that you specified in the Informatica Fast Clone Console start script field on the Runtime Settings tab > File Locations view. By default, the Fast Clone Console uses the unload.cmd script on Windows and the unload.sh script on Linux and UNIX to start the Fast Clone executable. You can edit the default script or create another script based on the default script to set environment variables or perform particular configuration tasks.

If you run a data unload job with the default script, Fast Clone uses the configuration file that you specified in the Config file name field on the Runtime Settings tab > File Locations view to unload data.

• To run a data unload job from the command line, you can start the Fast Clone executable directly or use the unload scripts. To start the Fast Clone executable directly, use the following syntax:

- On Windows:

Fastreader.exe config=configuration_file- On Linux and UNIX:

Fastreader config=configuration_fileTo start the Fast Clone executable with the default unload scripts, use the following syntax:

- On Windows:

unload.cmd configuration_file- On Linux and UNIX: .

./unload.sh config=configuration_fileInstead of using the cloning configuration file, you can enter unload parameters directly at the command line. For more information, see “Command Line Parameters” on page 96.

Unloading Data to Named PipesYou can configure Fast Clone on Linux and UNIX to unload data to named pipes instead of to output files. Use pipes to avoid consuming storage for large output files.

Note: Fast Clone does not support unloading data to named pipes on Windows.

Fast Clone uses a named pipe that has fixed size of approximately 6 MB for each source table from which data is unloaded. By default, the pipe name is based on the name of the source table or table partition. After Fast Clone writes a chunk of data to the pipe, the target load utility reads the data from the pipe and loads it to the target. Fast Clone then writes the next chunk of data to the pipe. After the load utility loads the last

Running Data Unload Jobs 75

chunk of data from the pipe, Fast Clone closes the pipe. If the load utility does not read the data from the pipe, Fast Clone unload processing remains pending.

To load data from named pipes to targets, you must generate the load script before you run the unload job. By default, Fast Clone generates the load script after it completes unloading data from the source to output files. However, to stream data through pipes to the target database, you must generate the load script, start unload processing, and then run the load script while unload processing is in progress. To generate the load script and unload data to pipes, you must run the Fast Clone twice:

• The first time you run Fast Clone, select the Generate loader input only, do not unload data runtime option. Fast Clone generates the load script in the output directory but does not unload data.

• The second time you run Fast Clone, clear the Generate loader input only, do not unload data runtime option and then run Fast Clone either directly or with the unload script. Fast Clone then unloads data to the pipes but does not generate another load script. While unload processing is occurring, you can start the load script from the first Fast Clone run to begin loading data to the target.

Before you begin unload processing, either manually create a pipe for each source table in the output directory or configure Fast Clone to create the pipes. If you configure Fast Clone to create the pipes, monitor pipe creation in the output directory so that you do not start the load jobs until after the pipes are created. Alternatively, you can create a script to delay load processing until after Fast Clone creates all of the pipes.

Scenarios That Cause Cloning Processes to Remain PendingWhen you unload data to named pipes, the Fast Clone cloning process might remain in a pending state in certain scenarios.

Typically, this situation occurs in the following scenarios:

• You configure Fast Clone to create a named pipe for a table prior to unloading data from the table. However, a named pipe was previously created for this table in the output directory and the load utility reads data from this pipe. When you start the cloning job, Fast Clone creates another named pipe in the output directory. The target load utility reads data from the first pipe, and Fast Clone writes data to the second pipe. Consequently, both the unload and load processes remain in a pending state.

• You configure Fast Clone to create a named pipe for a table prior to unloading data from the table. You run the data unload job and then run the load job. However, Fast Clone did not create the named pipe by the time you started the load job. The load job fails because it cannot read data from the named pipe. Consequently, the unload process remains in a pending state.

Unloading Data to Named Pipes Created by Fast CloneYou can configure Fast Clone to create a named pipe in the output directory for each source table before unloading data from the tables. The target load utility reads data from the pipes and loads it to the target.

When unloading data to named pipes, Fast Clone performs a checkpoint for the source database and then creates the pipes for the tables to unload. Checkpoint processing can take up to 20 seconds.

Before you run the load job for a table, ensure that Fast Clone created the pipe for the table. You can create a script to delay the load job until after the pipe is created.

Note: If you start a load job for a table before Fast Clone creates the pipe for the table, the load process fails because the target load utility cannot read data from the pipe. Also, the unload process remains in a pending state.

1. Create a cloning configuration file.

Specify connection information for the source and target databases, the source tables to unload, the load script base name, and other settings, as needed.


2. To generate a load script for the target utility, complete the following steps:

a. Configure Fast Clone to generate the load script in one of the following ways:

• In the Fast Clone Console, select the Generate loader input only, do not unload data option on the Runtime Settings tab > File Locations view.

• At the command line, enter create_load_input_only=true.

• In the configuration file, enter the create_load_input_only=true parameter.

b. Save the cloning configuration file.

c. Run the Fast Clone with the updated cloning configuration file.

Fast Clone generates a .cmd load script on Windows or a .sh load script on Linux and UNIX in the output directory.

3. Configure Fast Clone to unload data without overwriting the generated load script in one of the following ways:

• In the Fast Clone Console, clear the Generate loader input only, do not unload data option and select the Unload data only, do not generate loader input option on the Runtime Settings tab > File Locations view.

• At the command line, enter the create_load_input_only=false and unload_data_only=true parameters.

• In the configuration file, enter the create_load_input_only=false and unload_data_only=true parameters.

4. Configure Fast Clone to create named pipes in the output directory in one of the following ways:

• In the Fast Clone Console, select the Create output as pipes instead of files option on the Runtime Settings tab > Miscellaneous Conditions view.

• At the command line, enter the create_output_pipe_instead_of_file=true parameter.

• In the configuration file, set the create_output_pipe_instead_of_file parameter to true.

5. Run the data unload job.

Fast Clone performs a checkpoint for the source database and creates a named pipe for each source table. Fast Clone then unloads the source data to the pipes.

6. After Fast Clone creates the pipes, run the load script that was generated in Step 2.

To delay the load job until after the pipes are created, you can use another script that monitors creation of the pipes and invokes the load script. For example, use the following Perl script on Linux:

perl -e "while (not(-p '/output_directory/table_name.dat')) {sleep 1;}";/output_directory/load_script.sh

The target load utility reads data from the named pipes and loads it to the target.

Unloading Data to Manually Created Named PipesIf you want to unload data to pipes, you can create the named pipes manually before running the cloning job instead of configuring Fast Clone to generate the pipes.

When you run the cloning job later, Fast Clone unloads data to the named pipes that you manually created. The target load utility reads data from the pipes and loads it to the target.

Unloading Data to Named Pipes 77

The advantage of manually creating pipes, in comparison to Fast Clone generation of the pipes, is that you do not need to monitor when pipes are generated or create a script that delays the load job until after Fast Clone creates the pipes.

1. Create a cloning configuration file.

Specify connection information for the source and target databases, the source tables to unload, the load script base name, and other settings, as needed.

2. To generate a load script for the target utility, complete the following steps:

a. Configure Fast Clone to generate the load script in one of the following ways:

• In the Fast Clone Console, select the Generate loader input only, do not unload data option on the Runtime Settings tab > File Locations view.

• At the command line, enter the create_load_input_only=true parameter.

• In the configuration file, enter the create_load_input_only=true parameter.

b. Save the cloning configuration file.

c. Run the Fast Clone Console with the updated cloning configuration file.

Fast Clone generates a .cmd load script on Windows or a .sh load script on Linux and UNIX in the output directory.

3. Configure Fast Clone to unload data without overwriting the generated load script in one of the following ways:

• In the Fast Clone Console, clear the Generate loader input only, do not unload data option and select the Unload data only, do not generate loader input option on the Runtime Settings tab > File Locations view.

• At the command line, enter the create_load_input_only=false and unload_data_only=true parameters.

• In the configuration file, enter the create_load_input_only=false and unload_data_only=true parameters.

4. Create a named pipe for each table that you selected for unload processing in the output directory.

For example, issue the following commands on Linux and UNIX to create a named pipe for a table:

rm /output_directory/table_name.datmknod /output_directory/table_name.dat p

5. Configure Fast Clone to use the pipes that you manually created in one of the following ways:

• In the Fast Clone Console, verify that the Create output as pipes instead of files option on the Runtime Settings tab > Miscellaneous Conditions view is cleared.

• In the configuration file, enter the create_output_pipe_instead_of_file= false parameter.


Fast Clone writes the unloaded data to the named pipes that you manually created in the output directory.

7. Run the load script that was generated in Step 2.

The target load utility reads data from the named pipes and loads it to the target.


Unloading Data to Files of Fixed-Length Record Format

Fast Clone can unload data to output files that use fixed-length record format. To use output files of fixed-length record format, you must enable padding for the fields.

In data files of fixed-length record format, each field has the same start position and end position in every record. Fast Clone records these positions for all of the columns in the output control files. If the number of bytes of data in a field is less than the field length, Fast Clone can pad the field on the right with space characters. Use this format to unload data to files that use blank column and row separators.

To enable padding for all columns in fixed-length record format data files, use one of the following methods:

• In the Fast Clone Console, on the Runtime Settings > Format Settings view, select the Pad with Blanks (fixed length) option.

• Edit the configuration file to set the right_pad_columns parameter to true.

• From the command line, set the RIGHT_PAD_COLUMNS parameter to true.

If you unload data to a fixed-length format file and configured Fast Clone to unload data from columns that have BLOB, CLOB, NCLOB, LONG, and LONG RAW datatypes to a separate file, Fast Clone unloads all of the column data. Otherwise, the maximum amount of data that Fast Clone unloads to a fixed-length format file is 8 KB.

Generating the Data Files in XML FormatIf you run the unload job from the Fast Clone Console, you can optionally generate the data files in XML format based on the regular ASCII data files. You can then use the XML files with third-party software.

1. Open the FastClone_installation/uiconf/default.cfg file in a text editor and set the generate_xml_metadata parameter to true.

This parameter indicates whether Fast Clone creates metadata files in the _xml subdirectory of the output directory when you run a data unload job.

2. Restart the Fast Clone Console for this setting to take effect.

3. Open the configuration file for which to run the data unload job and save it again in the Fast Clone Console.


Fast Clone generates the table_name.columns file for each unloaded table in the _xml subdirectory of the output directory. These files include table metadata for generating XML data files.

5. Generate the XML data files based on the generated metadata files and the regular data files in one of the following ways:

• From the command prompt on Windows, run the xmlconvert.cmd script. From the command line on Linux or UNIX, run the xmlconvert.sh script.

• From the Fast Clone Console, click Data > Convert to XML on the menu bar, or click the Convert CSV output into XML format button on the toolbar.

Fast Clone generates the XML data files in the _xml subdirectory for each unloaded table.

Unloading Data to Files of Fixed-Length Record Format 79

If the operation ends with an error, view the informaticaFastClone_mngr.log file that Fast Clone generates in the installation directory to diagnose the problem. The following table describes return codes for the script that generates the XML data files:

Error Code Description

0 Fast Clone successfully generated the XML data files.

1 Fast Clone encountered an internal error when generating the XML data files.

2 Fast Clone did not find the output directory.

3 Fast Clone did not find the output data files for which to generate XML data files.

4 Fast Clone did not find the _xml subdirectory in the output directory.

5 Fast Clone could not create an XML file.

6 Fast Clone did not find the metadata files in the _xml subdirectory.

7 Fast Clone could not open the configuration file.


C h a p t e r 5

Loading Data to a TargetThis chapter includes the following topics:

• Load Overview, 81

• Configuration Considerations for Targets, 84

• Generating Target Tables, 86

• Validating the Target Schema, 88

• Disabling or Dropping Target Constraints, 88

• Transmitting the Output Files to the Target, 89

• Loading Data to the Target Database, 89

• Enabling or Creating Target Constraints, 90

Load OverviewThe load mechanism that Fast Clone uses depends on the target type and whether you use DataStreamer.

• For target types other than Amazon Redshift, the Hadoop-based targets, and flat files, you can use the native load utilities to load data from output files or pipes to the target database.

• For Greenplum, Netezza, Teradata, and Vertica targets, you can optionally use the DataStreamer add-on component to stream data to the target by using the appropriate Greenplum or Teradata load utility, Netezza external tables, or Vertica COPY or LCOPY command. This method is faster than loading data from output files or pipes.

Note: For Amazon Redshift targets, Fast Clone always uses DataStreamer to write source data to the Amazon Simple Storage Service (Amazon S3) and then issues a copy command to copy the data to the Amazon Redshift target. You do not need to manually enable the use of DataStreamer for Amazon Redshift targets.

• For Hadoop-based and flat file targets, Fast Clone loads data directly to the targets. Fast Clone does not generate load scripts and control files.

The Fast Clone cloning configuration file contains the target connection information and load settings that are required to load data.

Loading Data from Output Files or Pipes

If you unload data to output files or pipes, Fast Clone generates a load script that you run to load data to the target. In the Fast Clone Console, you can specify the script base name in the Load script base name field on the Runtime Settings tab > File Locations view or accept the source table owner or schema as the base

81

name. Fast Clone adds the extension.cmd (on Windows) or .sh (on Linux or Windows) to the script name. The generated load script will invoke the appropriate load utility or command-line tool for the target type. The following table identifies the load utilities that Fast Clone uses:

Target Type Load Utility

Greenplum gpload or gpfdist utility

Microsoft SQL Server osql utility

Netezza nzload utility

Oracle SQL*Loader (sqlldr) utility

Teradata - Teradata FastLoad- Teradata MultiLoad- Teradata Parallel Transporter (TPT) Load Operator, Update Operator, or Stream Operator

Vertica COPY command on the server sideLCOPY command on the client side

To load data from output files or pipes to a target, perform the following tasks:

1. Create a configuration file.

2. Run the data unload job with the configuration file.

3. Generate the target tables.

4. Optional. Validate the target schema.

5. Optional. Copy the output files to the target system if you are not running a Fast Clone Server on the target and you enabled that Fast Clone Server to accept output files from a remote system.

6. Optional. Disable or drop constraints on the target.

7. Run the load script.

8. Optional. Enable or add the constraints that you previously disabled or dropped on the target.

Loading Data with DataStreamer

If you use DataStreamer, Fast Clone does not generate output files and load scripts. DataStreamer streams data directly to the target by using the following load mechanisms:

Target Type Load Mechanism

Amazon Redshift Data is sent to temporary files in the Amazon Simple Storage Service (Amazon S3) by using the PostgreSQL ODBC driver on Windows or the DataDirect ODBC driver on Linux and UNIX. Then the AWS COPY command is used to move data from the S3 data files to the target.

Greenplum gpfdist utility

Netezza Netezza external tables and Netezza ODBC driver

82 Chapter 5: Loading Data to a Target

Target Type Load Mechanism

Teradata Teradata Parallel Transporter (TPT) Load Operator, Update Operator, or Stream Operator

Vertica COPY command on the server sideLCOPY command on the client side

To load data with DataStreamer, perform the following tasks:

1. Install connectivity software to connect to the target on the system where you run data unload jobs.

• For Teradata targets, install the TPT libraries and JDBC.

• For Netezza targets, install the Netezza ODBC driver.

• For Amazon Redshift targets, if you run Fast Clone on Windows, install the PostgreSQL ODBC driver.

For more information, see the Informatica Fast Clone Installation Guide.


• For Greenplum targets, select the Enable Greenplum gpfdist Direct Data Stream option on the Runtime Settings tab > Greenplum Load Settings view.

• For Netezza targets, select the Enable Netezza Direct Data Stream option on the Runtime Settings tab > Netezza Load Settings view.

• For Teradata targets, select the Enable Teradata Direct Data Stream option on the Runtime Settings tab > Teradata Load Settings view.

• For Vertica targets, select the Enable DataStream option on the Runtime Settings tab > Vertica Load Settings view.

If you create the configuration file for Greenplum, Netezza, and Teradata targets in a text editor, set the direct_data_stream parameter to true.

3. For Amazon Redshift, Greenplum, Netezza, Teradata, and Vertica targets, clear the Suppress trailing null columns option on the Runtime Settings tab > Format Settings view of the Fast Clone Console. If you create the configuration file in a text editor, set the suppress_trailing_nullcolls parameter to false.

4. Generate the target tables.


6. Optional. Disable or drop constraints on the target.


8. Optional. Enable or add the constraints that you previously disabled or dropped on the target.

Loading Data to Hadoop and Flat File Targets

For Hadoop-based and flat file targets, Fast Clone does not generate load scripts and control files because Fast Clone loads the source data directly to the targets. The Hadoop-based targets are Cloudera, Hive, and Hortonworks.

To load data to Hadoop-based and flat file targets, perform the following tasks:


2. For Hive targets, generate the target tables.



Load Overview 83

Configuration Considerations for TargetsTo accurately load data to the target database, review the configuration considerations for your target type and the requirements of the target load utility.

Considerations That Apply to Multiple Target Types• Greenplum, Netezza, Teradata, and Vertica targets require data files to use a new-line character (\n) as a

row separator. If the new-line character occurs in the Oracle data as a data value, the load fails. To prevent this problem, configure Fast Clone to replace each new-line character with the space character in one of the following ways:

- In the Fast Clone Console, select the Replace new line with space option on the Runtime Settings > Format Settings view.

- Manually enter the replace_new_line_with_space=true parameter in the configuration file.

- At the command line, enter the replace_newline_with_space=true parameter.

Replacement of the new-line character with the space character can increase the overhead of unload processing.

Note: For Teradata targets, if you unload data in binary format, Fast Clone does not use a row separator in the output data files.

• Greenplum and Vertica targets require the column separator and the enclosure character to be a single ASCII character. You can also specify a non-printable separator character by using the \xhh escape sequence in hexadecimal notation.

• Greenplum, Teradata, and Vertica targets require data files to not use a column separator at the end of each row. For Teradata and Vertica targets, you must manually configure Fast Clone to produce data files without a column separator at the end of each row in one of the following ways:

- In the Fast Clone Console, clear the Trailing column separator option on the Runtime Settings tab > Format Settings view.

- Manually enter the trailing_column_separator parameter=false parameter in the configuration file.

- At the command line, enter the trailing_column_separator=false command-line parameter.

For Greenplum targets, Fast Clone never writes a trailing column separator at the end of each row, regardless of the trailing_column_separator setting.

Considerations for Amazon Redshift Targets• Fast Clone connects to and writes source data to the Amazon Simple Storage Service (Amazon S3).

Amazon S3 is a storage service that can copy data from a source and simultaneously move the data to Amazon Redshift clusters. After the source data is in Amazon S3 storage, Fast Clone issues a copy command that copies the data to an Amazon Redshift target table.

• Fast Clone does not delete the temporary files that contain the source data from the Amazon S3 bucket if the Fast Clone executable ends with an error. You can delete these files manually, or use Amazon S3 Object Lifecycle Management to automatically delete the temporary files from the bucket when the lifetime of the temporary files expires.

• Fast Clone cannot load data from source columns that have binary datatypes to Amazon Redshift targets.

Considerations for Flat File Targets• Fast Clone does not generate load scripts and control files for flat file targets.


Considerations for Cloudera and Hortonworks Targets• To unload data to Cloudera and Hortonworks targets, your Fast Clone license must include Hadoop target

support.

• Fast Clone does not generate load scripts and control files for Hadoop targets.

Considerations for Hive Targets• To unload data to Hive targets, your Fast Clone license must include Hadoop target support.

• To unload data to remote Hive targets, the underlying Hadoop software must allow remote connections.

• In the Column separator field on the Runtime Settings tab > Format Settings view of the Fast Clone Console, enter the field delimiter that the Hive target uses in data files. By default, Hive data warehouses use the ASCII start of heading (SOH) character as the field delimiter.

• Use the Custom url option to connect to a Hive target that requires Kerberos or LDAP authentication.

• Ensure that the target tables are in CSV format. Fast Clone does not support loading data directly into tables that use other formats such as the Record Columnar File (RCFile) or Optimized Row Columnar (ORC) format.

• Fast Clone does not support unloading source data to partitioned tables on Hive targets.

• Fast Clone does not support unloading source data to Hive tables that use custom locations.

• Fast Clone does not generate load scripts and control files for Hive targets.

Considerations for Netezza Targets• Netezza targets require the column separator to be a single ASCII character. You can also specify non-

printable separator and enclosure character by using the \xhh escape sequence in hexadecimal notation.

• To make unload and nzload processing faster, delete the default double quotation mark (") character in the Enclosed by field on the Runtime Settings tab > Format Settings view.

• If you use the nzload utility to load data to a Netezza target, TIMESTAMP columns must use the format YYYY-MM-DD HH24:MI:SS.FF6, and DATE columns must use the format YYYY-MM-DD. The nzload utility supports these ISO date formats. Ensure that you specify the correct formats in the Timestamp format and Date format fields on the Runtime Settings tab > Format Settings view.

• If you unload data from CLOB or LONG columns and load these values with the nzload utility to a Netezza target, clear the Unload binary into separate files option. The nzload utility can load up to 32,768 bytes of data into VARCHAR columns.

• If you use DataStreamer to load data to a Netezza target, verify that the Unload binary into separate files, Trailing column separator, and Suppress trailing null columns options are not selected on the Runtime Settings tab > Format Settings view.Alternatively, verify that the export_binary_to_separate_files, trailing_column_separator, and suppress_trailing_nullcols parameters are set to false in the configuration file.

Considerations for Teradata Targets• For Teradata targets, delete the default double quotation mark (") character in the Enclosed by field on the

Runtime Settings tab > Format Settings view so that the field is empty.

• If you want the output data file to be in Teradata binary format instead of text-delimited format, select the Binary length prefix option on the Runtime Settings tab > Teradata Load Settings view.

• For Teradata targets, Informatica recommends that you create separate unload_td.cmd and unload_td.sh scripts that call the FastReaderTD executable for all unload jobs for this target type. You can then specify the unload_td file in the Informatica Fast Clone Console start script field on the Runtime Settings tab > File Locations view.

Configuration Considerations for Targets 85

• Data cannot be loaded to Teradata targets by using Fast Clone-generated load scripts and data files if the total row size exceeds the Teradata maximum buffer size. This error is most likely to occur with source tables that contain LOB columns because Fast Clone uses a default width of 32,768 bytes for LOBs.

Generating Target TablesFor each source table, Fast Clone requires a corresponding target table or file.

The source and target columns must have compatible datatypes. Also, the table and column names on the target must be identical to those on the source.

In the Fast Clone Console, you can perform the following tasks:

• Generate target tables based on the source table schema.

• Generate SQL CREATE TABLE scripts that you can run later to create target tables based on the source table schema.

If the source and target database types are different, Fast Clone uses the rules in the <ConversionRules> section of FastClone_installation\uiconf\DataTypes.xml file to convert source column datatypes to the compatible target column datatypes.

Generating Target Tables Based on Source Table SchemaYou can optionally generate target tables based on source table schema.

1. In the Fast Clone Console, connect to the source database and the target database.

2. Click Schema > Schema creation script on the menu bar, or click the Generate tables on destination script button on the toolbar.

The Generating Schema for the Target dialog box appears.

3. In the Schema/owner field, enter the schema or owner name of source tables that you want to use to generate the target tables.

The tables that match the selected schema or owner name are listed in the Tables box.


The following image shows an example listing:

4. To filter the list of tables, in the Filter field, enter the first few letters of the source table names that you want to use for generating target tables. For case-sensitive filtering, enclose the filter value in double quotation marks.

Only the tables that match the filter criteria are listed in the Tables box.

5. Select the tables to use for generating the target tables:

• To select all listed tables, click Select All.

• To select tables individually, select the Convert check box next to each table.

6. Under Target, in the Schema field, enter a schema name for the target.

7. In the Database type list, select the target database type.

8. To create the tables, perform one of the following actions:

• To generate SQL CREATE TABLE scripts, click Convert. Then in the Save dialog box, save the generated scripts to a directory. You can run the scripts on the target system to create the target tables.The Fast Clone Console generates the SQL CREATE TABLE script in the specified directory. If you use sequence objects on the Oracle source, the Fast Clone Console also creates an SQL script for generating sequence objects in the specified directory. To name this script, the Fast Clone Console uses a name of the SQL CREATE TABLE script with the prefix _sequences.

• Click Run to create the tables on the target immediately.

Generating Target Tables 87

Enabling the Extended Procedure for Generating Oracle Target Tables

By default, when generating target tables based on the source table schema, the Fast Clone Console preserves only the primary key constraints and column names that you define on the source.

For Oracle targets, if you also want to preserve default values, foreign key constraints, check constraints, unique constraints, indexes, and triggers that you define on the source, set the newDDLConversion parameter in the default.cfg file to false, as follows:

1. Close the Fast Clone Console.

2. In the FastClone_installation/uiconfig/default.cfg file, set the newDDLConversion parameter to false:

newDDLConversion=falseThis parameter affects Oracle targets only. For other targets, the Fast Clone Console always uses the default target generation procedure.

Validating the Target SchemaFrom the Fast Clone Console, you can validate the target schema to ensure that all of the source tables and columns have corresponding target tables and columns.

Note: The validation feature does not check whether the source column datatypes are compatible with the target column datatypes.

1. Connect to the source database and target database.

2. On the Target DB tab, ensure that you select the target schema or owner.

3. Click Schema > Validate target on the menu bar, or click the Validate selected tables on destination button on the toolbar.

The Validate tables structure on target dialog box appears.

4. Review the validation results.

The Fast Clone Console highlights validation errors in red.

5. Click OK.

6. Resolve validation errors, if any, and repeat validation procedure.

Disabling or Dropping Target ConstraintsFrom the Fast Clone Console, you can generate a script for disabling or dropping target table constraints, depending on the target type.

If you have target tables with primary key-foreign key relationships, run this script prior to loading data to avoid constraint violations on the target.


2. On the Target DB tab, ensure that you select the target schema or owner.

3. Click the Schema > Disable Constraints on the menu bar, or click the Disable all constraints on all selected tables on destination button on the toolbar.


The View Script dialog box appears. It shows the generated script for disabling or dropping target constraints.

4. Perform one of the following actions:

• Click Run to run the script immediately.

• Click Save to save the generated script to a directory of your choice. You can run the script on the target system later to disable or drop constraints.

Transmitting the Output Files to the TargetAfter Fast Clone completes the data unload job, you can either run the data load job remotely or transmit the output files to the target database system.

Typically, you run the data unload job and generate output files on the source database system.

Different strategies are available to transmit the output files to the target system. If the load utility for your target type is not installed on the source system, use one of the following strategies to transmit the output files:

• Run a Fast Clone Server instance on the target system. On the Runtime Settings tab > Remote Server view, configure the Fast Clone Server to receive output files.

• Manually transmit the output files to the target system by using FTP or another means. For example, archive the output files and then transfer them using FTP.

Related Topics:• “Configuring Fast Clone Server Tasks” on page 60

Loading Data to the Target DatabaseIf you do not use the DataStreamer component, you must run the load script that Fast Clone generates in the output directory to load data from output files or pipes to the target database. Fast Clone generates the load script for the native load utility of the target database.

The load script is a .cmd file on Windows and a .sh file on Linux and UNIX. The load script file name is based on the value of the Load script base name field on the Runtime Settings tab > File Locations view in the Fast Clone Console. If you configured the cloning operation from the command line, the load script file name is based on the value of the load_script_base_name command line parameter.

1. Ensure that the target load utility is installed on the system where you run data load jobs.

The target load utilities are usually installed as a part of the target database software and you do not need to install them separately. However, if you want to run the load operation from the source system or if you load data from pipes, you must install the target load utilities on the source system.

2. Review the load script that Fast Clone generated in the output directory. Optionally edit load options.

Verify that Fast Clone correctly set connection information and load settings for the target load utility. For more information about load options, see the target database documentation for the load utility.

3. Run the load script from the command line.

Transmitting the Output Files to the Target 89

Enabling or Creating Target ConstraintsIf you previously disabled or dropped target constraints to avoid constraint violation errors, you can re-enable or re-create the constraints after loading data to the target database.

From the Fast Clone Console, generate a script for enabling or creating target constraints.


2. On the Target DB tab, ensure that you selected target schema or owner.

3. Click Schema > Enable Constraints on the menu bar, or click the Enable all constraints on all selected tables on destination button on the toolbar.

The View Script dialog box appears and displays the generated script.

4. Perform one of the following actions:

• Click Run to run the script immediately.

• Click Save to save the generated script to a directory of your choice. You can run the script on the target system later to re-enable or re-create the constraints.


C h a p t e r 6

Remote Configuration Management


• Remote Configuration Management Overview, 91

• Enabling Remote Configuration Management, 91

• Opening a Configuration File on a Remote System from the Fast Clone Console, 92

• Saving a Configuration File to a Remote System, 93

Remote Configuration Management OverviewIn the Fast Clone Console, you can use remote configuration management to open and save a cloning configuration file on a remote system. You must run an FTP server on the remote system to transfer the cloning configuration files.

After you save a cloning configuration file to a remote system, you can use it to run data unload jobs on the remote system from the command line. For example, if you run the Fast Clone Console on a standalone system, you can save the cloning configuration file to the source system and then run unload jobs from the command line on the source system using the direct path unload method. In this case, a Fast Clone Server is not required.

Also, you can use remote configuration management to store backups of the cloning configuration files on a remote system.

Enabling Remote Configuration ManagementTo open and save configuration files on a remote system from the Fast Clone Console, you must run an FTP server on the remote system and enable remote configuration management.

1. Ensure that a FTP server is running on the remote system where you want to store the configuration files.

2. On the Source DB tab > Source Database view of the Fast Clone Console, select the Remote configuration management option.

A confirmation dialog box appears.

91

3. Click OK.

On the menu bar, the File > Open Remotely and File > Save Remotely commands become available.

Opening a Configuration File on a Remote System from the Fast Clone Console

In the Fast Clone Console, you can open a configuration file that is stored on a remote system by using FTP. You then view or edit the file in the Fast Clone Console. If you edit the file, you can save the updated file back to the remote system.

1. Verify that the Remote configuration management option is selected on the Source DB tab > Source Database view.

2. On the menu bar, click File > Open Remotely.

The Informatica Fast Clone Console Remote window appears, as shown in the following image:

3. Enter connection information for the FTP server that runs on the remote system.



Host address The host name or IP address of the remote system that stores the configuration file.

No default

Initial folder The directory that opens after you connect to the remote system. Leave this field blank to open the root directory.

Blank

Username A user name that is used to access the configuration file with FTP. anonymous

Password A valid password for the specified user. Blank

Port number The port number for the FTP server 21

92 Chapter 6: Remote Configuration Management


Transfer mode The transfer mode that Fast Clone uses to transfer the cloning configuration files to the FTP server. Options are:- Smart by Ext. Automatically determines the transfer mode based on the file

extension.- Ascii Text. Converts ASCII-formatted text configuration files to another

format, if necessary.- Binary Data. Transfers the files as a binary stream of data.

Smart by Ext

List command A command that is used to get a list of files and directories on the remote system.

LIST

Socket mode The mode for establishing the data connection. Options are:- Active. The Fast Clone Console opens a socket and waits for the server to

establish the transfer connection.- Passive. The FTP server chooses a port for the data connection. The Fast

Clone Console issues the PASV command and initiates data connections to the server by using the PASV port.

Passive

4. Click Connect.

If the connection is successful, the Directories tab appears and displays the file hierarchy on the FTP server.

If the connection is not successful, click the Logs tab to review log information.

5. Select a configuration file to open and click Open.

The Fast Clone Console opens the configuration file.

Saving a Configuration File to a Remote SystemIn the Fast Clone Console, you can save a configuration file to a remote system by using FTP.

1. Ensure that the Remote configuration management option is selected on the Source DB tab > Source Database view.

2. On the menu bar, click File > Save Remotely.

The Informatica Fast Clone Console Remote window appears.

3. Enter connection information for the FTP server.



Host address The host name or IP address of the remote system to which you are saving the configuration file.

No default

Initial folder A directory that opens after you connect to the remote system. Leave this field blank to open the root directory.

Blank

Username A user name that is used to access the configuration file with FTP. Anonymous

Saving a Configuration File to a Remote System 93


Password A valid password for the specified user. Blank

Port number The port number for the FTP server. 21

Transfer mode The transfer mode that Fast Clone uses to transfer the cloning configuration files to the FTP server. Options are:- Smart by Ext. Automatically determines the transfer mode based on the file

extension.- Ascii Text. Converts ASCII formatted text configuration files to another

format, if necessary.- Binary Data. Transfers the files as a binary stream of data.

Smart by Ext

List command A command that is used to get a list of files and directories on the remote system.

LIST

Socket mode The mode for establishing the data connection. Options are:- Active. The Fast Clone Console opens a socket and waits for the server to

establish the transfer connection.- Passive. The FTP server chooses a port for the data connection. The Fast

Clone Console issues the PASV command and initiates data connections to the server using the PASV port.

Passive

4. Click Connect.

If the connection is successful, the Directories tab appears and displays the file hierarchy on the FTP server.

If the connection is not successful, click the Logs tab to review log information.

5. Select a directory to which to save the configuration file to and click Save.

The Fast Clone Console saves the configuration file to the specified directory.

94 Chapter 6: Remote Configuration Management

C h a p t e r 7

Fast Clone Command Line Interface


• Command Line Interface Overview, 95

• Syntax, 95

• Command Line Parameters, 96

• Command Line Example, 113

Command Line Interface OverviewYou can run the Fast Clone executable with a configuration file or inline parameters directly from a command line.

If you use the cloning configuration file, run the Fast Clone executable with the config parameter. This parameter specifies a path to the configuration file that includes all of the parameters for the data-cloning job.

If you enter parameters for the data-cloning job directly from the command line, Fast Clone cannot also use a cloning configuration file. You enter the parameters at the command line after the Fast Clone executable name.

SyntaxFast Clone can accept any number of arguments from the command line. You can use the config parameter to specify a configuration file that contains the parameters, or enter parameters individually at the command line when you run the Fast Clone executable.

Informatica recommends that you run Fast Clone with a configuration file. Use the following syntax:

• On a Windows system:

FastReader.exe config=configuration_file [remotehost=Fast_Clone_Server_address:port]• On a Linux or UNIX system:

./FastReader config=configuration_file [remotehost=Fast_Clone_Server_address:port]Include the optional remotehost parameter to send the unload request to a remote Fast Clone Server.

95

If you want to enter Fast Clone parameters directly at the command line, do not use the config parameter. Use the following syntax:

• On a Windows system:

FastReader.exe parameter_1=value parameter_n=value• On a Linux or UNIX system:

./FastReader parameter_1=value parameter_n=valueImportant: For Teradata targets, use the FastReaderTD.exe on Windows or FastReaderTD on Linux or UNIX.

Include a space between each parameter specification. When you specify boolean parameters from the command line, you can use true or false as the value or the shorter form of y or n. For example, both forms of the following parameter are acceptable and have the same function:

compress_on_the_fly=truecompress_on_the_fly=y

Command Line ParametersReview the descriptions of the command line parameters to determine if you want to use them. The parameter descriptions are grouped by general parameter type. The parameter types are similar to the tabs in the Fast Clone Console.

Source Database ParametersSpecify the following command line parameters to connect the source database:

asm_debug

Set this parameter to true only at the request of Informatica Global Customer Support. For Oracle sources in an Automatic Storage Management (ASM) environment, a value of true causes Fast Clone to print an ASM debug log to the asm_debug.log file.


asm_instance_host

The host name or IP address of the system with the ASM instance.

asm_instance_name

The ASM instance name to which Fast Clone connects to unload data.

Default value: No default value is provided.

asm_instance_port

The port number that is used to connect to the ASM instance that is specified in the asm_instance_name parameter.

Default value: 1521

asm_instance_sid

The ASM instance SID.

96 Chapter 7: Fast Clone Command Line Interface

asm_protocol

The protocol that Fast Clone uses to connect to an ASM instance. Valid values are:

• TCP. Use the TCP/IP protocol.

• TCPS. Use the TCP/IP protocol with the Secure Sockets Layer (SSL).

Default value: TCP

asm_sys_password

A password that is defined for the ASM SYS user.

asm_sys_username


Default value: SYS

owner

The name of the source schema owner for the tables to be unloaded. This owner name is also the schema name for the tables.

protocol

The protocol that Fast Clone uses to connect to the source database. Valid values are:



Default value: TCP

source_connection_string

The TNS connection string for the Oracle source database. Grant the SELECT ANY DICTIONARY privilege role if you use the direct path unload method.

userid

The connection string for the Oracle source database in the following format:

username/password@net_service_nameInstead of the net_service_name, you can use the IP address, port, and service name of the source database:

username/password@//IP_address:port/service_name

Source Tables, Columns, and SQL QueriesUse the following command line parameters to specify the source tables and columns from which to unload data and any SQL queries that you want to use for unload processing:

all_tables

Indicates whether to unload all tables in the schema that is identified by the owner parameter and specifies the unload method to use for these tables. Options are:

• 0. Use the conv_path_tables parameter value to determine the tables to unload with the conventional path unload method, and use the tables parameter to determine the tables to unload with the direct path unload method. Specify at least one table in one of the parameters for the unload job to run.

• 1. Let Fast Clone determine whether it can perform a direct path unload. If a direct path unload is not possible, Fast Clone performs a conventional path unload.

Command Line Parameters 97

• 2. Force Fast Clone to perform a direct path unload for all source tables in the schema that is identified by the owner parameter.

• 3. Force Fast Clone to perform a conventional path unload for all source tables in the schema that is identified by the owner parameter.

Default value: 0

conv_path_tables

A space- or comma-separated list of source tables or table partitions from which to unload data with the conventional path unload.

sql_queries

A space- or comma-separated list of SQL queries that are processed with the conventional path unload method. The queries must be enclosed in quotation marks. Add a backslash before each quotation mark. For example:

sql_queries=\"SELECT * FROM TABLE1\",\"SELECT * FROM TABLE2\"Fast Clone writes the results of these queries to the QUERY1.dat and QUERY2.dat files.

tables

A space- or comma-separated list of source tables or table partitions from which to unload data with the direct path unload method. For example:

tables=TABLE1,TABLE2

Target Database ParametersSpecify the following command line parameters to connect to the source database:

destination_database_type

The target database type. Options are:

• flat. Use this option for flat file targets.

• greenplum3. Use this option for Greenplum targets.

• cloudera. Use this option for Cloudera targets.

• hive. Use this option for Hive targets.

• hortonworks. Use this option for Hortonworks targets.

• mssql. Use this option for Microsoft SQL Server targets.

• netezza. Use this option for Netezza targets.

• oracle. Use this option for Oracle targets.

• teradata. Use this option for Teradata targets.

• vertica. Use this option for Vertica targets.

Default value: oracle

destination_host

The host name or IP address of the system where the target database runs.

destination_instance

The target instance name or database name for the tables to be loaded.


destination_load_batchsize

The number of rows in a batch that is passed to the Microsoft SQL Server osql utility for loading data to the target. The osql utility copies each batch to the target server as one transaction.

Fast Clone uses this value as the BATCHSIZE option value in the BULK INSERT statement for the osql utility.

Range of values: 1 through 1000000

Default value: 10000

destination_load_maxerrors

For Greenplum and Teradata targets, the maximum number of errors that can occur when streaming data to the target database with DataStreamer. If the number of errors exceeds this maximum, DataStreamer ends with an error.

For Microsoft SQL Server targets, specifies the maximum number of syntax errors that can appear in the output data files before the osql utility cancels the BULK INSERT operation. The osql utility ignores all of the rows that it cannot parse and counts an error for each row. Fast Clone uses this value as the MAXERRORS option value in the BULK INSERT statement for the osql utility.



destination_password

A valid password for the user that is specified by the destination_user parameter. If you specify a forward slash (/) in the destination_user parameter, Fast Clone ignores the password and uses operating system authentication

destination_port

The port number that Fast Clone uses to connect to the target database.


destination_schema

The target schema name or schema owner for the tables to be loaded.

destination_user

The name of a user who has permissions to connect to the target database and load the source data. If you specify a forward slash (/) in this parameter, Fast Clone ignores the password and uses operating system authentication.

greenplum_loader_home

For Greenplum targets, if you load data to the target database from data files or pipes, the path to the Greenplum loader root directory that Fast Clone uses to locate the gpload utility and generate valid load scripts.

Runtime Command Line ParametersYou can specify runtime settings when you start Fast Clone in the command line.

big_raw_device_offset_mul

For Oracle sources that use raw devices on AIX, the number of offset blocks to skip from the beginning of a DF_LGDSK raw device when the calc_raw_device_offset parameter is set to true. You can specify the size of a single offset block in the raw_device_offset parameter. To get the total number of bytes that


Fast Clone skips on DF_LGDSK raw devices, multiply the raw_device_offset value by the big_raw_device_offset_mul parameter value.

Valid values: 1 through 64

Default value: 32

binary_length_prefix

For Teradata targets, indicates whether to unload source data in binary format. Each record in binary format can include a 2-byte field that indicates the total length of the record, some optional bytes for null values, and the data.

For Vertica targets, indicates whether to unload source data in the NATIVE VARCHAR or text format. If you set this parameter to true, Fast Clone adds the NATIVE VARCHAR keyword to the COPY or LCOPY command in the output SQL load script. The database server converts CHAR or VARCHAR fields in the data files to Vertica datatypes when loading data. If you set this parameter to false, Fast Clone loads source data in text format with delimiters. Valid values are true and false.


binary_length_prefix_indicators

For Teradata targets, if you set the binary_length_prefix parameter to true, indicates whether to set the indicator bits in the binary output for columns that contain null values. Valid values are:

• true. Set the indicator bits to 1 for null values.

• false. Do not set indicator bits.


binary_length_prefix_little_endian

If you set the binary_length_prefix parameter to true, indicates whether to use LITTLE ENDIAN or BIG ENDIAN for the binary output. For Teradata targets, this parameter determines the format of the integer that specifies the length of records and fields in the binary files. Valid values are:

• true. The binary output uses the LITTLE ENDIAN format.

• false. The binary output uses the BIG ENDIAN format.

Default value: true

calc_raw_device_offset

Indicates whether Fast Clone skips the offset block in raw devices. Valid values are:

• true. Skip the offset block in raw devices.

• false. Read data from the beginning of the raw device.

To get the total number of bytes that Fast Clone skips in a DF_LGDSK raw device, multiply the raw_device_offset value by the big_raw_device_offset_mul parameter value.

Default value: true

checkpoint_each_x_tables

Specifies the number of tables from which Fast Clone unloads data before taking a checkpoint.


Default value: 0. With this default, Fast Clone takes a checkpoint only before it starts unloading source data.


column_separator

Specifies a character or character sequence that Fast Clone uses to separate columns in the output data files. Specify a character or character sequence that does not appear in the source data. Otherwise, the target load utility cannot correctly parse the data files. For more information, see “Selecting Column and Row Separator and Enclosure Character to Use for Data Loading” on page 49.

You can specify a column separator that uses the \xhh escape sequence in hexadecimal notation.

Also, you can use the following non-printable ASCII characters as column separators:

• \a (alert)

• \b (backspace)

• \n (new line)




If you use non-printable ASCII characters, ensure that the target load utility supports these characters as separators. For more information, see the target database documentation.

Default value: semicolon (;)

compress_network_traffic

Indicates whether to compress network traffic. Valid values are:

• true. Compress network traffic.

• false. Do not compress network traffic.


compress_on_the_fly

Indicates whether to compress the output data files during unload processing to reduce the file sizes. Valid values are:

• true. Compress the output data files. The compression method is specified in the compression_type parameter.

• false. Do not compress the output data files.


compression_level

Specifies the compression level for the GZIP compression method. Higher values result in faster compression and a lower compression ratio. Lower values result in slower compression and a higher compression ratio. For Fast Clone to use this parameter, the compression_type parameter must be set to GZIP and the compress_on_the_fly parameter must be set to true.


Default value: 1

compression_type

If you set the compress_on_the_fly parameter to true, specifies the compression method that Fast Clone uses to compress the output data files. Valid values are:

• GZIP. Provides a better compression ratio but is slower. You can set the GZIP compression level in the compression_level parameter.


• QZIP. Provides a lower compression ratio but is faster.

Default value: GZIP

consistent_lock_on_table_set

If you use the direct path unload method and set the dirty parameter to false, indicates whether to lock all of the selected tables in a table set before unloading the source data. Valid values are:

• true. Fast Clone locks all of the selected source tables in a table set before unloading the source data. Fast Clone unlocks these tables after unloading data from all of them. Enter this value to get consistent point-in-time data across the selected source tables.

• false. Fast Clone holds a lock on each source table only while unloading data from the table.


convert_ascii_to_ebcidic

Indicates whether Fast Clone converts source data to the EBCDIC encoding when unloading data to output data files and pipes. Valid values are:

• true. Convert source data to the EBCDIC encoding.

• false. Preserve the original encoding.


convert_numbers_to_comp3_fl_8b

Indicates whether to write numeric data to the output data files as COMP-3 values. Valid values are:

• true. Convert numeric data to COMP-3 in the output data files. Use this value if you need to read the output file with COBOL or another language that has built-in COMP-3 support.

• false. Preserve the original numeric format.


convert_numbers_to_comp3_implied_decimal

The position of the decimal point in the COMP-3 format.


Default value: 0

convert_utf16_clobs_to_utf8

Indicates whether Fast Clone converts CLOB data from UTF-16 to UTF-8 encoding in the output data files. Valid values are:

• true. Convert source character data in CLOB columns from UTF-16 to UTF-8 encoding.

• false. Do not convert source character data in CLOB columns. If you do not use UTF-16 encoding for the source data, set this parameter to false to avoid the overhead of character set conversion.


convert_utf16_nchars_to_utf8

Indicates whether Fast Clone converts NCHAR and NVARCHAR2 data from UTF-16 to UTF-8 encoding in the output data files. Valid values are:

• true. Convert source character data in NCHAR and NVARCHAR2 columns from UTF-16 to UTF-8 encoding.

• false. Do not convert source character data in NCHAR and NVARCHAR2 columns.


Note: If you do not use UTF-16 encoding for the source data, set this parameter to false to avoid the overhead of character set conversion. To use the parameter, you must manually enter it at the command line or in the cloning configuration file. You cannot enter it in the Fast Clone Console.


convert_utf16_nclobs_to_utf8

Indicates whether Fast Clone converts NCLOB data from UTF-16 to UTF-8 encoding in the output data files. Valid values are:

• true. Convert source character data in NCLOB columns from UTF-16 to UTF-8 encoding.

• false. Do not convert source character data in NCLOB columns.



create_external_table_log_file

For Oracle targets, if you create an SQL script for creating external tables based on the output files, indicates whether to log messages related to this processing to a log file. Valid values are:

• true. The Oracle external table utility logs messages to a log file when accessing data in the output data files. Ensure that the create_external_table_script parameter is also set to true.

• false. The Oracle external table utility does not log these messages.


create_external_table_script

For Oracle targets, indicates whether Fast Clone generates an SQL script to create Oracle external tables based on the output data files. Valid values are:

• true. Create an SQL script to create Oracle external tables based on the output data files.

• false. Do not create an SQL script to create Oracle external tables based on the output data files.


create_load_input_only

Indicates whether Fast Clone creates the load script and control files but skips unloading data from the source database. Valid values are:

• true. Create a load script and the control files. Do not unload data from the source database.

• false. Create a load script and the control files. Unload data from the source database.

Note: Fast Clone does not generate load scripts and control files for flat file, Hadoop, and Hive targets.


create_output_pipe_instead_of_file

Indicates whether Fast Clone creates pipes in the output directory and writes the output data to these pipes. Valid values are:

• true. Fast Clone creates pipes in the output directory.

• false. Fast Clone does not create pipes in the output directory. Use this value to unload data to output data files or to the pipes that you manually created.



date_format

Specifies the date format to use in output files.

The following table specifies the datetime format elements that Fast Clone supports with the direct path unload method:

Format Element Description

DD Day of month

FFnumber Number of fractional secondsValid values: 1 through 9

HH Hour of day in 12-hour format

HH24 Hour of day in 24-hour format

MI Minutes

MM Month

SS Seconds

YY Last 2 digits of the year

YYYY 4-digit year

dbsync_config_xml

Specifies the path and file name for the Data Replication configuration file that can be in XML or SQLite format. If you specify the config parameter, enter an absolute path. If you do not specify the config parameter, enter a path that is relative to the DBSYNC_HOME directory.

Important: If you use a Data Replication configuration in SQLite format, do not copy or move the configuration files from the default DataReplication_installation/configs directory to another location.

dbsync_config_xsd

Specifies the path and file name for the Data Replication configuration schema .XSD file. The path is relative to the DBSYNC_HOME directory.

Default value: config.xsd

dbsync_integration

Indicates whether Fast Clone integrates with Informatica Data Replication. Also, when integration is enabled, determines whether Fast Clone allows user sessions that are connected to the source tables to complete processing before locking the tables. Valid values are:

• false. Do not integrate Fast Clone with Data Replication.

• lock. Integrate Fast Clone with Data Replication. Fast Clone waits for all user sessions that are connected to the source tables to complete processing and close before locking the tables for unload processing.

• lock_disconnect. Integrate Fast Clone with Data Replication. Fast Clone forces all user sessions that are connected to the source tables to disconnect and then locks the tables for unload processing.



dbsync_integration_by_direct

If you enable integration with Data Replication, indicates whether to unload data from the source database with the direct path unload method. Valid values are:

• true. Use the direct path unload method.

• false. Use the conventional path unload method.

Default value: true

dbsync_integration_update_scn

Indicates whether to update the Data Replication configuration file with the SCN for each mapped table after Fast Clone unloads the source data. Valid values are true and false.

Important: Set this parameter to true if you plan to use Data Replication to replicate change data and use Fast Clone to perform initial materialization of the replication target tables.

Default value: true

decimal_separator

Specifies the decimal separator that Fast Clone uses in decimal numbers in unloaded data.

Default value: period (.)

default_long_raw_is_hex

For Oracle targets, indicates whether to unload BLOB and LONG RAW source data in hexadecimal format. Set this parameter to true to unload data with these datatypes in hexadecimal format. For other target types, set this parameter to false to avoid generating incorrect control files.


dest_tables

If you set the dbsync_integration parameter to lock or lock_disconnect, specifies the list of tables that must be resynchronized.

direct_file_access

Indicates whether to read extent information for parallel unload processing directly from the data file. Specify true to read extent information from the data file.


direct_stream

For Greenplum, Netezza, Teradata, and Vertica targets, indicates whether Fast Clone uses DataStreamer to load data to the target database.

• For Greenplum targets, DataStreamer uses the Greenplum gpfdist utility.

• For Netezza targets, Fast Clone unloads data to the named pipes that represent the Netezza external tables. DataStreamer uses bulk Insert to load data from the external tables to the corresponding target tables.

• For Teradata targets, DataStreamer sends the unloaded source data to the Teradata Parallel Transporter (TPT) Load Operator, Update Operator, or Stream Operator, depending on the type of load utility that you specified in the teradata_loader_to_use parameter.

• For Vertica targets, DataStreamer uses the COPY command on the server side and the LCOPY command on the client side to send the unloaded data directly to Vertica targets.


Valid values are:

• true. Use DataStreamer to load data to the target database.

• false. Do not use DataStreamer to load data to the target database.


dirty

Indicates whether to lock the tables that are unloaded with the direct path unload method to perform consistent read operations. Valid values are:

• true. Do not lock the tables before unloading the source data. Use this option if you want to bypass the Oracle consistency mechanism. However, block-level consistency is preserved, because block I/O operations are atomic.

• false. Lock the tables exclusively before unloading the source data.

Default value: true

disable_linux_cache_bypass

For Linux operating systems, indicates whether to use direct I/O mode to read Oracle data files. In direct I/O mode, Fast Clone bypasses operating system caching and reads data directly from disk. Set this parameter to true to use direct I/O mode.


escape_enclosure

Specifies an escape character that Fast Clone inserts before the character that is specified in the optionally_enclosed_by parameter when the character appears in text columns. If you define this parameter, Fast Clone truncates the optionally_enclosed_by value to a single character.

Default value: No default is provided.

explicitly_flush_buffer_cache

Indicates whether Fast Clone clears all data from the buffer cache of the Oracle source database before unloading the data with the direct path unload method. Valid values are:

• true. Fast Clone clears all data from the buffer cache by executing the following command on the Oracle source database:

ALTER SYSTEM FLUSH BUFFER_CACHE;• false. Fast Clone does not clear data from the buffer cache.


fixed_size_characters_by_col_definition

Indicates whether to unload data from character columns in fixed-length format. The fixed length is determined by the column length. If a text value is shorter than the column length, Fast Clone pads the column with spaces up to the full column length. Specify true to unload character data in fixed-length format.


fixed_size_number_by_col_definition

Indicates whether to unload data from NUMBER columns in the fixed-length format. Specify true to unload character data in fixed-length format. The fixed length is defined by the precision of the NUMBER


column, without a decimal separator. If a number is shorter than this length, Fast Clone pads the value with zeroes. The following image shows the fixed-length format of a NUMBER column:

For example, when using fixed-length format, Fast Clone unloads the value 123.45 from the NUMBER(10,5) column as +0012345000.


limit_result_set

Specifies the maximum number of rows that Fast Clone unloads from each table. Set this parameter to -1 to unload all rows from a table.

Valid values: -1 through 1073741823

Default value: -1

load_data_to_same_partition_as_source

For Oracle targets, indicates whether SQL*Loader loads data to a table partition that has the same name as the corresponding source table partition. Specify true to load data from a source table partition to a target table partition of the same name.


load_script_base_name

Specifies the file name of the load script. On Windows, Fast Clone generates load script files with the .cmd extension. On Linux and UNIX, Fast Clone generates load script files with the .sh extension.

Default value: The source schema owner name that is specified by the owner parameter.

lobs_to_separate_files

Indicates whether to write data from BLOB, CLOB, NCLOB, LONG, and LONG RAW source columns to separate files. In the regular data files, Fast Clone includes the file name instead of the source value. Specify true to write data from columns with these datatypes to separate files. For Hive targets, Fast Clone always uses the parameter value of false.


Important: If you use DataStreamer to load source data to Netezza targets, set this parameter to false.

local_time_zone

For Teradata targets, specifies the local time zone to use for unloading data from TIMESTAMP WITH LOCAL TIME ZONE columns. Specify the time zone as an offset from the Oracle database time zone:

"+/-HH:MM"The HH variable is a two-digit number from -13 to +13. The MM variable is a two-digit number from 0 to 59.

For example, enter "+08:00".



log

Indicates whether Fast Clone produces an extended log. Set this parameter to true only at the request of Informatica Global Customer Support.


mb_per_file

Defines the maximum amount of data, in megabytes, that a single output data file (.dat file) can contain.


Default value: -1. With this default value, Fast Clone does not enforce a limit on the size of the output data files.

multiblockread

Specifies the number of Oracle data blocks that Fast Clone fetches per read operation.

Important: Change this parameter only if Fast Clone unload performance is slow. The value of this parameter depends on the extent size for locally managed tablespaces and the table-specific extent sizes for dictionary-managed tablespaces.


Default value: 32

nvl_fixed_size_characters_by_space

For character columns that are unloaded in fixed-length format, indicates whether to replace null values with space characters. Specify true to replace nulls with spaces.


optionally_enclosed_by

Specifies the character or character sequence that Fast Clone uses to enclose character data in output data files. Specify a character or character sequence that does not appear in the source data. The target load utility cannot correctly parse the data files if the enclosure character is a source data value.

If you use the default value and the source columns contain double-quotation marks (") in the data, specify a different character or character string for this parameter or use the escape character in the escape_enclosure parameter to escape the double-quotation marks in the column data.

Note: If you use the character that is defined by the escape_enclosure parameter to escape the optionally_enclosed_by value in the source data, Fast Clone truncates the optionally_enclosed_by value to a single character.

Default value: Double-quotation mark (")

outdir

Specifies the directory in which Fast Clone creates the output files. The output directory can be on a local file system or on a network mapped share, such as NFS or Samba. For Hive targets, specifies the location of flat files that store table data on the HDFS.

Default value: The FAST_READER_HOME environment variable value.

output_file_extension_base

Specifies the extension that Fast Clone uses for output data files. Fast Clone creates a data file for each source table from which data is unloaded.

Default value: dat


parallel_segments

Specifies the maximum number of tables or table partitions that Fast Clone can unload in parallel with the conventional path unload method.


Default value: 1

Note: If you also specify the threads_num parameter, the parallel_segments parameter overrides it.

print_column_names

For flat file targets, indicates whether Fast Clone writes column names in the first row of the output data files. Specify true to write column names in the first row.


Note: This parameter is available for all target types when the unload_data_only parameter is set to true.

raw_device_offset

The size of a single offset block, in bytes, in a raw device. Fast Clone skips the offset block if you set the calc_raw_device_offset parameter to true. To get the total number of bytes that Fast Clone skips on DF_LGDSK raw devices, multiply the raw_device_offset value by the big_raw_device_offset_mul parameter value.

Default value: 0 for Linux, 4096 for other operating systems

record_separator

A character or character sequence that Fast Clone uses to separate table rows in the output data files. Specify a character or character sequence that does not appear in the source data. Otherwise, the target load utility cannot correctly parse the data files. For more information, see “Selecting Column and Row Separator and Enclosure Character to Use for Data Loading” on page 49.

You can specify a row separator that uses the \xhh escape sequence in hexadecimal notation.

Also, you can use the following non-printable ASCII characters:

• \a (alert)

• \b (backspace)

• \n (new line)




If you use non-printable ASCII characters in row separators, ensure that the target load utility supports these separators. For more information, see target database documentation.

Default value: \n

replace_newline_with_space

Indicates whether to replace new line (\n) character in unloaded data with space characters. Specify true if you use the new line character as the row separator in output files and the source character data includes new line characters. However, with a value of true, unload processing performance might be degraded.



resync

If you set the dbsync_integration parameter to lock or lock_disconnect, indicates whether to resynchronize all of the mapped target tables or a subset of the mapped target tables. Valid values are:

• y. Synchronize all of the mapped target tables with the source tables.

• n. Synchronize the set of tables that you specify in the dest_tables parameter.

• p. Synchronize the mapped target tables for which Sync SCN value is -1.

Default value: p

right_pad_columns

Indicates whether to pad columns on the right with spaces when the length of a column value is less than the full length of the column. Specify true to enable padding.


run_as_service

Indicates whether to run the Fast Clone Server instead of the Fast Clone unload engine. Enter true to run the Fast Clone Server.


select_from_sys

Indicates whether Fast Clone uses SYS-owned dictionary tables, such as sys.obj$ and sys.tab$, to get extent boundaries. Valid values are:

• true. Use SYS-owned dictionary tables to get extent boundaries.

• false. Use dba_extents and dba_segments views to get extent boundaries.

Note: If you set this parameter to false for Oracle sources that use locally managed tablespaces, retrieval of extent information from the dba_extents view is slow, especially if the dictionary is partially analyzed and the optimizer mode is not RULE. In this situation, Informatica recommends that you set this parameter to true and grant the SELECT ANY DICTIONARY permission to the user.


server_config

Specifies the path to the Fast Clone Server configuration .ini file. Enter this parameter if you set the run_as_service parameter to true.

show_increments_rows

The number of rows that Fast Clone unloads with the direct path unload method before writing updated progress information for the unload job to the system console or command prompt window.



skip_all_out_file_ops

If you use DataStreamer to stream data to Greenplum or Teradata targets, Fast Clone skips writing data to output files. If you do not use DataStreamer and need to troubleshoot unload jobs, you can set this parameter to true to write the data to output files.

Default value: The direct_data_stream parameter value.


split_files_by_row_count

The maximum number of rows that Fast Clone writes to an output data file. If Fast Clone needs to unload a greater number of rows from the source table, Fast Clone creates multiple output data files for the table. The file names have the following format:

table_name_file_number.datSpecify this parameter only if you use the direct path unload method.

Note: For parallel unload processing, splitting output into multiple data files based on a number of rows is slower than splitting output based on a chunk size. This performance degradation occurs because of locks between the parallel threads.


teradata_drop_rebuild_tables

For Teradata targets, indicates whether DataStreamer drops and re-creates tables in the target database before it loads the source data. Valid values are: true and false.

Default: false

teradata_load_sessions

For Teradata targets, specifies the number of sessions that the Teradata FastLoad or MultiLoad utility can open to load the source data to each target table.

Default value: 2

teradata_loader_to_use

For Teradata targets, specifies the load utility that the Fast Clone-generated load script calls to load source data to the target. Valid values are:

• teradata_fast_load. Use the Teradata FastLoad utility, or use the TPT Load Operator if using Teradata Parallel Transporter (TPT).

• teradata_multi_load. Use the Teradata MultiLoad utility, or use the TPT Update Operator if using Teradata Parallel Transporter (TPT).

• teradata_parallel_dpump. Use the TPT Stream Operator. In this case, you must use Teradata Parallel Transporter (TPT).

Default value: teradata_fast_load

teradata_nodes_ip_list

For Teradata targets, specifies a comma- or space-separated list of IP addresses of Teradata nodes.

teradata_parallel_transporter

For Teradata targets, indicates whether Fast Clone uses a Teradata Parallel Transporter (TPT) operator to load the source data to the target database. Valid values are true and false.


threads_num

Specifies the maximum number of tables or table partitions that Fast Clone can unload in parallel with the direct path unload method.


Default value: 1


threads_per_segment

Specifies the maximum number of threads that Fast Clone can use to unload data from each table or table partition with the direct path unload method.

Note: Fast Clone might use fewer threads. Fast Clone does not create a thread to unload a chunk of less than 100 data blocks.


Default value: 1

timestamp_format

Specifies the timestamp format in the output.



DD Day of month




MI Minutes

MM Month

SS Seconds


YYYY 4-digit year

trailing_column_separator

Indicates whether to add a column separator after the last column value in each row in the output. Valid values are:

• true. Add a column separator after the last column value.

• false. Do not add a column separator after the last column value.

Note: Fast Clone forces this parameter to false for Oracle targets if the unloaded source tables contain LOB columns and the export_binary_to_separate_file parameter is set to true.

Default value: true

Important: If you use DataStreamer to load data to a Greenplum, Netezza, Teradata, or Vertica target, make sure this parameter is set to true.

truncate_numeric_precision

Specifies the number of significant digits to use for rounding down the source numeric values in the output. This parameter truncates only the fractional part of a number.



Default value: -1. Fast Clone does not round down the source numeric values.

unload_data_only

Indicates whether to unload source data only and skip the creation of control files and the load script. Valid values are:

• true. Do not create control files and a data load script when Fast Clone unloads source data to .dat files.

• false. Create control files and a data load script when Fast Clone unloads source data to .dat files.


For flat file, Hadoop, and Hive targets, Fast Clone always uses the parameter value of true.

use_dbms_diskgroup_instead_of_direct

Indicates whether Fast Clone uses the DBMS_DISKGROUP package to read data from ASM sources. Typically, use of the DBMS_DISKGROUP package is significantly slower than reading data directly from ASM sources because the package has a PL/SQL limit of 32,600 bytes per read.

However, if you use the conventional path method and set this parameter to true, Fast Clone unloads data from ASM by using multiple threads. Fast Clone can use multiple threads from a remote computer, with an even data distribution across the threads.

Valid values are true and false.


Command Line ExampleThe following code sample shows how to run Fast Clone from the command line on Linux or UNIX:

FastReader userid=IFC_USER/IFC_PASSWORD@//192.168.137.111:1521/orcl.mshome.net destination_database_type=oracle owner=IFC_USER conv_path_tables=TABLE1 outdir=oracle_output

Command Line Example 113

A p p e n d i x A

TroubleshootingIf you encounter a problem while using Fast Clone, review the following list of previously reported issues and their solutions before contacting Informatica Global Customer Support.

An ORA-942 "table not found" error occurs when I select an Oracle table owner.The user that connects to the Oracle database lacks the permissions that Fast Clone requires. Use a user ID that has the required permissions. For more information, see "Setting Up an Oracle User for Connecting to Oracle" in the Fast Clone Installation Guide.

When I run Fast Clone, the following error occurs:The procedure entry point OCILobClose could not be located in the dynamic link library OCI.dll

The most likely cause of this error is that the Oracle Client is not installed on the remote database server where Fast Clone runs. Download the Oracle Instant Client from the following Web site: http://www.oracle.com/technetwork/database/features/oci/index.html.

During Fast Clone Console startup, the following error occurs: "'java' is not recognized as an internal or external command, operable program or batch file."

The Fast Clone Console is a Java application that requires the Java Runtime Environment (JRE) 1.8. For Hadoop and Hive Targets, the Fast Clone executable requires 64-bit JRE. If you do not have a supported JRE version installed, download the JRE from the following web site: http://www.oracle.com/technetwork/java/javase/downloads/index.html. If you have a supported JRE version installed, this error might occur because the java.exe or java executable is not specified in the PATH environment variable.

The Fast Clone Console requires X Window on Linux or UNIX, but we are not allowed to install X Window on a Linux or UNIX system in our environment.Install and run the Fast Clone Console on a Windows system that is remote from your Linux or UNIX server. To run an unload operation from the Windows system, complete the following steps:

1. Start gui.cmd on the remote Windows system or gui.sh on the remote Linux or UNIX system. Then create the configuration file.

2. On the Runtime Settings tab > Files Locations view, ensure that the Output directory value points to the Oracle source system where the data unload job will run.

3. Copy or FTP the configuration file to the Oracle source system where the data unload job will run.

4. Start unload.sh on the Oracle source machine.

I downloaded Fast Clone for Sun Solaris. When I run unload.sh, the following error occurs:FastReader: syntax error at line 1: `(' unexpected

114

http://www.oracle.com/technetwork/database/features/oci/index.html

http://www.oracle.com/technetwork/java/javase/downloads/index.html

The most likely cause of this error is that you have a Fast Clone version for Solaris that is not compatible with the CPU type. For example, you might have Fast Clone for SPARC III but the CPU type is SPARC II. Download the correct Fast Clone version for the CPU type.

To determine your CPU type, execute the following command:

/usr/platform/sun4u/sbin/prtdiag

When I try to connect to Oracle while running the Fast Clone engine, I get an error that indicates the TNS listener could not resolve SERVICE_NAME given in connect descriptor. However, when I connect to Oracle from the Fast Clone Console, I do not get an error.The Fast Clone Console uses a JDBC connection driver. With this driver, the Fast Clone Console first tries to use the Oracle SID and then the SERVICE_NAME to establish a connection. The Fast Clone executable uses the SERVICE_NAME by default to connect to the database.

The SERVICE_NAME is specified in a tnsnames.ora connection string as follows:

(DESCRIPTION=(ADDRESS_LIST=(ADDRESS=(PROTOCOL=TCP)(HOST=hostname)(PORT=port)))(CONNECT_DATA=(SERVER=DEDICATED)(SERVICE_NAME=instancename)))

To resolve this issue, try using the Oracle SID value. Select the Use SID instead of SERVICE_NAME option on the Runtime Settings tab > Miscellaneous Conditions view in the Fast Clone Console. Alternatively, specify the following parameter in the configuration file:

use_sid_instead_of_service=trueYou can also configure Fast Clone to connect to the database by using the Oracle Bequeth protocol, without a TNS listener. Either set the parameter instance=/ in the configuration file or select the Connect Direct in Informatica Fast Clone Console option on the Source DB tab.

During startup of the Fast Clone Console on UNIX, the following error message is issued:Exception in thread "main" java.lang.InternalError: Can't connect to X11 window server using 'x.y.z.w' as the value of the DISPLAY variable.

The Fast Clone Console requires an X Window environment on Linux and UNIX systems. However, your X Window environment is not properly configured.

Contact your system administrator for assistance. Verify that you can run X Window programs.

The X Window System is a graphics system on UNIX that provides an infrastructure for displaying windowed graphics. It provides a public protocol by which client programs can query and update information on X servers.

Fast Clone freezes when I try to unload data into a pipe instead of a flat file.Fast Clone can write data to a pipe while a program or script is reading data from the pipe. If no program or script is reading data from the pipe, Fast Clone might appear to hang.

An ORA-12528 error occurs when I run Fast Clone on an Oracle database server that uses Automatic Storage Management (ASM).Add the ASM instance to your listener.ora configuration file. Use the following syntax:

(SID_DESC = (SID_NAME = +ASM) (ORACLE_HOME = ORACLE_HOME_PATH) )Then restart the Oracle listener.

115

A p p e n d i x B

Fast Clone Configuration File Parameters

After you save a configuration file in the Fast Clone Console, you can edit it in any plain text editor.

Configuration File OverviewYou can manually edit the parameters in a cloning configuration file after initially defining it in the Fast Clone Console.

Most parameters have default values and are optional. Review the parameter descriptions to determine if you want to include a parameter.

Ensure that you specify a parameter in the correction section of the configuration file. Use the following syntax:

parameter_name=value

Source Database ParametersIn the [SOURCE] section of the configuration file, specify parameters for connecting to the Oracle source.

The following parameters are supported:

dba_password

A valid password for the specified dba_userid user. If you set the dba_userid parameter to a forward slash (/), Fast Clone ignores the password and uses operating system authentication to control connection to the source database.

Important: If the encrypt_database_passwords parameter is set to false, specify a clear text password. By default, the Fast Clone Console sets the encrypt_database_passwords parameter to true and writes an encrypted password.

dba_userid

A user name that has the authority to connect to the source database. Fast Clone uses this user connection to get table metadata and perform checkpoints before unloading the source data. If you specify a forward slash (/) in this parameter, Fast Clone ignores the password and uses operating system authentication.

116

Important: The user must have permissions that allow SELECT ANY DICTIONARY and ALTER SYSTEM operations on the source database. The slower alternative is to scan the system tablespace. For Fast Clone to perform a SELECT from dictionary, the source database must be running.

host

The host name or IP address of the computer where the source database runs.

Default value: localhost

instance

The Oracle instance name that Fast Clone uses to connect to the source database. If this parameter is not defined or specifies a forward slash (/), Fast Clone uses the ORACLE_SID environment variable to connect to an Oracle source with the Bequeath protocol. If the TWO_TASK environment variable is defined, Fast Clone uses the TWO_TASK environment variable.

owner

The name of the source schema owner for the tables to be unloaded. This owner name is also the schema name for the tables.

port

The port number that Fast Clone uses to connect to the source database.

Default value: 1521

protocol

The protocol that Fast Clone uses to connect to the source database. Valid values are:



Default value: TCP

ASM Connection Settings

Use the following parameters if the source database files are managed by Oracle Automatic Storage Management (ASM). Fast Clone connects to an ASM instance to unload data.

asm_instance_defined

Indicates whether Fast Clone connects to an ASM instance. Valid values are:

• true. Fast Clone connects to an ASM instance.

• false. Fast Clone connects to an Oracle source database without ASM.


asm_instance_host

The host name or IP address of the system with the ASM instance.


asm_instance_name

The ASM instance name to which Fast Clone connects to unload data.


asm_instance_port

The port number that is used to connect to the ASM instance that is specified in the asm_instance_name parameter.

Source Database Parameters 117

Default value: 1521

asm_protocol

The protocol that Fast Clone uses to connect to an ASM instance. Valid values are:



Default value: TCP

asm_sys_password

A password that is defined for the ASM SYS user.

Important: If the encrypt_database_passwords parameter is set to false, specify a clear text password. By default, the Fast Clone Console sets the encrypt_database_passwords parameter to true and writes an encrypted password.

asm_sys_username


Default value: SYS

Source Table ParameterIn the [SOURCE_TABLES] section of the configuration file, specify the tables or materialized views to unload with the direct path unload method. In the [SOURCE_INDIRECT_TABLES] section, specify the tables or views to unload with the conventional path unload method.

In both sections, use the following parameter to identify the tables or views:

table_id

The name of a source table to unload. For partitioned tables, specify the table name followed by the partition name. Each table_id must be a unique key in the configuration file.

To unload data from selected partitions in a source table, specify each partition on a separate row. For example:

table1=table_1.partition_1table2=table_1.partition_3

To unload data from all partitions in a source table, specify the table name only or list all partitions. For example:

table1=table_1or

table1=table_1.partition_1table2=table_1.partition_2table3=table_1.partition_3

Tip: The data load script can load table partitions in parallel because Fast Clone creates control and data files for each partition.

118 Appendix B: Fast Clone Configuration File Parameters

SQL Query ParameterIn the [SOURCE_INDIRECT_STATEMENTS] section of the configuration file, optionally enter one or more SQL query statements to be processed when an unload job runs.

Use the following parameter:

query_id

Defines an SQL query, such as a table join. You can specify the query_id parameters multiple times to define multiple SQL queries. Each query_id must be a unique key in the configuration file. Use the following syntax:

query_id=sql_statementFor example:

query1="SELECT * FROM table1 WHERE ROWNUM<10"query2="SELECT * FROM table2 WHERE col_id>100"

Fast Clone writes the results of these queries to the query1.dat and query2.dat files.

Important: Use a SELECT * query only for source tables in which all of the columns have supported datatypes. If a source table includes any columns with unsupported datatypes, the SELECT query must exclude these columns.

Source Column ParametersIn the [COLUMN_BIT_MASK], [SOURCE_INDIRECT_TABLES_COLUMNS_LIST], and [BIN_COLUMN_BIT_MASK] sections of the configuration file, list the source columns from which to unload data.

The [COLUMN_BIT_MASK] section uses the following parameter to list column bit masks for the tables that are specified in the [SOURCE_TABLES] section and unloaded with the direct path unload method:

table_id_column_bit_mask

A hexadecimal bit mask that is used to select the columns for unload processing. The least significant bit indicates the first column. The most significant bit indicates the last column. Set a column bit in the bit mask to 1 to unload data from the column. Set a column bit in the bit mask to 0 to not process the column.

For example, a table that is identified by the table1 key in the [SOURCE_TABLES] section contains the following columns: COL1, COL2, COL3, COL4. To unload data from only COL4, set the binary bit mask for the table to 1000. The hexadecimal value of binary 1000 is 0x8:

table1_column_bit_mask=0x8Note: If a table does not have a bit mask, Fast Clone unloads data from all columns.

If you set the column_list_clause_byname_filtering parameter to true in the [RUN] section of the configuration file, you must replace the bit mask values that the Fast Clone Console generates with comma-separated lists of column names in any order. For example:

table1_column_bit_mask=COL2,COL1,COL4Fast Clone does not provide command line parameters to specify column bit masks. You can specify column bit masks only in a configuration file.

SQL Query Parameter 119

The [SOURCE_INDIRECT_TABLES_COLUMNS_LIST] section uses the following parameter to list columns for the tables that are specified in the [SOURCE_INDIRECT_TABLES] section and unloaded with the conventional path method:

table_id_column_list

A comma-separated list of column names that identify the columns in a table from which to unload data. For example:

table1_column_list=COL2,COL1,COL4Optionally, you can use SQL expressions to specify custom column formats.

• To specify a custom date or timestamp format, use the following statement syntax:

to_char(COLUMN_NAME, 'custom_format') as COLUMN_NAMENote: This custom format overrides the date_format or timestamp_format parameter value. For information about Oracle datetime format, see the Oracle documentation.

• To specify a value to replace null values, use the following statement syntax:

nvl(COLUMN_NAME, 'null_replacement_value') as COLUMN_NAME• To specify a custom precision value for a NUMBER column, use the following statement syntax:

round(NUMBER_COLUMN_NAME, custom_precision) as NUMBER_COLUMN_NAME• To specify a custom column size to pad column values on the right with blank spaces when the

length of the column value is less than the custom column size, use the following statement syntax:

rpad(COLUMN_NAME, column_size, ' ') as COLUMN_NAMEFor example, to use the custom date format dd/mm/yyyy for the COL4 column and replace null values in COL4 data with 14/12/2017, use the following table_id_column_list statement:

table1_column_list=COL2,COL1,nvl(to_char(COL4, 'dd/mm/yyyy'), '14/12/2017') as COL4The [BIN_COLUMN_BIT_MASK] section uses the following parameter to list column bit masks for the columns that are unloaded in binary format:

table_id_bin_column_bit_mask

A hexadecimal bit mask that is used to select the columns to unload in binary format. The least significant bit indicates the first column. The most significant bit indicates the last column. Set a column bit in the bit mask to 1 to unload data from the column in binary format. For example:

table1_bin_column_bit_mask=0x8

Related Topics:• “ Custom Column Format Parameters” on page 121

WHERE Condition ParameterIn the [WHERE_CONDITION] and [SOURCE_INDIRECT_TABLES_WHERE_CONDITION] sections of the configuration file, optionally specify WHERE conditions for the source tables.

The [WHERE_CONDITION] section defines WHERE conditions for the tables that are unloaded with the direct path unload method. The [SOURCE_INDIRECT_TABLES_WHERE_CONDITION] section defines WHERE conditions for the tables that are unloaded with the conventional path unload method.

In both sections, use the following parameter to define the WHERE condition:


table_id_where_condition

Specifies a WHERE clause that is used to filter the row data to be unloaded from the table that is defined by the table_id key in the [SOURCE_TABLES] or [SOURCE_INDIRECT_TABLES] section of the cloning configuration file. For tables that do not have WHERE clause, Fast Clone unloads all rows. For more information about SQL operators for WHERE clauses, see the SQL documentation.

For example, the following statement specifies a WHERE clause for a DATE column:

table1_where_condition=( DATE_COLUMN >= to_date('12.14.2012 10:22:14', 'mm.dd.yyyy hh24:mi:ss') )

Note: You can specify only one WHERE clause for a table_id table.

Custom Column Format ParametersYou can customize date and timestamp formats for unload jobs that use the direct path or conventional path unload method

Custom Formats for Tables That Use the Direct Path Unload Method

If you plan to use the direct path unload method, you can define custom formats for DATE columns in the [DATE_FORMATS] section and custom formats for TIMESTAMP columns in the [TIMESTAMP_FORMATS] section of the cloning configuration file. By default, the [DATE_FORMATS] and [TIMESTAMP_FORMATS] sections are empty. To add a custom format, use the following parameter:

format_ID=custom_format

Specifies a custom DATE or TIMESTAMP format with a unique numeric ID. Ensure that each ID is a one- or two-digit number that is unique within the [DATE_FORMATS] or [TIMESTAMP_FORMATS] section of the configuration file. The length of a custom format must not exceed 99 characters. Also, the number of custom DATE or TIMESTAMP formats in a configuration must not exceed 99.



DD Day of month




MI Minutes

MM Month

SS Seconds

Custom Column Format Parameters 121



YYYY 4-digit year

For example:

[DATE_FORMATS]1=dd/mm/yyyy

[TIMESTAMP_FORMATS]1=dd/mm/yyyy hh24:mi:ss.ff6

In the [table_id] section of the configuration file, assign the custom formats that you defined to columns in the table from which data will be unloaded. Use the following parameter:

COLUMN_NAME

Specifies the column to which to assign a custom date or timestamp format. You can also use this parameter to control numeric precision and padding and to indicate a specific date or timestamp value that replaces nulls.

Use the following general syntax:

COLUMN_NAME=parameter_1:value_1;[parameter_n:value_n;]Important: The COLUMN_NAME value must be in all uppercase.

• To assign a custom date format to the column, use a statement that begins with "d:" and specifies a custom_format_id that you defined for a date format:

d:custom_format_id;Note: This custom format overrides the format that is specified by the date_format parameter.

• To assign a custom timestamp format to the column, use a statement that begins with "t:" and specifies a custom_format_id that you defined for a timestamp format:

t:custom_format_id;Note: This custom format overrides the format that is specified by the timestamp_format parameter.

• To assign a value that replaces null values for the column, use a statement that begins with "n:" and specifies the replacement value:

n:null_replacement_value;For DATE and TIMESTAMP columns, you can specify a null_replacement_value in any format. However, for consistency, use a custom format that is specified by the format_ID=custom_format parameter.

Important: If the specified column is a trailing column and the suppress_trailing_nullcols parameter is set to true, Fast Clone will not unload the column, even if you specify a null replacement value. If the column should not be suppressed, ensure that the suppress_trailing_nullcols parameter is set to false.

• To assign a custom precision to a NUMBER column, use statement that beings with "p:" and specifies the precision value:

p:custom_precision;• To assign a custom padding value to the column, use a statement that begins with "a:" and specifies

the padding value:

a:custom_padding;


For example, assume that you defined the following custom date and timestamp formats:

[DATE_FORMATS]1=dd/mm/yyyy

[TIMESTAMP_FORMATS]1=dd/mm/yyyy hh24:mi:ss.ff6

Now you assign the custom timestamp format and the null replacement value of 14/12/2017 00:00:00.0 to the column TIMESTAMP_WITHOUT_TZ_COL. You also assign the custom date format to the column DATE_COL:

[TABLE1]TIMESTAMP_WITHOUT_TZ_COL=t:1;n:14/12/2017 00:00:00.0;DATE_COL=d:1;

Custom Formats for Tables That Use the Conventional Path Unload Method

To specify custom formats for the columns in source tables that you unload with the conventional path method, use SQL expressions in the [SOURCE_INDIRECT_TABLES_COLUMNS_LIST] section of the cloning configuration file instead of column names that are specified in the [table_id] sections.

• To specify a custom date or timestamp format, use the following statement syntax:

to_char(COLUMN_NAME, 'custom_format') as COLUMN_NAMENote: This custom format overrides the date_format or timestamp_format parameter value. For information about Oracle datetime format, see the Oracle documentation.

• To specify a value to replace null values, use the following statement syntax:

nvl(COLUMN_NAME, 'null_replacement_value') as COLUMN_NAME• To specify a custom precision value for a NUMBER column, use the following statement syntax:

round(NUMBER_COLUMN_NAME, custom_precision) as NUMBER_COLUMN_NAME• To specify a custom column size to pad column values on the right with blank spaces when the length of

the column value is less than the custom column size, use the following statement syntax:

rpad(COLUMN_NAME, column_size, ' ') as COLUMN_NAMEFor example, to use the custom date format dd/mm/yyyy for the COL4 column and replace null values in COL4 data with 14/12/2017, use the following table_id_column_list statement:

table1_column_list=COL2,COL1,nvl(to_char(COL4, 'dd/mm/yyyy'), '14/12/2017') as COL4

Related Topics:• “Source Column Parameters” on page 119

Target Database ParametersIn the [DESTINATION] section of the configuration file, specify parameters for the target database. The output control files and the data load script use these parameters to load data to the target database.


db_type

The target database type. Options are:

• amazon_redshift. Use this option for Amazon Redshift targets.

• cloudera. Use this option for Cloudera targets.

Target Database Parameters 123

• flat. Use this option for flat file targets.

• greenplum3. Use this option for Greenplum targets.

• hive. Use this option for Hive targets.

• hortonworks. Use this option for Hortonworks targets.

• mssql. Use this option for Microsoft SQL Server targets.

• netezza. Use this option for Netezza targets.

• oracle. Use this option for Oracle targets.

• teradata. Use this option for Teradata targets.

• vertica. Use this option for Vertica targets.

Default value: oracle

dba_password

A valid password for the user that is specified by the dba_userid parameter. If you specify a forward slash (/) in the dba_userid parameter, Fast Clone ignores the password and uses operating system authentication.

Important: By default, the Fast Clone Console sets the encrypt_database_passwords parameter to true and writes an encrypted password. However, if the encrypt_database_passwords parameter is set to false, specify a clear text password.

dba_userid

The name of a user who has permissions to connect to the target database and load the source data. If you specify a forward slash (/) in this parameter, Fast Clone ignores the password and uses operating system authentication.

greenplum_loader_home

For Greenplum targets, if you load data to the target database from data files or pipes, the path to the Greenplum loader root directory that Fast Clone uses to locate the gpload utility and generate valid load scripts.

host

The host name or IP address of the system where the target database runs.

instance

The target instance name or database name for the tables to be loaded.

Default value: the value that is specified by the instance parameter in the [SOURCE] section of the cloning configuration file.

mssql_batchsize

The number of rows in a batch that is passed to the Microsoft SQL Server osql utility for loading data to the target. The osql utility copies each batch to the target server as one transaction.

Fast Clone uses this value as the BATCHSIZE option value in the BULK INSERT statement for the osql utility.




mssql_maxerrors

The maximum number of syntax errors that can appear in the output data files before the Microsoft SQL Server osql utility cancels the BULK INSERT operation. The osql utility ignores all of the rows that it cannot parse and counts an error for each row.

Fast Clone uses this value as the value of the MAXERRORS option in the BULK INSERT statement for the osql utility.


owner

The target schema name or schema owner for the tables to be loaded.

port

The port number that Fast Clone uses to connect to the target database.


Note: The default value that is specified by the Fast Clone Console depends on the target type.

protocol

For Oracle targets, the protocol that Fast Clone uses to connect to the target database. Valid values are:



Default value: TCP

s3_bucket

Specifies the name of the Amazon S3 bucket that stores the temporary files with the data to be loaded to the Amazon Redshift target.

s3_file_size

Specifies the maximum size of the temporary files, in kilobytes, that Fast Clone creates in the Amazon S3 bucket before loading the source data to Amazon Redshift.

Default value: 5120 KB

s3_key_id

Specifies an AWS access key ID that Fast Clone must use to access the Amazon S3 account resources including the bucket where Fast Clone creates temporary files.

s3_path

Specifies the name of the directory in the Amazon S3 bucket where Fast Clone creates the temporary files that store source data.

s3_secret_key

Specifies the secret access key for the access key ID that is specified in the s3_key_id parameter. The access key ID must have the authority to access the Amazon S3 account resources.

stream_maxerrors

For Greenplum, Netezza, Teradata, and Vertica targets, the maximum number of errors that can occur when streaming data to the target database with DataStreamer. If the number of errors exceeds this maximum, DataStreamer ends with an error.

Note: For Netezza targets, review the .nzbad file to which DataStreamer writes the data rows that caused an error. DataStreamer does not report any errors in the output log if the number of rows that caused an error is less than the value of this parameter.

Target Database Parameters 125



textfile_charset_collation

The name of a Microsoft SQL Server collation that is used to store character and Unicode data in the output data files. Fast Clone uses this value to correctly specify a column collation in the format file that maps the fields of the data file to the table columns. Fast Clone uses the specified collation for all of the table columns.

Fast Clone uses this value as the FORMATFILE option value in the BULK INSERT statement for the osql utility.

Default value: SQL_Latin1_General_Cp437_BIN

Runtime ParametersIn the [RUN] and [INDIRECT] sections of the configuration file, specify runtime parameters.

In the [RUN] section, you can enter the following generic and direct path unload parameters:

asm_debug

Set this parameter to true only at the request of Informatica Global Customer Support. For Oracle sources in an Automatic Storage Management (ASM) environment, a value of true causes Fast Clone to print an ASM debug log to the asm_debug.log file.


Note: This parameter is not available in the Fast Clone Console. Edit the cloning configuration file to add this parameter manually.

big_raw_device_offset_mul

For Oracle sources that use raw devices on AIX, the number of offset blocks to skip from the beginning of a DF_LGDSK raw device when the calc_raw_device_offset parameter is set to true. You can specify the size of a single offset block in the raw_device_offset parameter. To get the total number of bytes that Fast Clone skips on DF_LGDSK raw devices, multiply the raw_device_offset value by the big_raw_device_offset_mul parameter value.


Default value: 32

binary_length_prefix

For Teradata targets, indicates whether to unload source data in binary format. Each record in binary format can include a 2-byte field that indicates the total length of the record, some optional bytes for null values, and the data.

For Vertica targets, indicates whether to unload source data in the NATIVE VARCHAR or text format. If you set this parameter to true, Fast Clone adds the NATIVE VARCHAR keyword to the COPY or LCOPY command in the output SQL load script. The database server converts CHAR or VARCHAR fields in the data files to Vertica datatypes when loading data. If you set this parameter to false, Fast Clone loads source data in text format with delimiters. Valid values are true and false.



binary_length_prefix_indicators

For Teradata targets, if you set the binary_length_prefix parameter to true, indicates whether to set the indicator bits in the binary output for columns that contain null values. Valid values are:

• true. Set the indicator bits to 1 for null values.

• false. Do not set indicator bits.


Note: This parameter is not available in the Fast Clone Console. To use the parameter, manually enter it in the cloning configuration file.

binary_length_prefix_little_endian

If you set the binary_length_prefix parameter to true, indicates whether to use LITTLE ENDIAN or BIG ENDIAN for the binary output. For Teradata targets, this parameter determines the format of the integer that specifies the length of records and fields in the binary files. Valid values are:

• true. The binary output uses the LITTLE ENDIAN format.

• false. The binary output uses the BIG ENDIAN format.

Default value: true

blocks_per_lob_read

Specifies the number of Oracle data blocks that Fast Clone fetches from LOB columns per read operation.



Default value: 4

blocks_per_read

Specifies the number of Oracle data blocks that Fast Clone fetches per read operation.



Default value: 32

calc_raw_device_offset

Indicates whether Fast Clone skips the offset block in raw devices. Valid values are:

• true. Skip the offset block in raw devices.

• false. Read data from the beginning of the raw device.

To get the total number of bytes that Fast Clone skips in a DF_LGDSK raw device, multiply the raw_device_offset value by the big_raw_device_offset_mul parameter value.

Default value: true

Runtime Parameters 127

characterset

For Teradata targets, indicates whether to convert the source character data to UTF-8. By default, Fast Clone preserves the source database encoding. Valid values are:

• NONE. Preserve the source character set.

• UTF8. Unload the data in the UTF-8 format.

Default value: NONE

checkpoint_each_x_tables

Specifies the number of tables from which Fast Clone unloads data before taking a checkpoint.


Default value: 0. With this default, Fast Clone takes a checkpoint only before it starts unloading source data.

column_list_clause_byname_filtering

Indicates whether Fast Clone unloads columns in the custom order that is specified in the TABLE_ID_COLUMN_BIT_MASK parameter in the [COLUMN_BIT_MASK] section of the configuration file. Valid values are:

• true. Fast Clone unloads columns in the custom order that is specified in the TABLE_ID_COLUMN_BIT_MASK parameter.

• false. Fast Clone unloads columns in the order in which the columns appear in the source database.


column_separator

Specifies a character or character sequence that Fast Clone uses to separate columns in the output data files. Specify a character or character sequence that does not appear in the source data. Otherwise, the target load utility cannot correctly parse the data files. For more information, see “Selecting Column and Row Separator and Enclosure Character to Use for Data Loading” on page 49.

You can specify a column separator that uses the \xhh escape sequence in hexadecimal notation.

Also, you can use the following non-printable ASCII characters as column separators:

• \a (alert)

• \b (backspace)

• \n (new line)




If you use non-printable ASCII characters, ensure that the target load utility supports these characters as separators. For more information, see the target database documentation.

Default value: semicolon (;)

compress_network_traffic

Indicates whether to compress network traffic. Valid values are:

• true. Compress network traffic.

• false. Do not compress network traffic.



compress_on_the_fly

Indicates whether to compress the output data files during unload processing to reduce the file sizes. Valid values are:

• true. Compress the output data files. The compression method is specified in the compression_type parameter.

• false. Do not compress the output data files.


compression_level

Specifies the compression level for the GZIP compression method. Higher values result in faster compression and a lower compression ratio. Lower values result in slower compression and a higher compression ratio. For Fast Clone to use this parameter, the compression_type parameter must be set to GZIP and the compress_on_the_fly parameter must be set to true.


Default value: 1

compression_type

If you set the compress_on_the_fly parameter to true, specifies the compression method that Fast Clone uses to compress the output data files. Valid values are:

• GZIP. Provides a better compression ratio but is slower. You can set the GZIP compression level in the compression_level parameter.

• QZIP. Provides a lower compression ratio but is faster.

Default value: GZIP

consistent_lock_on_table_set

If you use the direct path unload method and set the dirty parameter to false, indicates whether to lock all of the selected tables in a table set before unloading the source data. Valid values are:

• true. Fast Clone locks all of the selected source tables in a table set before unloading the source data. Fast Clone unlocks these tables after unloading data from all of them. Enter this value to get consistent point-in-time data across the selected source tables.

• false. Fast Clone holds a lock on each source table only while unloading data from the table.


control_file_name_base

Specifies the naming pattern that Fast Clone uses to generate names for control files, log files, and SQL scripts. You can use the following variables:

• %&schema&% - Represents a default schema name for the user that Fast Clone uses to connect to the source database.

• %&source_schema&% - Represents a source schema name.

• %&target_schema&% - Represents a target schema name.

• %&table&% - Represents a table name.

• %&table_counter&% - Represents a sequence number of a table.

• %&partition&% - Represents a partition name.The partition variable is required if you specify partitions in the [SOURCE_TABLES] or [SOURCE_INDIRECT_TABLES] sections of the configuration file.


• %&counter&% - Represents the sequence number of a control file for a table. This variable value is always 1.

• %&yyyy&% - Represents a year.

• %&mm&% - Represents a month.

• %&dd&% - Represents a day.

For example:

control_file_name_base=%&source_schema&%.%&table&%.%&partition&%If you do not specify the control_file_name_base parameter, Fast Clone uses the table name as the name for control files, log files, and SQL scripts.

convert_ascii_to_ebcidic

Indicates whether Fast Clone converts source data to the EBCDIC encoding when unloading data to output data files and pipes. Valid values are:

• true. Convert source data to the EBCDIC encoding.

• false. Preserve the original encoding.


convert_numbers_to_comp3_fl_8b

Indicates whether to write numeric data to the output data files as COMP-3 values. Valid values are:

• true. Convert numeric data to COMP-3 in the output data files. Use this value if you need to read the output file with COBOL or another language that has built-in COMP-3 support.

• false. Preserve the original numeric format.


convert_numbers_to_comp3_implied_decimal

The position of the decimal point in the COMP-3 format.


Default value: 0

convert_utf16_clobs_to_utf8

Indicates whether Fast Clone converts CLOB data from UTF-16 to UTF-8 encoding in the output data files. Valid values are:

• true. Convert source character data in CLOB columns from UTF-16 to UTF-8 encoding.

• false. Do not convert source character data in CLOB columns. If you do not use UTF-16 encoding for the source data, set this parameter to false to avoid the overhead of character set conversion.


convert_utf16_nchars_to_utf8

Indicates whether Fast Clone converts NCHAR and NVARCHAR2 data from UTF-16 to UTF-8 encoding in the output data files. Valid values are:

• true. Convert source character data in NCHAR and NVARCHAR2 columns from UTF-16 to UTF-8 encoding.

• false. Do not convert source character data in NCHAR and NVARCHAR2 columns.




convert_utf16_nclobs_to_utf8

Indicates whether Fast Clone converts NCLOB data from UTF-16 to UTF-8 encoding in the output data files. Valid values are:

• true. Convert source character data in NCLOB columns from UTF-16 to UTF-8 encoding.

• false. Do not convert source character data in NCLOB columns.



create_external_table_log_file

For Oracle targets, if you create an SQL script for creating external tables based on the output files, indicates whether to log messages related to this processing to a log file. Valid values are:

• true. The Oracle external table utility logs messages to a log file when accessing data in the output data files. Ensure that the create_external_table_script parameter is also set to true.

• false. The Oracle external table utility does not log these messages.


create_external_table_name_includes_partition

Indicates whether Oracle external table names include partition names. Valid values are:

• true. Add the partition name to the external table name in the following format:

table_partition• false. Do not add the partition name to the external table name.


create_external_table_script

For Oracle targets, indicates whether Fast Clone generates an SQL script to create Oracle external tables based on the output data files. Valid values are:

• true. Create an SQL script to create Oracle external tables based on the output data files.

• false. Do not create an SQL script to create Oracle external tables based on the output data files.


create_load_input_only

Indicates whether Fast Clone creates the load script and control files but skips unloading data from the source database. Valid values are:

• true. Create a load script and the control files. Do not unload data from the source database.

• false. Create a load script and the control files. Unload data from the source database.

Note: Fast Clone does not generate load scripts and control files for flat file, Hadoop, and Hive targets.



create_output_pipe_instead_of_file

Indicates whether Fast Clone creates pipes in the output directory and writes the output data to these pipes. Valid values are:

• true. Fast Clone creates pipes in the output directory.

• false. Fast Clone does not create pipes in the output directory. Use this value to unload data to output data files or to the pipes that you manually created.


date_format

Specifies the date format to use in output files.



DD Day of month




MI Minutes

MM Month

SS Seconds


YYYY 4-digit year

For example, the following format values are valid:

• YYYY-MM-DD HH24:MI:SS

• DDMMYYYY

• YYYY-MM-DD

• HH24:MI:SS

Default value: YYYY-MM-DD HH24:MI:SS

dbsync_config_xml

Specifies the path and file name for the Data Replication configuration file that can be in XML or SQLite format. The path is relative to the DBSYNC_HOME directory.

Important: If you use a Data Replication configuration in SQLite format, do not copy or move the configuration files from the default DataReplication_installation/configs directory to another location.

Default value: config.xml


dbsync_config_xsd

Specifies the path and file name for the Data Replication configuration schema .XSD file. The path is relative to the DBSYNC_HOME directory.

Default value: config.xsd

dbsync_integration

Indicates whether Fast Clone integrates with Informatica Data Replication. Also, when integration is enabled, determines whether Fast Clone allows user sessions that are connected to the source tables to complete processing before locking the tables. Valid values are:

• false. Do not integrate Fast Clone with Data Replication.

• lock. Integrate Fast Clone with Data Replication. Fast Clone waits for all user sessions that are connected to the source tables to complete processing and close before locking the tables for unload processing.

• lock_disconnect. Integrate Fast Clone with Data Replication. Fast Clone forces all user sessions that are connected to the source tables to disconnect and then locks the tables for unload processing.


dbsync_integration_by_direct

If you enable integration with Data Replication, indicates whether to unload data from the source database with the direct path unload method. Valid values are:

• true. Use the direct path unload method.

• false. Use the conventional path unload method.

Default value: true

dbsync_integration_update_scn

Indicates whether to update the Data Replication configuration file with the SCN for each mapped table after Fast Clone unloads the source data. Valid values are true and false.

Important: Set this parameter to true if you plan to use Data Replication to replicate change data and use Fast Clone to perform initial materialization of the replication target tables.

Default value: true

decimal_separator

Specifies the decimal separator that Fast Clone uses in decimal numbers in unloaded data.

Default value: period (.)

default_long_raw_is_hex

For Oracle targets, indicates whether to unload BLOB and LONG RAW source data in hexadecimal format. Set this parameter to true to unload data with these datatypes in hexadecimal format. For other target types, set this parameter to false to avoid generating incorrect control files.


destination_loader_to_use

For Teradata and Vertica targets, specifies the load utility or command that Fast Clone uses to load data to the target. Enter a load utility or command that Fast Clone supports.


Valid values for Teradata loaders:

• teradata_fast_load. Use the Teradata FastLoad utility, or use the TPT Load Operator if using Teradata Parallel Transporter (TPT).

• teradata_multi_load. Use the Teradata MultiLoad utility, or use the TPT Update Operator if using Teradata Parallel Transporter (TPT).

• teradata_parallel_dpump. Use the TPT Stream Operator. In this case, you must use Teradata Parallel Transporter (TPT).

Valid values for Vertica loaders:

• isql_copy. Use this command to load data from the data files that are located on the target database server.

• isql_lcopy. Use this command to load data from the data files that are located on the system where you run the target load utility.


direct_data_stream

For Greenplum, Netezza, Teradata, and Vertica targets, indicates whether Fast Clone uses DataStreamer to load data to the target database.

• For Greenplum targets, DataStreamer uses the Greenplum gpfdist utility.

• For Netezza targets, Fast Clone unloads data to the named pipes that represent the Netezza external tables. DataStreamer uses bulk Insert to load data from the external tables to the corresponding target tables.

• For Teradata targets, DataStreamer send the unloaded data directly to the Teradata Parallel Data Pump, FastLoad, or MultiLoad utility for loading to the target.

• For Vertica targets, DataStreamer uses the COPY command on the server side and the LCOPY command on the client side to send the unloaded data directly to Vertica targets.

Valid values are:

• true. Use DataStreamer to load data to the target database.

• false. Do not use DataStreamer to load data to the target database.


direct_select_from_sys

Indicates whether Fast Clone uses SYS-owned dictionary tables, such as sys.obj$ and sys.tab$, to get extent boundaries. Valid values are:

• true. Use SYS-owned dictionary tables to get extent boundaries.

• false. Use dba_extents and dba_segments views to get extent boundaries.

Note: If you set this parameter to false for Oracle sources that use locally managed tablespaces, retrieval of extent information from the dba_extents view is slow, especially if the dictionary is partially analyzed and the optimizer mode is not RULE. In this situation, Informatica recommends that you set this parameter to true and grant the SELECT ANY DICTIONARY permission to the user.



dirty

Indicates whether to lock the tables that are unloaded with the direct path unload method to perform consistent read operations. Valid values are:

• true. Do not lock the tables before unloading the source data. Use this option if you want to bypass the Oracle consistency mechanism. However, block-level consistency is preserved, because block I/O operations are atomic.

• false. Lock the tables exclusively before unloading the source data.

Default value: true

disable_linux_cache_bypass

For Linux operating systems, indicates whether to use direct I/O mode to read Oracle data files. In direct I/O mode, Fast Clone bypasses operating system caching and reads data directly from disk. Set this parameter to true to use direct I/O mode.


enclose_dates

Indicates whether to use the character that is specified by the optionally_enclosed_by parameter to enclose, or delimit, dates. Set this parameter to true to use this character.


enclose_numbers

Indicates whether to use the character that is specified by the optionally_enclosed_by parameter to enclose, or delimit, numbers. Set this parameter to true to use this character.


encrypt_database_passwords

Indicates whether the password that is specified by the dba_password parameter in the [SOURCE] section of the configuration file is encrypted. Valid values are:

• true. The password is encrypted.

• false. The password is not encrypted.

Default value: true

escape_enclosure

Specifies an escape character that Fast Clone inserts before the character that is specified in the optionally_enclosed_by parameter when the character appears in text columns. If you define this parameter, Fast Clone truncates the optionally_enclosed_by value to a single character.


explicitly_flush_buffer_cache

Indicates whether Fast Clone clears all data from the buffer cache of the Oracle source database before unloading the data with the direct path unload method. Valid values are:

• true. Fast Clone clears all data from the buffer cache by executing the following command on the Oracle source database:

ALTER SYSTEM FLUSH BUFFER_CACHE;• false. Fast Clone does not clear data from the buffer cache.



export_all_tables_for_owner

Indicates whether to unload data from all tables in the specified schema or from only the tables that you specify. If you choose to unload data from all tables in the schema, indicates whether to use the direct path unload or conventional path unload method.

Valid values:

• NONE. Fast Clone unloads data from only the source tables that you specified in the [SOURCE_TABLES] or [SOURCE_INDIRECT_TABLES] section of the configuration file. You must specify at least one table in one of these sections.

• AUTO. Fast Clone unloads data from all tables in the specified schema by using the direct path unload method if possible. If Fast Clone cannot use the direct path unload method, Fast Clone unloads data by using the conventional path unload method.

• ALL_DIRECT. Fast Clone unloads all tables in the specified schema by the using direct path unload method.

• ALL_CONVENTIONAL. Fast Clone unloads all tables in the specified schema by using the conventional path unload.

Default value: NONE

export_binary_to_separate_files

Indicates whether to write data from BLOB, CLOB, NCLOB, LONG, and LONG RAW source columns to separate files. In the regular data files, Fast Clone includes the file name instead of the source value. Specify true to write data from columns with these datatypes to separate files. For Hive targets, Fast Clone always uses the parameter value of false.


Important: If you use DataStreamer to load source data to Netezza targets, set this parameter to false.

fixed_size_characters_by_col_definition

Indicates whether to unload data from character columns in fixed-length format. The fixed length is determined by the column length. If a text value is shorter than the column length, Fast Clone pads the column with spaces up to the full column length. Specify true to unload character data in fixed-length format.


fixed_size_number_by_col_definition

Indicates whether to unload data from NUMBER columns in the fixed-length format. Specify true to unload character data in fixed-length format. The fixed length is defined by the precision of the NUMBER column, without a decimal separator. If a number is shorter than this length, Fast Clone pads the value with zeroes. The following image shows the fixed-length format of a NUMBER column:

For example, when using fixed-length format, Fast Clone unloads the value 123.45 from the NUMBER(10,5) column as +0012345000.



global_date_nvl

Specifies a date value that Fast Clone inserts instead of null values in DATE columns.

Note: You can specify a date value in any format. However, for consistency, use the format that is specified by the date_format parameter.


greenplum_gpfdist_connection_timeout

For Greenplum targets, specifies the time interval, in seconds, that Fast Clone waits for a response from Greenplum when opening a connection to the target. If this interval elapses without a response, Fast Clone times out.


Default value: 55

greenplum_gpfdist_current_hostname

For Greenplum targets, specifies the host name or IP address of the system where you run the Fast Clone instance that starts the gpfdist utility for loading data.


greenplum_gpfdist_listen_hostname

For Greenplum targets, specifies the IP addresses on which the gpfdist utility listens for incoming requests. Use 0.0.0.0 to have the gpfdist utility listen for requests from all interfaces.


greenplum_gpfdist_listen_port

For Greenplum targets, specifies the listener port for the gpfdist utility. The Greenplum target uses this port to submit requests to the gpfdist utility to load data.


Default value: 8000

ip_instead_of_file

If the use_ip_instead_of_file parameter is set to true, specifies the IP address or host name and the port number of the Fast Clone Server to which Fast Clone sends output files. Use the following format for the value:

host_or_ipaddress:portDefault value: No default is provided.

limit_result_set

Specifies the maximum number of rows that Fast Clone unloads from each table. Set this parameter to -1 to unload all rows from a table.


Default value: -1

load_data_to_same_partition_as_source

For Oracle targets, indicates whether SQL*Loader loads data to a table partition that has the same name as the corresponding source table partition. Specify true to load data from a source table partition to a target table partition of the same name.



load_script_base_name

Specifies the file name of the load script. On Windows, Fast Clone generates load script files with the .cmd extension. On Linux and UNIX, Fast Clone generates load script files with the .sh extension.

Default value: The source schema owner name that is specified by the owner parameter in the [SOURCE] section of the cloning configuration file.

local_time_zone

For Teradata targets, specifies the local time zone to use for unloading data from TIMESTAMP WITH LOCAL TIME ZONE columns. Specify the time zone as an offset from the Oracle database time zone:

"+/-HH:MM"The HH variable is a two-digit number from -13 to +13. The MM variable is a two-digit number from 0 to 59.

For example, enter "+08:00".


log

Indicates whether Fast Clone produces an extended log. Set this parameter to true only at the request of Informatica Global Customer Support.


mb_per_file

Defines the maximum amount of data, in megabytes, that a single output data file (.dat file) can contain.


Default value: -1. With this default value, Fast Clone does not enforce a limit on the size of the output data files.

netezza_control_chars_enabled

If you use DataStreamer to load data to a Netezza target, specifies the value of the EscapeChar option for the Netezza external tables. This option indicates whether the control characters, such as delimiter and backslash characters, are escaped in the data fields of the external tables. The Netezza external tables use a backslash as an escape character.

Default value: true

netezza_error_log_directory

If you use DataStreamer to load data to a Netezza target, specifies the directory in which DataStreamer creates the following Netezza log files:

• table_name.schema_name.nzlog. This log file includes load statistics and diagnostic messages that the Netezza ODBC driver issues when loading data to the target tables from the corresponding external tables.

• table_name.schema_name.nzbad. This log file includes the external table rows that DataStreamer could not load to the corresponding target table.

Default value: The output directory


netezza_pipe_directory

If you use DataStreamer to load data to a Netezza target, specifies the directory in which Fast Clone creates named pipes that represent Netezza external tables.

On Windows, DataStreamer ignores this parameter because the named pipes are created in the named pipe directory that is mounted under the special path \\.\pipe\.

Default value: The output directory

netezza_quoted_value

If you use DataStreamer to load data to a Netezza target, specifies the value of the QuotedValue option for the Netezza external tables. DataStreamer requires this option to build correct Insert statements when loading data to the target tables from the corresponding external tables. Options are:

• single. Indicates that the data values in the external tables are enclosed with single quotation marks.

• double. Indicates that the data values in the external tables are enclosed with double quotation marks.

• no. Indicates that the data values in the external tables are not quoted.

Ensure that the quotation character that you specify for the load operation matches the quotation character that you specify for the unload operation. For unload jobs, Fast Clone uses the quotation character that you specify in the Enclosed by field on the Runtime Settings tab > Format Settings view.

Default value: no

nvl_fixed_size_characters_by_space

For character columns that are unloaded in fixed-length format, indicates whether to replace null values with space characters. Specify true to replace nulls with spaces.


ocfs_direct_io

For Oracle sources in an Oracle RAC and Oracle Cluster File System (OCFS) environment on Linux or UNIX, indicates whether Fast Clone uses direct I/O mode to read Oracle data files. Direct I/O mode minimizes the effects of cache on reading data from Oracle data files. Set this parameter to true if the Oracle data files are located in an OCFS.


optionally_enclosed_by

Specifies the character or character sequence that Fast Clone uses to enclose character data in output data files. Specify a character or character sequence that does not appear in the source data. The target load utility cannot correctly parse the data files if the enclosure character is a source data value.

If you use the default value and the source columns contain double-quotation marks (") in the data, specify a different character or character string for this parameter or use the escape character in the escape_enclosure parameter to escape the double-quotation marks in the column data.

Note: If you use the character that is defined by the escape_enclosure parameter to escape the optionally_enclosed_by value in the source data, Fast Clone truncates the optionally_enclosed_by value to a single character.

Default value: Double-quotation mark (")

outdir

Specifies the directory in which Fast Clone creates the output files. The output directory can be on a local file system or on a network mapped share, such as NFS or Samba. For Hive targets, specifies the location of flat files that store table data on the HDFS.


Default value: The FAST_READER_HOME environment variable value.

output_file_extension_base

Specifies the extension that Fast Clone uses for output data files. Fast Clone creates a data file for each source table from which data is unloaded.

Default value: dat

output_file_name_base

Specifies the naming pattern that Fast Clone uses to create the base name of the output data files. In the naming pattern, you can include the following variables:

• %&schema&% - Represents a default schema name for the user that Fast Clone uses to connect to the source database.

• %&source_schema&% - Represents a source schema name.

• %&target_schema&% - Represents a target schema name.

• %&table&% - Represents a table name.

• %&table_counter&% - Represents a sequence number of a table.

• %&partition&% - Represents a table partition name. This partition variable is required if you specify partitions in the [SOURCE_TABLES] or [SOURCE_INDIRECT_TABLES] sections of the configuration file.

• %&counter&% - Represents a sequence number of a data file for a table. This counter variable is required if Fast Clone splits the output data files based on the mb_per_file and split_files_by_row_count parameters.

• %&yyyy&% - Represents a year.

• %&mm&% - Represents a month.

• %&dd&% - Represents a day.

For example:

output_file_name_base=%&source_schema&%.%&table&%.%&partition&%If you do not specify the output_file_name_base parameter, Fast Clone uses the table name as the name for output data files.

output_ocfs_direct_io

For Oracle sources in an Oracle RAC and Cluster File System (OCFS) environment on Linux or UNIX, indicates whether Fast Clone uses direct I/O mode to write output files. Direct I/O mode minimizes the effects of cache on writing the output files. Set this parameter to true if you configured Fast Clone to generate output files in an OCFS.


print_column_names

For flat file targets, indicates whether Fast Clone writes column names in the first row of the output data files. Specify true to write column names in the first row.


Note: This parameter is available for all target types when the unload_data_only parameter is set to true.

raw_device_offset

The size of a single offset block, in bytes, in a raw device. Fast Clone skips the offset block if you set the calc_raw_device_offset parameter to true. To get the total number of bytes that Fast Clone skips on DF_LGDSK raw devices, multiply the raw_device_offset value by the big_raw_device_offset_mul parameter value.


Default value: 0 for Linux, 4096 for other operating systems

read_securefile_by_direct

For Oracle 11g and Oracle 12c sources, enables Fast Clone to unload data with the direct path unload method from columns that have SecureFile LOB datatypes. The read_securefile_by_direct parameter also enables Fast Clone to unload data with the direct path unload method for the extended NVARCHAR2, RAW, and VARCHAR2 datatypes that were introduced in Oracle 12c. Valid values are:

• true. Fast Clone reads SecureFile LOB and extended datatypes with the direct path unload method.

• false. Fast Clone reads SecureFile LOB and extended datatypes with the conventional path unload method.

Default value: true

record_separator

A character or character sequence that Fast Clone uses to separate table rows in the output data files. Specify a character or character sequence that does not appear in the source data. Otherwise, the target load utility cannot correctly parse the data files. For more information, see “Selecting Column and Row Separator and Enclosure Character to Use for Data Loading” on page 49.

You can specify a row separator that uses the \xhh escape sequence in hexadecimal notation.


• \a (alert)

• \b (backspace)

• \n (new line)




If you use non-printable ASCII characters in row separators, ensure that the target load utility supports these separators. For more information, see target database documentation.

Default value: \n

replace_new_line_with_space

Indicates whether to replace new line (\n) character in unloaded data with space characters. Specify true if you use the new line character as the row separator in output files and the source character data includes new line characters. However, with a value of true, unload processing performance might be degraded.


right_pad_columns

Indicates whether to pad columns on the right with spaces when the length of a column value is less than the full length of the column. Specify true to enable padding.


show_increments_rows

The number of rows that Fast Clone unloads with the direct path unload method before writing updated progress information for the unload job to the system console or command prompt window.




skip_all_out_file_ops

If you use DataStreamer to stream data to Greenplum or Teradata targets, Fast Clone skips writing data to output files. If you do not use DataStreamer and need to troubleshoot unload jobs, you can set this parameter to true to write the data to output files.

Default value: The direct_data_stream parameter value.

skip_unreadable_block

Indicates whether Fast Clone skips blocks that it cannot read from Oracle files when unloading data with the direct path unload method. Valid values are:

• true. Fast Clone skips the unreadable blocks and continues unload processing.

• false. Fast Clone ends with an error.

Default value: true

split_files_by_row_count

The maximum number of rows that Fast Clone writes to an output data file. If Fast Clone needs to unload a greater number of rows from the source table, Fast Clone creates multiple output data files for the table. The file names have the following format:

table_name_file_number.datSpecify this parameter only if you use the direct path unload method.

Note: For parallel unload processing, splitting output into multiple data files based on a number of rows is slower than splitting output based on a chunk size. This performance degradation occurs because of locks between the parallel threads.


Default value: -1. With this default value, Fast Clone does not create multiple output data files based on row count.

suppress_trailing_nullcols

Indicates whether Fast Clone includes trailing columns that contain null values in the output data files. Valid values are:

• true. Do not include trailing columns that contain null values.

• false. Include trailing columns that contain null values.

Default value: true

If the trailing_column_separator parameter is set to false, Fast Cone always includes trailing null columns, regardless of the suppress_trailing_nullcols parameter setting.

Important: If you use DataStreamer to load data to an Amazon Redshift, Greenplum, Netezza, Teradata, or Vertica target, set this parameter to false. If you create the configuration file in a text editor, set the suppress_trailing_nullcolls parameter to false.

teradata_downgrade_to_multiload_if_table_not_empty

For Teradata targets, indicates whether DataStreamer tries to use the TPT Update Operator if the TPT Load Operator fails to load source data to target tables. Valid values are:

• true. DataStreamer tries to use the TPT Update Operator if the TPT Load Operator returns an error.

• false. DataStreamer does not try to use the TPT Update Operator if the TPT Load Operator returns an error.


Default value: true

If you do not use DataStreamer, this parameter is ignored.

teradata_drop_rebuild_tables

For Teradata targets, indicates whether DataStreamer drops and re-creates tables in the target database before it loads the source data. Valid values are: true and false.

Default: false

teradata_load_sessions

For Teradata targets, specifies the number of sessions that the Teradata FastLoad or MultiLoad utility can open to load the source data to each target table.

Default value: 2

teradata_loader_to_use

For Teradata targets, the Teradata load utility that Fast Clone generated load scripts call to load source data to the target. Valid values are:

• teradata_fast_load. Use the FastLoad utility, or use the TPT Load Operator if using the Teradata Parallel Transporter (TPT).

• teradata_multi_load. Use the MultiLoad utility, or use the TPT Update Operator if using the Teradata Parallel Transporter (TPT).

• teradata_parallel_dpump. Use the TPT Stream Operator. In this case, you must use the Teradata Parallel Transporter (TPT).


Note: This parameter is deprecated. Use the destination_loader_to_use parameter instead.

threads_num

Specifies the maximum number of tables or table partitions that Fast Clone can unload in parallel with the direct path unload method.


Default value: 1

threads_per_segment_num

Specifies the maximum number of threads that Fast Clone can use to unload data from each table or table partition with the direct path unload method.

Note: Fast Clone might use fewer threads. Fast Clone does not create a thread to unload a chunk of less than 100 data blocks.


Default value: 1

timestamp_format

Specifies the timestamp format in the output.




DD Day of month




MI Minutes

MM Month

SS Seconds


YYYY 4-digit year

For example:

• YYYY-MM-DD HH24:MI:SS.FF6

• YYYY-MM-DD HH24:MI:SS

• DDMMYYYY

• YYYY-MM-DD

• HH24:MI:SS

Note: For the conventional path unload method, Fast Clone supports all timestamp formats.

Default value: YYYY-MM-DD HH24:MI:SS.FF6

trailing_column_separator

Indicates whether to add a column separator after the last column value in each row in the output. Valid values are:

• true. Add a column separator after the last column value.

• false. Do not add a column separator after the last column value.

Note: Fast Clone forces this parameter to false for Oracle targets if the unloaded source tables contain LOB columns and the export_binary_to_separate_file parameter is set to true.

Default value: true

Important: If you use DataStreamer to load data to a Greenplum, Netezza, Teradata, or Vertica target, make sure this parameter is set to true.

truncate_numeric_precision

Specifies the number of significant digits to use for rounding down the source numeric values in the output. This parameter truncates only the fractional part of a number.



Default value: -1. Fast Clone does not round down the source numeric values.

unload_data_only

Indicates whether to unload source data only and skip the creation of control files and the load script. Valid values are:

• true. Do not create control files and a data load script when Fast Clone unloads source data to .dat files.

• false. Create control files and a data load script when Fast Clone unloads source data to .dat files.


For flat file, Hadoop, and Hive targets, Fast Clone always uses the parameter value of true.

use_asm_global_cache

For Oracle ASM sources, indicates whether Fast Clone caches ASM AUNs in memory. Valid values are true and false.

Default value: true

use_dbms_diskgroup_instead_of_direct

Indicates whether Fast Clone uses the DBMS_DISKGROUP package to read data from ASM sources. Typically, use of the DBMS_DISKGROUP package is significantly slower than reading data directly from ASM sources because the package has a PL/SQL limit of 32,600 bytes per read.

However, if you use the conventional path method and set this parameter to true, Fast Clone unloads data from ASM by using multiple threads. Fast Clone can use multiple threads from a remote computer, with an even data distribution across the threads.

Valid values are true and false.


use_ip_instead_of_file

Indicates whether Fast Clone sends the source data to the Fast Clone Server that is defined by the ip_instead_of_file parameter. Valid values are true and false.


use_sid_instead_of_service

Indicates whether to use the Oracle SID instead of the service name when you cannot connect to the source instance by using the instance name. Valid values are true and false.


use_ssl_connection

For Amazon Redshift and Greenplum targets, the protocol that Fast Clone uses to connect to the target database. Valid values are:

• true. Use the TCP/IP protocol.

• false. Use the TCP/IP protocol with the Secure Sockets Layer (SSL).


use_teradata_parallel_transporter

For Teradata targets, indicates whether Fast Clone uses a Teradata Parallel Transporter (TPT) operator to load the source data to the target database. Valid values are true and false.



validate_block_scn

Determines whether to write a warning message to a log file when the SCN value of an unloaded data block is greater than the Start SCN value. The Start SCN value corresponds to point immediately after the checkpoint when Fast Clone started unloading data. Fast Clone compares these values to detect possible data inconsistencies in the unloaded data. Valid values are:

• true . Write a warning message to a log file when a block SCN value is greater than the Start SCN value.

• false. Do not write the warning message.


Note: To use the parameter, you must manually enter it in the cloning configuration file. You cannot enter it at the command line or in the Fast Clone Console.

In the [INDIRECT] section of the configuration file, you can enter the following runtime parameters for the conventional path method:

blob_fetch_chunk_size_for_indirect_kb

Specifies the amount of data that Fast Clone can unload from LOB columns at one time.


Default value: 128

enforce_oci9_limitations

Indicates whether Fast Clone unloads a single row at a time when a source table contains RAW columns. Valid values are:

• true. Force the fetch_array_size_for_indirect parameter to 1 if a source table contains RAW columns.

• false. Always use the fetch_array_size_for_indirect parameter.

Default value: true

fetch_array_size_for_indirect

Specifies the number of rows that Fast Clone can unload from the Oracle source database at one time.


Default value: 500

fetch_blob_as_single_chunk_for_indirect

Indicates whether to unload a single chunk of LOB data that has the size specified by the fetch_blob_as_single_chunk_for_indirect_kb parameter. Valid values are:

• true. Unload only a single chunk of LOB data.

• false. Unload all LOB data in multiple chunks.


fetch_blob_as_single_chunk_for_indirect_kb

Specifies the maximum amount of data that Fast Clone unloads from a LOB column if the fetch_blob_as_single_chunk_for_indirect parameter is set to true.


Default value: 128

oracle_use_direct_file_access_to_fetch_extent_map

Indicates whether to read extent information for parallel unload processing directly from the data file. Specify true to read extent information from the data file.



Note: In the Fast Clone Console, the corresponding Use direct file access to fetch extent map option is selected (true) by default.

statement_unload_threads_num

Specifies the maximum number of tables or table partitions that Fast Clone can unload in parallel with the conventional path unload method.


Default value: 1

statement_unload_threads_per_segment_num

Specifies the maximum number of threads that Fast Clone can use to unload data from each table or partition with the conventional path method.

Note: If you use multiple threads for a single table, CPU usage might increase. Usually, multiple threads are used to unload data from large tables. Fast Clone might use fewer threads than this parameter defines because there no reason to create a thread to export a chunk of less than 100 data blocks.


Default value: 1

Character Replacement ParameterIn the [CHARACTER_REPLACEMENT] section of the configuration file, specify character replacement rules.

By default the [CHARACTER_REPLACEMENT] section is empty. You can use the following parameter to define a replacement rule:

replacement_rule_id="CharacterToBeReplaced_NewCharacter"

Ensure that each replacement_rule_id value in the [CHARACTER_REPLACEMENT] section is unique. For the value, use the following syntax: CharacterToBeReplaced_NewCharacter

Important: The character to be replaced and the new character must be separated with a single underscore.

For example:

replace1="a_b"Tip: To remove the CharacterToBeReplaced character from source data, set NewCharacter to an empty value. For example, to remove the tab character (\t), define replace99="\t_".

You can use hexadecimal notation in the CharacterToBeReplaced or NewCharacter value in a character replacement rule. If you want to replace a backslash (\) character, you must use hexadecimal notation.

Restriction: You must not use multibyte characters in the CharacterToBeReplaced or NewCharacter value in a character replacement rule.


• \a (alert)

• \b (backspace)

• \n (new line)

Character Replacement Parameter 147




Reporting ParametersIn the [REPORTING] section of the configuration file, specify reporting parameters.


create_run_report

Indicates whether to create an XML report file in the output directory. The report file contains the number of unloaded rows and unload failures for each table. Valid values are true and false.


job_name

Specifies the job name that Fast Clone uses as a base for the XML report file name. Fast Clone also uses the job name in the SNMP trap text.

Default value: FastReader_Extract

Note: The Fast Clone Console sets this parameter to an empty string.

manual_report_file_name

Indicates whether to use the path and file name that are specified by the report_file parameter to generate the XML report file. Valid values are:

• true. Use the path and file name that are specified by the report_file parameter.

• false. Use the file name base that is specified by the job_name parameter.


report_file

Specifies the path and file name to use for the XML report file.

report_on_begin

Indicates whether to send an SNMP trap to an SNMP manager when Fast Clone starts unloading the source data. Valid values are true and false.


report_on_end

Indicates whether to send an SNMP trap to an SNMP manager when Fast Clone finishes unloading the source data. Valid values are true and false.


send_snmp_trap

Indicates whether to send SNMP traps. Valid values are true and false.


snmp_community

Specifies the Fast Clone SNMP community to which Fast Clone sends SNMP traps.


Default value: Informatica

snmp_enterprise_oid

Specifies the Fast Clone enterprise object identifier (OID).

Default value: 1.3.6.1.2.1.1.1.2.0.1

snmp_monitor_ip

Specifies the SNMP manager IP address.

Default value: 127.0.0.1

snmp_trap_begin_detail_oid

Specifies the Fast Clone SNMP trap detail OID for the trap that Fast Clone sends when it starts unloading the source data.

Default value: 1.3.6.1.2.1.1.1.2.0.3

snmp_trap_end_detail_oid

Specifies the Fast Clone SNMP trap detail OID for the trap that Fast Clone sends when it finishes unloading the source data.

Default value: 1.3.6.1.2.1.1.1.2.0.4

snmp_trap_oid

Specifies the Fast Clone SNMP trap OID.

Default value: 1.3.6.1.2.1.1.1.2.0.2

Transparent Data Encryption ParametersIn the [MKEY_ID] and [MKEY_VALUES] sections of the configuration file, specify master key IDs and values for Oracle sources that use Transparent Data Encryption (TDE).

Important: Fast Clone uses these parameters only with the direct path unload method. For the conventional path unload method, ensure that the Oracle wallet is open before you start unloading the source data.

Master Key IDs

The [MKEY_ID] section contains a list of master key IDs from the Oracle wallet. Use the following parameter to specify a master key ID:

master_key_id

A master key ID. Each master_key_id value in the configuration file must be a unique key.

Add all of the master key IDs, except for ORACLE.SECURITY.DB.ENCRYPTION.MASTERKEY, to the [MKEY_ID] section. For example:

KEY1=AX66JYiXI0/Kv0BpzPZ31PsAAAAAAAAAAAAAAAAAAAAAAAAAAAAAKEY2=BVRVfkdd3wD3uMD23F8L2tECAwAAAAAAAAAAAAAAAAAAAAAAAAAA

Tip: To get a list of master key IDs from an Oracle wallet, run the following command in the console:

orapki wallet display -wallet wallet_file_nameThe Oracle orapki utility is located in the ORACLE_HOME/bin directory.

Transparent Data Encryption Parameters 149

Master Key Values

The [MKEY_VALUES] section contains a list of master key values from the Oracle wallet. Use the following parameter to specify a master key value:

master_key_id

The master key value of the master key ID that is defined by the matching master_key_id key in the [MKEY_ID] section.

Add master key values for all master key IDs that are specified in the [MKEY_ID] section. For example:

KEY1=AEMAASAATegG3c24WbhSe7nCMuv8qObLm9F+QKpgNGfvqj8ewdIDEAAlV+AR6hr9rRxzIBe7JZYsBQcAeHALAQ8VDQ==KEY2=AEMAASAAhe+zE9L3EmcIxfTq35K9fyvFZUdyWaYu/fhl0hgt/DgDEAAmyAKGUoutcRMHnfl/xGdABQcAeHALAQ8VDQ==

Tip: To get a master key value from an Oracle wallet, run the following command in the console:

mkstore -wrl wallet_file_name -viewEntry master_key_IDThe Oracle mkstore utility is located in the ORACLE_HOME/bin directory.


A p p e n d i x C

Glossarycloning configuration file

A text file that contains all of the parameters for a data-cloning job. Informatica recommends that you use the Fast Clone Console to create and manage cloning configuration files to reduce the risk of error. Alternatively, you can create and edit configuration files in a text editor.

conventional path unload method

Fast Clone unload method that uses the OCI to retrieve metadata and data from the Oracle source. This method usually provides an acceptable and efficient level of performance but is slower than the direct path unload method. With the conventional path unload method, you can run Fast Clone on a system that is remote from the source database server without any special permissions. You can also unload data from Oracle views and cluster tables, and perform table JOIN operations. See also direct path unload method on page 151.

data cloning

The general process of copying source data to the target database. This process includes unloading source data to output files or pipes and loading the data from the output files to a target database by using the target load utility. If you use the DataStreamer component, this process includes unloading source data and streaming it directly to a target database by using the target load utility.

data files

Files to which Fast Clone writes the unloaded source data. Fast Clone generates the data files in the output directory when you run unload jobs that write data to files. You can customize the format of the data files, specify row and column separators to use in the files, and an enclosure character for record fields. By default, Fast Clone names data files by using the source table or table partition name with the .dat extension. If you configured Fast Clone to unload LOB data to separate files, Fast Clone also generates separate data files for storing LOB data. Fast Clone names these data files based on the record identifier for the record that contains the LOB data.

DataStreamer

An optional add-on component that you can purchase for faster loads of data to Greenplum, Netezza, and Teradata targets. DataStreamer streams the unloaded Oracle data directly to the Greenplum parallel file distribution server (gpfdist), to the Netezza external tables, or to the Teradata Parallel Data Pump, FastLoad, or MultiLoad utility for loading to the targets. You must use the direct path unload method. Because DataStreamer does not use Fast Clone data files, you do not need to reserve storage for these files.

direct path unload method

Fast Clone unload method that reads data directly from Oracle data files. This method provides high-performance unloads of data without increasing overhead on the source system. However, you cannot use

this method to unload data from a remote Oracle database server, perform SQL JOIN operations, or unload data from views or cluster tables. See also conventional path unload method on page 151.

Fast Clone Console

A graphical user interface from which you configure and manage data-cloning processes. From the Fast Clone Console, you can generate a cloning configuration file and run unload jobs on a local or remote system. You can run the Fast Clone Console on the Oracle source system, a target system, or a standalone system.

Fast Clone executable

The executable that runs unload jobs. The executable file name is FastReader.exe on Windows or FastReader on Linux or UNIX, . You can start the Fast Clone executable from the command line or from the Fast Clone Console. You can also use the Fast Clone executable with the RUN_AS_SERVICE=true option to run the Fast Clone Server.

Note: For Teradata targets, use FastReaderTD.exe file on Windows or FastReaderTD on Linux or UNIX.

Fast Clone Server

An optional add-on component that you can purchase to enable network communication across systems in a distributed Fast Clone topology. The Fast Clone Server runs as a Windows service or as a daemon on Linux or UNIX. You can run the Fast Clone Server on an Oracle source system to initiate data unload jobs from a remote system. You can also run the Fast Clone Server on the target system to accept output files from the source system.

Fast Clone Server configuration file

A configuration file for the Fast Clone Server. This file contains parameters that control how the Fast Clone Server runs. When you start the Fast Clone Server from the command line, you enter the SERVER_CONFIG parameter to specify the server configuration file. Fast Clone provides a predefined server configuration file, server.ini, in the top-level Fast Clone installation directory. You can edit this file to override the default Fast Clone Server settings.

load script

A .cmd script on Windows or .sh script on Linux and UNIX that you run to load data to a target. Fast Clone generates the load script in the output directory when you unload data to the data files or pipes. The load script invokes the target load utility and specifies the data files or pipes from which to get data. Typically, you run this script on the target system to load data to the target database. You can also run the load script on another system where the target load utility is installed. By default, Fast Clone names the load script based on the source owner or schema name. See also output files on page 152.

output files

Files that the Fast Clone executable generates in the output directory when you unload data to data files or pipes. These files include the load script and data files. Optionally, these files can also include control files, SQL scripts, and format files for the target load utility. When you run the load script, Fast Clone invokes the target load utility, and the load utility uses the other output files to load data to the target. See also load script on page 152 and data files on page 151.

remote configuration management

An option in the Fast Clone Console that you can use to manage cloning configuration files that are stored on remote systems. After you enabled remote configuration management in the Fast Clone Console, you can open, edit, and save the cloning configuration files on a remote system by using FTP.

152 Glossary

server configuration file

See Fast Clone Server configuration file on page 152.

unload job

The process of unloading data from an Oracle source. To unload the source data, Fast Clone either reads the Oracle data files directly or uses the OCI to retrieve data from an Oracle source.

unload script

The unload.cmd script on Windows or unload.sh script on Linux and UNIX that Fast Clone provides in the top-level installation directory. This script accepts a path to the cloning configuration file as a command line parameter and invokes the Fast Clone executable with this configuration file for data unload processing.

Appendix C: Glossary 153

I n d e x

AAmazon Redshift targets

Amazon S3 load settings 62defining in replication configurations 42

Amazon S3 load settings Amazon Redshift targets 62

Automatic Storage Management (ASM) connecting to an Oracle ASM instance 28

Ccharacter replacement rules

defining rules for replacing characters in source data 48character set conversion 73cloning configuration files

changing the column order in output files 37character replacement parameter 147configuring an Oracle ASM connection 28configuring Fast Clone Server tasks in the Console 60configuring miscellaneous conditions 54configuring parallel processing for direct path unloads 50configuring reporting and SNMP traps 69configuring runtime settings 43configuring target load settings in the Console 62configuring the path and file name 51customizing unload processing for table partitions 33DATE and TIMESTAMP format parameters 121defining ASCII flat files as targets 42defining character replacement rules in the Console 48defining HDFS targets in the Fast Clone Console 42defining SQL queries for unload processing in the Console 31defining the source database in the Console 26defining the target database in the Console 38emote management overview 91enabling remote management 91Fast Clone integration with Data Replication 60manually editing 116opening from a remote system 92Oracle Transparent Data Encryption parameters 149overview 23reporting parameters 148runtime parameters 126saving a file to a remote system 93saving in the Fast Clone Console 70selecting source tables and materialized views 30source columns 119source database parameters 116source tables 118SQL queries 119target database parameters 123WHERE conditions 120

column separator for data load processing 49

command line example 113overview 95runtime parameters 99source database parameters 96source table and column parameters 97syntax 95target database parameters 98

configuration files, cloning changing the column order in output files 37character replacement parameter 147configuring an Oracle ASM connection 28configuring Fast Clone Server tasks in the Console 60configuring miscellaneous conditions 54configuring parallel processing for direct path unloads 50configuring reporting and SNMP traps 69configuring runtime settings 43configuring target load settings in the Console 62configuring the path and file name 51customizing unload processing for table partitions 33DATE and TIMESTAMP format parameters 121defining ASCII flat files as targets 42defining character replacement rules in the Console 48defining HDFS targets in the Fast Clone Console 42defining SQL queries for unload processing in the Console 31defining the source database in the Console 26defining the target database in the Console 38enabling remote management 91Fast Clone integration with Data Replication 60manually editing 116opening from a remote system 92Oracle Transparent Data Encryption parameters 149overview 23remote management overview 91reporting parameters 148runtime parameters 126saving a file to a remote system 93saving in the Fast Clone Console 70selecting source tables and materialized views 30source columns 119source database parameters 116source tables 118SQL queries 119target database parameters 123WHERE conditions 120

configuration files, Fast Clone Server parameters 19

console See Fast Clone Console custom date and timestamp formats

specifying for a source column 36

Ddata load processing

disabling or dropping target constraints 88

154

data load processing (continued)enabling or creating target constraints 90overview 81running the load script 89

Data Replication integration with Fast Clone 60use with Fast Clone 18

data unload processing character set of the unloaded data 73description of unload methods 11excluding source columns from unload processing 34overview 72parallel processing for direct path unloads 50pending unloads to named pipes 76performance tuning for conventional path unloads 58running data unload jobs 75selecting source tables and materialized views 30unloading data from encrypted columns and tablespaces 74unloading data from Oracle views 74unloading data to fixed-length record format files 79unloading data to flat files 42unloading data to manually created pipes 77unloading data to named pipes 75unloading data to pipes generated by Fast Clone 76user-defined SQL queries 31using a separate output file for each partition 33

data unloading processing generating output data files in XML format 79using WHERE clauses to filter source data 35

DataStreamer overview 11

date format configuring custom format for a source column 36

direct path unload method configuring parallel processing for direct path unloads 50

Eenclosed by character

for data load processing 49encrypted source columns

opening the Oracle wallet 70encrypted source data

unloading data from TDE-encrypted columns and tablespaces 74

FFast Clone Console

creating configuration files in the Console 23starting 24toolbar buttons 24

Fast Clone executable overview 10

Fast Clone Server configuration file 19configuring server tasks in the Fast Clone Console 60installing and uninstalling on Windows 22overview 10, 19starting 22stopping 22

file names and locations configuring in the Fast Clone Console 51

flat file targets defining flat file targets in the Fast Clone Console 42

GGreenplum targets

load settings 63

HHadoop targets

defining in the Fast Clone Console 42HDFS targets

defining in the Fast Clone Console 42

Lload processing

column and row separators in data 49load script

configuring the base file name in the Console 51load settings

configuring target load settings in the Console 62Greenplum targets 63Microsoft SQL Server targets 64Oracle targets 66Teradata targets 67Vertica targets 68

loading data overview 81

Mmaterialized views

selecting materialized views for unload processing 30Microsoft SQL Server targets

load settings 64miscellaneous conditions

configuring in the Fast Clone Console 54mutithreaded processing

for conventional path unload processing 58

Nnamed pipes

scenarios that cause pending cloning processes 76unloading data to generated pipes 76unloading data to manually created pipes 77unloading data to named pipes 75

Netezza targets load settings 65

OOracle sources

configuring an Oracle ASM connection 28defining the source database in the Fast Clone Console 26

Oracle targets load settings 66

Oracle views unloading data from 74

Oracle wallet opening 70

output data files generating in XML format 79

Index 155

output files changing column order 37configuring the directory and file name extension 51format settings for load processing 44

Ppartitioned source tables

excluding partitions from unload processing 33using a separate output file for each partition 33

pipes scenarios that cause pending cloning processes 76unloading data to generated pipes 76unloading data to manually created pipes 77unloading data to named pipes 75

Rreplacement rules

defining rules for replacing characters in source data 48reporting

configuring in the Fast Clone Console 69row separator

for data load processing 49runtime settings

configuring in the Fast Clone Console 43

Ssaving configuration files

in the Fast Clone Console 70SNMP traps

configuring in the Fast Clone Console 69source columns

excluding from unload processing 34specifying a custom date or timestamp format 36

source database defining in the Fast Clone Console 26

source tables selecting tables for unload processing 30

SQL queries defining for unload processing in the Console 31

starting the Fast Clone Console 24

Ttarget tables

generating based on source table schema 86generating in the Fast Clone Console 86validating table schema 88

targets configuration considerations 84defining flat file targets in the Fast Clone Console 42

targets (continued)defining HDFS targets in the Fast Clone Console 42defining the target database in the Fast Clone Console 38supported targets 9

Teradata targets load settings 67

timestamp format configuring custom format for a source column 36

toolbar buttons in the Fast Clone Console 24

topologies 15Transparent Data Encryption (TDE)

unloading data from encrypted columns and tablespaces 74

Uunload processing

character set of the unloaded data 73description of unload methods 11excluding source columns from unload processing 34

output files unloading data to fixed-length format files 79

overview 72parallel processing for direct path unloads 50pending unloads to named pipes 76performance tuning for conventional path unloads 58running data unload jobs 75selecting source tables and materialized views 30unloading data from encrypted columns and tablespaces 74unloading data from Oracle views 74unloading data to fixed-length record format files 79unloading data to flat files 42unloading data to manually created pipes 77unloading data to named pipes 75unloading data to pipes generated by Fast Clone 76user-defined SQL queries 31using a separate output file for each partition 33

unload script configuring the path and file name in the Console 51

unloading processing generating output data files in XML format 79using WHERE clauses to filter source data 35

usage scenarios 9

VVertica targets

load settings 68

WWHERE clauses

filtering source column data 35

156 Index

U s e r G u i d e - Informatica · 2020-01-04 · Preface This I n f o r m a t i c a F a s t C l o...

Documents

Transcript of U s e r G u i d e - Informatica · 2020-01-04 · Preface This I n f o r m a t i c a F a s t C l o...