Enterprise Data Catalog Scanner Configuration Guide > Configuring Data Integration Resources > PowerCenter
  

PowerCenter

You can configure a PowerCenter resource type to extract metadata from PowerCenter repository objects. Use PowerCenter to extract data from multiple sources, transform the data according to business logic you build in the client application, and load the transformed data into file and relational targets.

Objects Extracted

Enterprise Data Catalog extracts mapping metadata from the PowerCenter source.

Permissions to Configure the Resource

Prerequisites

If the domain is SSL-enabled and cluster is Kerberos-enabled, perform the following steps:

Connecting to an SSL-enabled PowerCenter Resource

Perform the following steps before you create the PowerCenter resource if you want to connect to an SSL-enabled PowerCenter data source:
  1. 1. Disable the Catalog Service.
  2. 2. Run the following command to export the certificates from the domain truststore of PowerCenter:
    1. a. $INFAINSTALL/java/bin/keytool -export -keystore
    2. b. $INFAINSTALL/services/shared/security/infa_truststore.jks -alias <alias_name> -file
    3. c. $INFAINSTALL/services/shared/security/certExportFromSSLDomainTruststore.cert
  3. 3. On the node where Enterprise Data Catalog runs, run the following command to import the certificate into the Informatica domain truststore:
    1. a. $LDMINSTALL/java/bin/keytool -import -file certExportFromSSLDomainTruststore.cert -alias <new_alias_name>-keystore
    2. b. $LDMINSTALL/services/shared/security/infa_truststore.jks -storepass pass2038@infaSSL
    Note: Make sure that you use a new alias name when you import the certificate.
  4. 4. Copy the truststore and the keystore files from the PowerCenter domain to the Informatica domain truststore.
  5. 5. Run the following commands to configure the environment variables:
    1. a. Edit the infaservice.sh script on the Informatica domain using the following command: $INFAINSTALL/tomcat/bin/infaservice.sh
    2. b. In the script, add the following line to set the environment variable: INFA_TRUSTSTORE=<the location where you copied the truststore and keystore files.> Ensure that you add the line after the following command in the script: unset INFA_TRUSTSTORE INFA_TRUSTSTORE_PASSWORD INFA_KEYSTORE INFA_KEYSTORE_PASSWORD
    3. c. For a Hortonworks cluster, open the YARN application and set the INFA_TRUSTSTORE=<the location where you copied the truststore and keystore files.> variable in the advanced YARN environment variables section, and restart YARN.
    4. d. For a Cloudera cluster, edit the yarn-env.sh script and set the INFA_TRUSTSTORE=<the location where you copied the truststore and keystore files.> variable, and restart YARN.
  6. 6. Recycle the Catalog Service.

Parameter Files

If a PowerCenter source repository uses parameter files in sessions and workflows, you can configure Catalog Administrator to read the parameter files when you create the PowerCenter resource. A parameter file is a .zip file that contains all the parameters and variables and their associated values configured for workflows, worklets, or sessions in PowerCenter repository. PowerCenter parameters can represent flat file sources, flat file lookups, flat file targets, relational connections, expressions at the transformation level, or objects in SQL overrides. The PowerCenter resource parses the parameter files and substitutes the parameter values to extract metadata for the flat file sources, flat file lookups, flat file targets, relational connections, and objects in SQL overrides.
Generating a parameter file archive
You can create a parameter file archive manually within a directory or use the parameter file utility to create the archive. The parameter file utility converts PowerCenter files into a format that the Enterprise Data Catalog PowerCenter resource can parse. You must run the parameter file utility before you run the PowerCenter resource.
Note: Before you run the parameter file utility, verify that you have set the JAVA_HOME variable.
Running the parameter file utility
  1. 1. Download the parameter file utility from the following location: <Informatica installation directory>/services/CatalogService/ScannerBinaries/PCParamFileUtility 
  2. 2. Copy the parameter file utility to the machine where PowerCenter runs.
  3. 3. Encrypt the Informatica domain and PowerCenter passwords:
  4. 4. Navigate to the following location: <Informatica installation directory>/services/CatalogService/ScannerBinaries/PCParamFileUtility/config
  5. 5. In the config.properties file, enter values for the following parameters:
  6. Parameter
    Description
    domain.jdbc.driver=
    Informatica domain driver.
    domain.jdbc.url=
    Informatica domain URL.
    domain.jdbc.user=
    Informatica domain user name.
    domain.jdbc.password=
    Informatica domain encrypted password.
    pcrs.jdbc.driver=
    PowerCenter driver.
    pcrs.jdbc.url=
    PowerCenter URL.
    pcrs.jdbc.user=
    PowerCenter user name.
    pcrs.jdbc.password=
    PowerCenter encrypted password.
    integration.service=
    Integration service name.
    Note: That value can be empty If you do not have any integration service related to the corresponding PowerCenter service.
  7. 6. On the machine where PowerCenter runs, depending on your operation system, perform one of the following actions:
The utility generates a .zip file that contains the converted files. Upload the file to the PowerCenter resource before you run it.
Creating the parameter files manually within a directory
  1. 1. Create a directory with the same name as the directory configured for the PowerCenter repository in the PowerCenter repository.
  2. 2. Create parameter files named as <workflow name>.prm.
  3. 3. Place all the parameter definitions that are needed for the workflow and the corresponding sessions in the directory you created.
  4. 4. Zip the folder. The zip file can contain multiple folders containing parameter files.
Create multiple parameter files
Create parameter files named as <PowerCenter folder name>.<workflow name>.prm. Each parameter file applies to the workflow with which it is named.
Create a single parameter file
Create a single parameter file named as parameters.prm, and then zip the parameter file. The parameter file must contain the parameters that apply to all the workflows within the PowerCenter folders.

Parameter File Requirements

Use a text editor to create the parameter file. To enable the PowerCenter resource to read parameter values from a parameter file, the file must have a .prm extension. You must group parameters and variables in different sections of the parameter file. Each section is preceded by a heading that identifies the folder, workflow, worklet, and session to which you want to pass parameter or variable values. You define parameters and variables directly below the heading, entering each parameter or variable on a new line.
The following table describes the headings that define each section in the parameter file and the scope of the parameters and variables that you define in each section:
Heading
Scope
[Global]
All folders, workflows, worklets, and sessions.
[folder name.WF:workflow name]
The named workflow and all sessions within the workflow.
[folder name.WF:workflow name.WT:worklet name]
The named worklet and all sessions within the worklet.
[folder name.WF:workflow name.WT:worklet name.WT:worklet name...]
The nested worklet and all sessions within the nested worklet.
[folder name.WF:workflow name.ST:session name]
-or-
[folder name.WF:workflow name.WT:worklet name.ST:session name]
-or-
[folder name.WF:workflow name.WT:worklet name.WT:worklet name.ST:session name]
-or-
[folder name.session name]
-or-
[session name]
The named session.

Sample Parameter File

The following example shows a sample parameters.prm file:
[Map_Param.WF:WF_Src_Tgt_map_param_case.ST:s_src_tgt_tbl_override_default_map_param]
$$Src_OwnName=MM_PERF6
$$Src_TblName=TBL_SAME_COL
$$Tgt_Tbl_Prefix=MM_PERF6
$$Tgt_TblName=INVENTORY_Q4_2005
[Param_lookup.WF:wf_M_LKP_schema_tble
$$LKP_SCHEMA=TEST_DATA
$$LKP_TBL=LKP_TBL_PARAM
[Param_lookup.WF:wf_M_LKP_schema_tbl_sess_param]
$Param_Lkp_Schema=TEST_DATA
$Param_Lkp_Tbl=LKP_TBL_PARAM[
Param_session.WF:wf_session_param.ST:s_session_param]
$Param_Schema_Name=CROSS_RESOURCE_LINKING_DUP
$Param_SrcTbl_Name=SRC_TBL_NAME_OVERRIDE_PARAM
$Param_TgtTbl_Name=TGT_TBL_NAME_OVERRIDE_PARAM
[Param_Sql_override.WF:wf_M_schema_table_map_parm_sql.ST:s_M_schema_table_map_parm_sql]
$$Map_Schema_Name=CROSS_RESOURCE_LINKING
$$Map_Tbl_Name=SRC_TBL_NAME_OVERRIDE_DUP

Basic Information

The General tab includes the following basic information about the resource:
Information
Description
Name
The name of the resource.
Description
The description of the resource.
Resource type
The type of the resource.
Execute On
You can choose to execute on the default catalog server or offline.

Resource Connection Properties

The following table describes the properties for the PowerCenter resource:
Property
Description
Gateway Host Name or Address
PowerCenter domain gateway host name or address.
Gateway Port Number
PowerCenter domain gateway port number.
Informatica Security Domain
LDAP security domain name if one exists. Otherwise, enter "Native."
Repository Name
Name of the PowerCenter repository.
Repository User Name
Username for the PowerCenter repository.
Repository User Password
Password for the PowerCenter repository.
PowerCenter Version
PowerCenter repository version.
Note: Informatica does not provide support for PowerCenter versions earlier than 9.6.0.
PowerCenter Code Page
Code page for the PowerCenter repository.
The following table describes the Additional and Advanced properties for source metadata settings in the Metadata Load Settings tab:
Property
Description
Enable Source Metadata
Select to extract metadata from the data source.
Note: You can run Data Flow Analytics on the resource even if you have not enabled the source metadata setting.
Parameter File
Specify the parameter file zip that you want to attach from a local system.
Auto assign Connections
Specifies whether Enterprise Data Catalog assigns the connection is automatically.
Enable Reference Resources
Option to extract metadata about assets that are not included in this resource, but referred to in the resource. Examples include source and target tables in PowerCenter mappings, and source tables and files from Tableau reports.
Repository subset
Enter the file path list separated by semicolons for the Informatica PowerCenter Repository object.
Note: If you want to run Data Flow Analytics on the resource, you need to enter the file path for the Informatica PowerCenter Repository object. If you do not enter the file path to run Data Flow Analytics, by default, the catalog runs Data Flow Analytics on all the PowerCenter Repository objects.
Detailed Lineage
Select to extract and ingest metadata related to transformation logic for assets that include transformations. A transformation indicates generation, modification, or passage of data between source and target connections. A transformation logic displays the mappings or data flow relation types between source assets and target assets related to the asset you select in Enterprise Data Catalog.
Note: To view the detailed lineage for assets that include web service transformations in the Catalog, you must select the Enable Reference Resources option.
Memory
Specify the memory value required to run a scanner job.
Specify one of the following memory values:
  • - Low
  • - Medium
  • - High
Note: Informatica recommends that you use the High memory value if you enable the Data Flow Analytics.
For more information about the memory values, see the Tuning Enterprise Data Catalog Performance article on How To-Library Articles tab in the Informatica Doc Portal
Custom Options
JVM parameters that you can set to configure scanner container. Use the following arguments to configure the parameters:
  • - -Dscannerloglevel=<DEBUG/INFO/ERROR>. Changes the log level of scanner to values, such as DEBUG, ERROR, or INFO. Default value is INFO.
  • - -Dscanner.container.core=<No. of core>. Increases the core for the scanner container. The value should be a number.
  • - -Dscanner.yarn.app.environment=<key=value>. Key pair value that you need to set in the Yarn environment. Use a comma to separate the key pair value.
  • - -Dscanner.pmem.enabled.container.memory.jvm.memory.ratio=<1.0/2.0>. Increases the scanner container memory when pmem is enabled. The default value is 1.
The following table describes the Repository details and Advanced properties for data flow analytics settings in the Metadata Load Settings tab:
Property
Description
Enable Data Flow Analytics
Select to enable Data Flow Analytics and view analytical data about PowerCenter mappings in the catalog.
Note: You must configure Data Asset Analytics for the Catalog Service to run Data Flow Analytics on the resource.
Select PowerCenter Repository Type
Type of database for the PowerCenter repository. Select from the following options:
  • - Oracle
  • - Microsoft SQL Server
  • - IBM DB2
  • - Sybase
  • - PostgreSQL
Username
Name for the PowerCenter repository database user account.
Password
Password for the PowerCenter repository database user account.
Database Connection String
JDBC connection string to connect to the secure database, including the host name and port number and the security parameters for the database.
Custom Options
Specify the custom options to run the scanner job.