Developer Transformation Guide > Data Processor Transformation > Data Processor Transformation Settings
  

Data Processor Transformation Settings

Configure code pages, XML processing options, and logging settings in the Data Processor transformation Settings view.

Character Encoding

A character encoding is a mapping of the characters from a language or group of languages to hexadecimal code.
When you design a Script, you define the encoding of the input documents and the encoding of the output documents. Define the working encoding to define how the IntelliScript editor displays characters and how the Data Processor transformation processes the characters.

Working Encoding

The working encoding is the code page for the data in memory and the code page for the data that appears in the user interface and work files. You must select a working encoding that is compatible with the encoding of the schemas that you reference in the Data Processor transformation.
The following table shows the working encoding settings:
Setting
Description
Use the Data Processor Default Code Page
Uses the default encoding from the Data Processor transformation.
Other
Select the encoding from the list.
XML Special Characters Encoding
Determines the representation of XML special characters. You can select None or XML.
  • - None.
  • Leave as & < > " '
    Entity references for XML special characters are interpreted as text. For example, the character > appears as
    >
    Default is none.
  • - XML. Convert to & < > " '
  • Entity references for XML special characters are interpreted as regular characters. For example, &gt; appears as the following character:
    >

Input Encoding

The input encoding determines how character data is encoded in input documents. You can configure the encoding for additional input ports in a Script.
The following table describes the encoding settings in the Input area:
Setting
Description
Use Encoding Specified in Input Document
Use the codepage that the source document defines, such as the encoding attribute of an XML document.
If the source document does not have an encoding specification, the Data Processor transfomation uses the encoding settings from the Settings view.
Use Working Encoding
Use the same encoding as the working encoding.
Other
Select the input encoding from a drop-down list.
XML Special Characters Encoding
Determines the representation of XML special characters. You can select None or XML.
  • - None.
  • Leave as &amp; &lt; &gt; &quot; &apos;
    Entity references for XML special characters are interpreted as text, for example, the character > appears as
    &gt;
    Default in None.
  • - XML. Convert to & < > " '
  • Entity references for XML special characters are interpreted as regular characters. For example, &gt; appears as the following character:
    >
Byte Order
Describes how multi-byte characters appear in the input document. You can select the following options:
  • - Little-endian. The least significant byte appears first. Default.
  • - Big-endian. The most significant byte appears first.
  • - No binary conversion.

Output Encoding

The output encoding determines how character data is encoded in the main output document.
The following table describes the encoding settings in the Output area:
Setting
Description
Use Working Encoding
The output encoding is the same as the working encoding.
Other
The user selects the output encoding from the list.
XML Special Characters Encoding
Determines the representation of XML special characters. You can select None or XML.
  • - None.
  • Leave as &amp; &lt; &gt; &quot; &apos;
    Entity references for XML special characters are interpreted as text, for example, the character > appears as
    &gt;
    Default.
  • - XML. Convert to & < > " '
  • Entity references for XML special characters are interpreted as regular characters. For example, &gt; appears as the following character:
    >
Same as Input Encoding
The output encoding is the same as the input encoding.
Byte order
Describes how multi-byte characters appear in the input document. You can select the following options:
  • - Little-endian. The least significant byte appears first. Default.
  • - Big-endian. The most significant byte appears first.
  • - No binary conversion.

Rules and Guidelines for Character Encoding

Use the following rules and guidelines when you configure encodings:

Output Settings

Configure output control settings to control whether the Data Processor transformation creates event logs and saves output documents.
You can control the types of messages that the Data Processor transformation writes to the design-time event log. If you save the parsed input documents with the event logs, you can view the context where the error occurred in the Event view.
The following table describes the settings in the Design-Time Events area:
Setting
Description
Log design-time events
Determines whether to create a design-time event log. By default, the Data Processor transformation logs notifications, warnings, and failures in the design-time event log. You can exclude the following event types:
  • - Notifications
  • - Warnings
  • - Failures
Save parsed documents
Determines when the Data Processor transformation saves a parsed input document. You can select the following options:
  • - Always.
  • - Never
  • - On failure
The default is always.
The following table describes the settings in the Run-Time Events area:
Setting
Description
Log run-time events
Determines whether an event log is created when you run the transformation from a mapping.
  • - Never.
  • - On failure
The default is Never.
The following table describes the settings in the Output area:
Setting
Description
Disable automatic output
Determines whether the Data Processor transformation writes the output to the standard output file. Disable standard output in the following situations:
  • - You pass the output of a Parser to the input of another component before the transformation creates an output file.
  • - You use a WriteValue action to write data directly to the output from a Script instead of passing data through the output ports.
Disable value compression
Determines whether the Data Processor transformation uses value compression to optimize memory use.
Important: Do not disable value compression except when Informatica Global Customer Support advises you to do disable it.

Processing Settings

The processing settings define how the Data Processor transformation processes an element without a defined datatype. The settings affect Scripts. The settings do not affect elements that an XMap processes.
The following table describes the processing settings that affect XML processing in Scripts:
Setting
Description
Treat as xs:string
The Data Processor transformation treats an element with no type as a string. In the Choose XPath dialog box, the element or attribute appears as a single node.
Treat as xs:anyType
The Data Processor transformation treats an element with no type as anyType. In the Choose XPath dialog box, the element or attribute appears as a tree of nodes. One node is of xs:string type, and all named complex data types appear as tree nodes.
The following table describes a processing setting that affects Streamer processing:
Setting
Description
Chunk size
This setting defines the amount of data that the Streamer reads each time from an input file stream. The Data Processor transformation applies this setting to a Streamer with a file input.
The following table describes a processing setting that affects hierarchical to relational transformation processing:
Setting
Description
Normalize XML input
This setting determines if the Data Processor transformation normalizes XML input. By default, the transformation performs normalization for XML input. In some cases, you might choose to skip automatic normalization to increase performance.

XMap Settings

The XMap setting defines how the Data Processor transformation processes XMap input elements that are not transformed to output elements. The unread elements are passed to a dedicated port named XMap_Unread_Input_Values. The setting takes affect only when the XMap is selected as the start-up component. The setting does not affect elements that the XMap processes.
To pass unread XMap elements to a dedicated port, enable the setting Write unread elements to an additional output port.

XML Output Configuration

The XML generation settings define characteristics of XML output documents.
The following table describes the XML generation settings in the Schema Title area:
Setting
Description
Schema location
Defines the schemaLocation for the root element of the main output document.
No namespace schema location
Defines the xsi:noNamespaceSchemaLocation attribute of the root element of the main output document.
Configure XML Output Mode settings to determine how the Data Processor transformation handles missing elements or attributes in the input XML document. The following table describes the XML generation settings in the XML Output Mode area:
Setting
Description
As is
Do not add or remove empty elements. Default is enabled.
Full
All required and optional elements defined in the output schema are written to the output. Elements that have no content are written as empty elements.
Compact
Removes empty elements from the output.
If Add for Elements is enabled, then the Data Processor transformation removes only the optional elements.
If Add for Elements is disabled, the Data Processor transformation removes all empty elements. The XML output might not be valid.
The following table describes the XML generation settings in the Default Values for Required Nodes area:
Setting
Description
Add for elements
When the output schema defines a default value for a required element, the output includes the element with a default value. Default is enabled.
Add for attributes
When the output schema defines a default value for a required attribute, the output includes the attribute with its default value. Default is enabled.
Validate added values
Determines whether the Data Processor transformation validates empty elements that are added by the Full mode output. Default is disabled.
If Validate added values is enabled and the schema does not allow empty elements, the XML output might not be valid.
The following table describes the XML generation settings in the Processing Instructions area:
Setting
Description
Add XML processing instructions
Defines the character encoding and XML version of the output document. Default is selected.
XML version
Defines the XML version. The XML version setting has the following options:
  • - 1.0
  • - 1.1
Default is 1.0.
Encoding
Defines the character encoding that is specified in the processing instruction. The Encoding setting has the following options:
  • - Same as output encoding. The output encoding in the processing instruction is the same as the output encoding defined in the Data Processor transformation settings.
  • - Custom. Defines the output encoding in the processing instruction. The user types the value in the field.
Add custom processing instructions
Adds other processing instructions to the output document. Enter the processing instruction exactly as it appears in the output document. Default is Disabled.
The following table describes the XML generation settings in the XML Root area:
Setting
Description
Add XML root element
Adds a root element to the output document. Use this option when the output document contains more than one occurrence of the root element defined in the output schema. Default is Disabled.
Root element name
Defines a name for the root element to add to the output document.