Consolidation Transformation Views

The Consolidation transformation contains views for ports, strategies, and advanced properties.

Consolidation Transformation Strategies View

The Strategies view contains properties for simple, row-based, and advanced strategies.
The following list describes the types of consolidation strategies:
Simple Strategy
A simple strategy analyzes all the values of a port in a record group and selects one value. You specify a simple strategy for each port. The Consolidation transformation uses the port values selected by all simple strategies to create a consolidated record. Examples of simple strategies include the most frequent value in a port, the longest value in a port, or the most frequent non-blank value in a port.
Row-Based Strategy
A row-based strategy analyzes rows in the record group and selects one row. The Consolidation transformation uses the port values from that row to create a consolidated record. Examples of row-based strategies include the highest character count, the lowest number of blank fields, or the highest count of the most frequent fields.
Advanced Strategy
An advanced strategy analyzes a record group using strategies that you define. You build advanced strategies by using consolidation functions in an expression. The Consolidation transformation creates a consolidated record based on the output of the expression. The expression that you create can also use all of the functions available in the Decision transformation.
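
The difference between simple and row-based strategies can be illustrated with a short Python sketch. This is a conceptual model only, not how the Consolidation transformation is implemented. The record group, the port names, and the function names are all hypothetical and chosen for illustration.

```python
from collections import Counter

# Hypothetical record group: rows that share the same Group By value.
group = [
    {"name": "Jon Smith", "city": "Boston", "phone": ""},
    {"name": "J. Smith", "city": "Boston", "phone": "555-0100"},
    {"name": "Jon Smith", "city": "", "phone": "555-0100"},
]

def most_frequent_non_blank(values):
    """Simple strategy: the most frequent non-blank value in a port."""
    non_blank = [v for v in values if v.strip()]
    return Counter(non_blank).most_common(1)[0][0] if non_blank else ""

def longest(values):
    """Simple strategy: the longest value in a port."""
    return max(values, key=len)

def consolidate_simple(rows, strategy_by_port):
    """Apply one simple strategy per port and assemble a consolidated record."""
    return {
        port: strategy([row[port] for row in rows])
        for port, strategy in strategy_by_port.items()
    }

def consolidate_row_based(rows):
    """Row-based strategy: select the row with the lowest number of blank fields."""
    return min(rows, key=lambda row: sum(1 for v in row.values() if not v.strip()))

# Simple strategies pick one value per port, possibly from different rows.
simple = consolidate_simple(
    group,
    {"name": longest, "city": most_frequent_non_blank, "phone": most_frequent_non_blank},
)

# A row-based strategy picks one whole row.
row_based = consolidate_row_based(group)
```

In this sketch, the simple strategies build a record from values that come from different rows, while the row-based strategy returns the second row unchanged because it has no blank fields.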

Consolidation Transformation Advanced Properties

The Consolidation transformation contains advanced properties that determine the sort behavior, the output mode, the cache memory behavior, and the tracing level.
You can configure the following advanced properties:
Sort
Determines whether the transformation sorts the input rows on the Group By port data. The property is enabled by default.
Select the property if the input rows are not presorted.
Case Sensitive Sort
Determines whether the sort operation is case-sensitive. The property is enabled by default.
Output Mode
Determines whether the transformation writes all records as output or writes only the consolidated records as output. The default value is All.
Cache File Directory
Specifies the directory to which the Data Integration Service writes temporary data for the current transformation. The Data Integration Service writes temporary files to the directory when the volume of input data is greater than the available system memory. The Data Integration Service deletes the temporary files after the mapping runs.
You can enter a directory path in the property, or you can use a parameter to identify the directory. Specify a local path on the Data Integration Service machine. The Data Integration Service must be able to write to the directory. The default value is the CacheDir system parameter.
Cache File Size
Determines the amount of system memory that the Data Integration Service uses to sort the input data in the transformation. The default value is 400,000 bytes. You can use a parameter to specify the cache file size.
Before it sorts the data, the Data Integration Service allocates the amount of memory that you specify. If the sort operation generates a greater amount of data, the Data Integration Service writes the excess data to the cache file directory. If the sort operation requires more memory than the system memory and the file storage can provide, the mapping fails.
Note: If you enter a value of 65536 or higher, the transformation reads the value in bytes. If you enter a lower value, the transformation reads the value in megabytes.
Tracing Level
Amount of detail that appears in the log for this transformation. You can choose terse, normal, verbose initialization, or verbose data. Default is normal.
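
The note on the Cache File Size property means that the same field accepts two units, depending on the magnitude of the value. The following Python sketch shows one way to reason about the effective size. The function name is hypothetical, and the sketch assumes that one megabyte equals 1,048,576 bytes, which the note does not state.

```python
BYTES_THRESHOLD = 65536  # from the note: values at or above this are read as bytes

def cache_size_in_bytes(value: int) -> int:
    """Return the effective cache size in bytes for a Cache File Size value."""
    if value < 0:
        raise ValueError("cache file size cannot be negative")
    if value >= BYTES_THRESHOLD:
        return value  # read as bytes
    # Assumption: 1 MB = 1024 * 1024 bytes.
    return value * 1024 * 1024  # read as megabytes

default_size = cache_size_in_bytes(400000)  # the default value, read as bytes
sixteen_mb = cache_size_in_bytes(16)        # a small value, read as megabytes
```

Under these assumptions, a value of 16 requests far more memory than the default value of 400,000, because the smaller number is read in megabytes.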

Cache File Size

The cache file size property determines the amount of system memory that the Data Integration Service assigns to the Consolidation transformation for sort operations. Configure the property with a value that is less than or equal to the amount of RAM on the Data Integration Service host machine.
For best performance, specify a cache file size of at least 16 MB.
Before it begins a sort operation, the Data Integration Service allocates the memory that the cache file size property specifies. The Data Integration Service passes all input data into the Consolidation transformation before it performs a sort operation.
If the volume of input data is greater than the cache file size, the Data Integration Service writes data to the cache file directory. When it writes data to the cache file directory, the Data Integration Service consumes disk space equal to at least twice the input data volume.
Use the following formula to determine the size of incoming data:
number_of_input_rows * (Sum(column_size) + 16)
The following table lists the possible data types and the column size values to apply in cache file data calculations:
| Data Type | Column Size |
| --- | --- |
| Binary | Precision + 8. Round to nearest multiple of 8. |
| Date/Time | 29 |
| Decimal, high precision off (all precision) | 16 |
| Decimal, high precision on (precision <=18) | 24 |
| Decimal, high precision on (precision >18, <=28) | 32 |
| Decimal, high precision on (precision >28) | 16 |
| Decimal, high precision on (negative scale) | 16 |
| Double | 16 |
| Real | 16 |
| Integer | 16 |
| String, Text | Unicode mode: 2*(precision + 5). ASCII mode: precision + 9. |
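
As a worked example, the following Python sketch applies the formula and the column size values above to a hypothetical input. The function names, the row count, and the column definitions are assumptions made for illustration. The sketch omits the Binary data type.

```python
def string_size(precision: int, unicode_mode: bool = True) -> int:
    """Column size for String and Text data."""
    return 2 * (precision + 5) if unicode_mode else precision + 9

def decimal_size(precision: int, high_precision: bool = True, scale: int = 0) -> int:
    """Column size for Decimal data."""
    if not high_precision or scale < 0 or precision > 28:
        return 16
    return 24 if precision <= 18 else 32

# Fixed column sizes taken from the table.
FIXED_SIZES = {"date/time": 29, "double": 16, "real": 16, "integer": 16}

def input_data_size(number_of_input_rows: int, column_sizes: list[int]) -> int:
    """number_of_input_rows * (Sum(column_size) + 16)"""
    return number_of_input_rows * (sum(column_sizes) + 16)

# Hypothetical input: one million rows with four columns.
columns = [
    FIXED_SIZES["integer"],     # 16
    string_size(50),            # Unicode mode: 2 * (50 + 5) = 110
    FIXED_SIZES["date/time"],   # 29
    decimal_size(10),           # high precision on, precision <= 18: 24
]
size_in_bytes = input_data_size(1_000_000, columns)  # 1,000,000 * (179 + 16)

# If the data exceeds the cache file size, plan for at least twice
# the input data volume in the cache file directory.
min_disk_bytes = 2 * size_in_bytes
```

For this hypothetical input, the estimate is 195,000,000 bytes of incoming data, which implies at least 390,000,000 bytes of disk space if the sort spills to the cache file directory.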