Sorted Data
To improve job performance, you can configure an Aggregator transformation to use sorted data.
When you configure an Aggregator transformation to use sorted data, you must sort data earlier in the data flow and indicate that the incoming data is sorted. Add the group by fields as sort fields in the upstream Source transformation.
When the Mapping Configuration task performs aggregate calculations on sorted data, the task caches sequential rows of the same group. When the task reads data for a different group, it performs aggregate calculations for the cached group and then continues with the next group.
When you do not use sorted data, the Aggregator transformation performs aggregate calculations after it reads all data.
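The difference between the two modes can be illustrated with a minimal Python sketch. This is not the product's implementation. The function names, the `store` and `sales` fields, and the sample rows are hypothetical and chosen only for illustration. The sorted version holds one group at a time and emits its result as soon as the group ends. The unsorted version must read every row before any result is final.

```python
from itertools import groupby
from operator import itemgetter


def aggregate_sorted(rows, key, value):
    """Sum `value` per group, assuming rows arrive sorted by `key`.

    Only the rows of the current group are held at a time; the total for
    a group is yielded as soon as a row from a different group appears.
    """
    for group, members in groupby(rows, key=itemgetter(key)):
        yield group, sum(row[value] for row in members)


def aggregate_unsorted(rows, key, value):
    """Sum `value` per group with no ordering assumption.

    Totals for every group are kept until all rows have been read,
    because any later row may belong to any group.
    """
    totals = {}
    for row in rows:
        totals[row[key]] = totals.get(row[key], 0) + row[value]
    return totals


# Hypothetical input, already sorted by the group by field "store".
rows = [
    {"store": "A", "sales": 10},
    {"store": "A", "sales": 5},
    {"store": "B", "sales": 7},
    {"store": "C", "sales": 1},
    {"store": "C", "sales": 2},
]

sorted_result = list(aggregate_sorted(rows, "store", "sales"))
unsorted_result = aggregate_unsorted(rows, "store", "sales")
```

In this sketch, both functions produce the same totals for the sample rows. If the input to `aggregate_sorted` were not actually sorted by the group by field, the same group would be emitted more than once with partial totals, which is why the data must be sorted earlier in the data flow.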
To configure the Aggregator transformation to process sorted data, on the Group By tab, select Incoming Fields are Sorted.