Administrator Guide > Understanding Globalization > PowerCenter Code Page Conversion
  

PowerCenter Code Page Conversion

When in data movement mode is set to Unicode, the PowerCenter Client accepts input in any language and converts it to UCS-2. The PowerCenter Integration Service converts source data to UCS-2 before processing and converts the processed data from UCS-2 to the target code page before loading.
When you run a session, the PowerCenter Integration Service converts source, target, and lookup queries from the PowerCenter repository code page to the source, target, or lookup code page. The PowerCenter Integration Service also converts the name and call text of stored procedures from the PowerCenter repository code page to the stored procedure database code page.
At run time, the PowerCenter Integration Service verifies that it can convert the following queries and procedure text from the PowerCenter repository code page without data loss:

Choosing Characters for PowerCenter Repository Metadata

You can use any character in the PowerCenter repository code page when inputting PowerCenter repository metadata. If the PowerCenter repository uses UTF-8, you can input any Unicode character. For example, you can store German, Japanese, and English metadata in a UTF-8 enabled PowerCenter repository. However, you must ensure that the PowerCenter Integration Service can successfully perform SQL transactions with source, target, lookup, and stored procedure databases. You must also ensure that the PowerCenter Integration Service can read from source and lookup files and write to target and lookup files. Therefore, when you run a session, you must ensure that the PowerCenter repository metadata characters are encoded in the source, target, lookup, and stored procedure code pages.

Example

The PowerCenter Integration Service, PowerCenter repository, and PowerCenter Client use the ISO 8859-1 Latin1 code page, and the source database contains Japanese data encoded using the Shift-JIS code page. Each code page contains characters not encoded in the other. Using characters other than 7-bit ASCII for the PowerCenter repository and source database metadata can cause the sessions to fail or load no rows to the target in the following situations:
Because the US-ASCII code page is a subset of both the ISO 8859-1 and Shift-JIS code pages, you can avoid these data inconsistencies if you use 7-bit ASCII characters for all of your metadata.