Rules and Guidelines for Intelligent Structure Models
Consider the following rules and guidelines when you work with Intelligent structure model:
- •If the Intelligent structure model does not match the input file, or only partially matches the input file, there might be a large amount of unidentified data and data loss. It is important therefore to choose a representative file to create the model.
- •When you create an Intelligent structure model in Intelligent Structure Discovery, ensure that the names for output groups follow the Developer tool naming conventions.
- - To ensure compatibility, an element name must contain only English letters (A- Z, a-z), numerals (0-9), and underscores.
- - Do not use reserved logical terms such as And, Or, or other reserved names.
- - Do not start element names with a number.
- - Do not use duplicate names for different elements.
- •Ensure that the Intelligent structure model is valid before you add it to the Column Projection properties of the data object properties.
- •You can only use a Read transformation with an Intelligent structure model in a mapping. Do not create or use a Write transformation with an Intelligent structure model, as the mapping will fail.
- •If the Intelligent structure model version is incompatible with the Big Data Management version, import the model in a compatible model version from Data Integration.
- •A data object with an Intelligent structure model parse PDF forms, Microsoft Word tables, and XML files whose size is less than the supported Hadoop split size of 256 M.
- •An Intelligent structure model can parse the data within PDF form fields but not data outside the fields.
- •An Intelligent structure model can parse data within Microsoft Word tables. Other data is unidentified.