Processing Unstructured and Semi-structured Data with Intelligent Structure Model Overview

You can use CLAIRE™ Intelligent Structure Discovery to parse semi-structured or unstructured data in mappings that run on the Spark engine.
Long, complex files with little or no structure can be difficult to understand, much less parse. CLAIRE™ Intelligent Structure Discovery can automatically discover the structure in such data.
CLAIRE™ uses machine learning algorithms to decipher data in semi-structured or unstructured data files and to create a model of the underlying structure of the data. In Informatica Intelligent Cloud Services, you can generate an Intelligent structure model, which is a model of the patterns, repetitions, relationships, and field types discovered in a file.
To use the model, you export it from Data Integration and then associate it with a data object in a Big Data Management mapping. When you run the mapping on the Spark engine, the mapping uses the Intelligent structure model to extract and parse data from input files based on the structure expressed in the model.
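
The idea of parsing input based on a previously discovered structure can be illustrated with a small Python sketch. It is a conceptual toy only: `TOY_MODEL`, `compile_model`, and `parse_line` are hypothetical names, the sample log line is invented, and the hand-written list of fields does not reflect the actual format of an Intelligent structure model or the way CLAIRE™ discovers structure.

```python
import re

# Hypothetical, simplified stand-in for a structure model: an ordered list of
# (field name, regex pattern, type converter). This is an assumption made for
# illustration and is NOT the Informatica model format.
TOY_MODEL = [
    ("timestamp", r"\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}", str),
    ("level", r"[A-Z]+", str),
    ("user_id", r"\d+", int),
    ("message", r".*", str),
]


def compile_model(model):
    """Build one regex with a named group per field, separated by whitespace."""
    parts = [f"(?P<{name}>{pattern})" for name, pattern, _ in model]
    return re.compile(r"\s+".join(parts))


def parse_line(line, model, compiled):
    """Return a dict of typed fields, or None if the line does not fit the model."""
    match = compiled.fullmatch(line.strip())
    if match is None:
        return None
    return {name: convert(match.group(name)) for name, _, convert in model}


compiled = compile_model(TOY_MODEL)
record = parse_line(
    "2019-03-01T12:00:05 ERROR 42 disk quota exceeded", TOY_MODEL, compiled
)
print(record)
```

In this sketch the model is written by hand, whereas in the workflow described above the model is generated in Informatica Intelligent Cloud Services and the parsing is performed by the mapping on the Spark engine.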