Data Discovery Guide > Part III: Data Discovery with Informatica Developer > Column Profiles on Semi-structured Data Sources > Column Profiles on Semi-structured Data Sources Overview
  

Column Profiles on Semi-structured Data Sources Overview

You can create data objects from Avro, JSON, Parquet, and XML data sources and then create a column profile on the data objects.
Avro, JSON, Parquet, and XML formats are semi-structured data sources. To use the semi-structured data sources to create a column profile, you can perform the following tasks:
  1. 1. Create a physical data object on the semi-structured data source.
  2. 2. Create and run a column profile on the physical data object.
You can create flat file data objects for JSON or XML data sources. You can create complex file data objects for Avro, JSON, Parquet, and XML data sources in Hadoop Distributed File System (HDFS).