Introduction to Extract Structured JSON Data from Unstructured Data using Databricks Mosaic AI recipe

The Extract Structured JSON Data from Unstructured Data using Databricks Mosaic AI recipe is based on REST and SOAP APIs. Use this recipe to extract structured JSON data from unstructured text. The recipe uses a Databricks Mosaic AI large language model (LLM) to identify the relevant information and organize it into fields. Automating this transformation makes it easier to analyze unstructured data and to integrate it into structured workflows.
With this recipe, you upload a file that contains the context and provide the names of the fields to extract as a user prompt. The recipe reads the text from the file, extracts the requested fields, and converts them into JSON format.
You can upload files in the following formats:
.txt, .xml, .json, .html, .csv, .md, and .yaml
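
The flow described above (check the file format, read the text, prompt the model with the field names, and convert the reply into JSON) can be sketched in Python as follows. This is an illustrative sketch only, not the recipe's actual implementation. The function names, the prompt wording, and the `call_llm` callable are assumptions; `call_llm` stands in for whatever request sends a prompt to the Databricks Mosaic AI model and returns its text reply.

```python
import json
from pathlib import Path
from typing import Callable

# File formats listed in this topic.
SUPPORTED_EXTENSIONS = {".txt", ".xml", ".json", ".html", ".csv", ".md", ".yaml"}


def is_supported(filename: str) -> bool:
    """Return True if the file extension is one of the supported formats."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


def build_prompt(field_names: list[str], context: str) -> str:
    """Combine the requested field names and the file text into one prompt.

    The wording is hypothetical; the recipe's real prompt is not documented here.
    """
    fields = ", ".join(field_names)
    return (
        "Extract the following fields from the text and reply with a single "
        f"JSON object whose keys are exactly: {fields}. "
        "Use null for any field that is not present.\n\n"
        f"Text:\n{context}"
    )


def extract_fields(
    filename: str,
    context: str,
    field_names: list[str],
    call_llm: Callable[[str], str],
) -> dict:
    """Validate the format, query the model, and return only the requested fields."""
    if not is_supported(filename):
        raise ValueError(f"Unsupported file format: {filename}")
    reply = call_llm(build_prompt(field_names, context))
    data = json.loads(reply)  # raises json.JSONDecodeError on a non-JSON reply
    if not isinstance(data, dict):
        raise ValueError("Expected the model to return a JSON object")
    # Keep only the requested keys; fields the model omitted become None.
    return {name: data.get(name) for name in field_names}


def fake_llm(prompt: str) -> str:
    """Stand-in for the model call (assumption), returning a fixed reply."""
    return '{"name": "Ada Lovelace", "email": "ada@example.com", "extra": 1}'


result = extract_fields(
    "notes.txt",
    "Contact Ada Lovelace at ada@example.com.",
    ["name", "email", "phone"],
    fake_llm,
)
print(json.dumps(result, indent=2))
```

Passing the model call in as a function keeps the sketch independent of any particular endpoint or credentials, so the validation and JSON handling can be tried without a Databricks connection.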