To use the Simple RAG Consumption with Databricks Mosaic AI recipe, the following prerequisites must be met:
1Existing database table
You must have a database table available on which the vector index search will be performed.
2Enable datafeed
Enable the datafeed for this database table to allow continuous data ingestion.
3Create a vector endpoint
In Databricks, create a vector endpoint. A Databricks vector endpoint is a scalable, serverless API hosted on Databricks that enables real-time serving of vector embeddings generated by machine learning models.
The following image shows the option to create a vector endpoint:
4Create a vector search index
Under the created vector endpoint, create a vector search index for the database table.
- Ensure that the database table has read and write permissions.
- The datafeed must be enabled on the table to support indexing.
The following image shows the vector search index option:
The following image shows a sample creation of a vector search index:
After creation, the vector endpoint will appear on the Vector Search page as shown in the following image: