Author: Matillion
Date Posted: Oct 24, 2024
Last Modified: Oct 24, 2024
Databricks Mosaic AI Vector Search
Issue RAG queries inside Databricks with Data Productivity Cloud pipelines using Mosaic AI Vector Search.
These pipelines demonstrate how to run Retrieval Augmented Generation (RAG) queries inside Databricks, preparing the prompt by using Mosaic AI Vector Search. The demonstration uses Edgar Allan Poe’s famous poem: The Raven.
To use Mosaic AI Vector Search in Databricks, you will need to:
- Create a Vector Search Endpoint
- Create reference data inside a Delta table, with Change Data Feed enabled
- Use the reference data to create a Vector Search Index
- Query the Index using VECTOR_SEARCH
- Build an augmented prompt for AI_QUERY, to perform RAG
Creating a Vector Search Endpoint
Vector Search Endpoints are a type of compute resource, so in your Databricks console go to Compute > Vector Search and press Create.
One Vector Search Endpoint can handle many Vector Search Indexes, so you may choose to re-use an existing endpoint if you have one already.
Creating reference data inside a Delta table
Run the Data Productivity Cloud pipeline 1 Mosaic AI Vector Search Setup
to create the two Delta tables used in this demonstration:
stg_index_source
- containing the reference data for the vector indexstg_questions
- containing some example questions to ask via RAG
Enabling Change Data Feed
To enable Change Data Feed on a Delta table, use the following SQL:
ALTER TABLE `your-catalog`.`your-schema`.`your-table`
SET TBLPROPERTIES (delta.enableChangeDataFeed = true)
This is done automatically inside the 1 Mosaic AI Vector Search Setup
pipeline. Note that, for efficiency, data is loaded into the table first, then Change Data Feed is enabled afterwards.
Creating a Vector Search Index
Now that the reference data exists, you can create a Vector Search Index using the Vector Search Endpoint created earlier.
In your Databricks console go to Catalog, locate your source table, then follow Create > Vector Search Index.
In the settings, it’s easiest to choose “Compute Embeddings” to have Databricks manage them for you. You must then choose an embedding model, and in the screenshot below it’s databricks-gte-large-en
which is provided by Databricks.
Query a Vector Search Index using VECTOR_SEARCH
Databricks offers the VECTOR_SEARCH function as the SQL interface to search a Vector Index. You supply the name of the index, the text to search, and the number of results. For example:
SELECT * FROM VECTOR_SEARCH(
index => "your-catalog.default.name_of_new_vector_index",
query => "Your search text here",
num_results => 3);
You can see this in action in the 3 Vector Search
pipeline. The Vector Search component takes questions from one table (stg_questions
) and lateral joins to VECTOR_SEARCH function calls: one per question.
To flatten these “wide” format results into a usable prompt, the following component concatenates the top three search results into a single prompt like this:
`search_result[0]`.`document_text` || '\n' ||
`search_result[1]`.`document_text` || '\n' ||
`search_result[2]`.`document_text` || '\n'
The last component in the pipeline is an AI Query that invokes an LLM (Meta-Llama-3.1-70B-Instruct
in this case) for every question in the stg_questions
table.
Even in this simple example, the advantages of RAG are obvious. When you run this yourself, you may see slightly different result, but here’s what I found:
Question | No RAG | LLM with RAG |
---|---|---|
What does the raven say | A reference to Edgar Allan Poe’s classic poem “The Raven”! According to the poem, the raven says: “Quoth the Raven, ‘Nevermore.’” In other words, the raven’s ominous and repetitive response to the narrator’s questions is a single word: “Nevermore.” This word drives home the finality of death and the loss of the narrator’s beloved, Lenore. | The raven says: “Nevermore” |
What kind of shore is it | It seems like you’re asking about the characteristics of a shore, but without more context, it’s a bit challenging to provide a specific answer. Shores can vary greatly depending on their location, the body of water they border (ocean, lake, river), and the geological and climatic «+400 more words» | The shore is described as “Nightly” |
With the extra context provided by the Vector Index Search, the LLM is able to quickly produce concise and accurate answers.
Downloads
Licensed under: Matillion Free Subscription License
- Download Databricks-Mosaic-AI-Vector-Search-RAG.zip
- Target: Databricks