LangChain Multi-Query Retriever - Improving Search with Query Variations
Author: Venkata Sudhakar
When ShopMax India customers search for products, a single embedding query often misses relevant results because the query wording does not match how the documents are indexed. LangChain's MultiQueryRetriever solves this by using an LLM to generate multiple variations of the original query, running all of them against the vector store, and deduplicating the merged results - dramatically improving recall without manual query tuning.
MultiQueryRetriever wraps any existing retriever. When invoked, it sends the original question to an LLM with a prompt asking for alternative phrasings, then runs each variation as a separate similarity search. Results from all queries are merged and deduplicated so each document is returned only once. This approach is especially useful when customers use informal language, abbreviations, or terminology that differs from the product catalog.
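The flow described above can be sketched in plain Python. Here `generate_variations` is a stub standing in for the LLM call, and `similarity_search` is whatever search function the underlying retriever provides; the dedupe key is a document ID for illustration:

```python
from typing import Callable, Dict, List


def generate_variations(query: str) -> List[str]:
    # Stub: MultiQueryRetriever does this with an LLM and a rewrite prompt.
    return [query, query + " hours", query + " on full charge"]


def multi_query_search(
    query: str,
    similarity_search: Callable[[str], List[Dict]],
) -> List[Dict]:
    """Run every query variation, then merge results, skipping duplicates."""
    seen, merged = set(), []
    for variation in generate_variations(query):
        for doc in similarity_search(variation):
            if doc["id"] not in seen:  # dedupe across variations
                seen.add(doc["id"])
                merged.append(doc)
    return merged
```

Because each variation is an independent similarity search, documents that match only one phrasing still make it into the merged result set.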
The example below shows ShopMax India using MultiQueryRetriever on a product FAQ vector store to improve retrieval for a query about headphone battery life.
Running it produces output like the following:
INFO:langchain.retrievers.multi_query:Generated queries: ['What is the battery duration of Sony headphones?', 'Sony headphone playtime on full charge', 'How many hours does Sony WH-1000XM5 battery last?']
Sony WH-1000XM5 : Sony WH-1000XM5 offers up to 30 hours of battery life on a single charge.
Sony WH-1000XM5 : The WH-1000XM5 can be charged via USB-C and supports quick charge - 3 minu…
In production, set include_original=True to run the original query alongside the generated variations. Tune the number of variations by customizing the prompt - three is usually a good balance between recall and latency. MultiQueryRetriever adds one LLM call per retrieval, so cache frequently asked questions at the application level to avoid repeated query generation for common ShopMax product questions. Monitor which generated queries produce the most unique results to assess retrieval quality.