
LangChain Conversational RAG with Chat History

Author: Venkata Sudhakar

ShopMax India's customer support chatbot needs to handle multi-turn conversations where follow-up questions reference earlier context. For example, after asking "What are the specs of the Samsung Galaxy S24?", a customer might ask "Does it support 5G?"; the second question only makes sense in the context of the first. LangChain's conversational RAG chain solves this by reformulating follow-up questions into standalone questions before retrieval.

Conversational RAG combines a history-aware retriever with a question-answer chain. The history-aware retriever uses the LLM to rewrite the latest question, in light of the prior chat messages, into a standalone question that can be answered without any conversation context. This standalone question is then used to retrieve documents, which are passed to the QA chain along with the original chat history to generate the final answer.
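The two-stage flow can be sketched without any dependencies using stub components. The `reformulate` and `retrieve` functions below are deliberately naive stand-ins for the LLM rewrite step and vector retrieval (they are not LangChain APIs); they exist only to make the data flow concrete:

```python
def reformulate(question, chat_history):
    """Stand-in for the LLM rewrite step: resolve a pronoun like 'it'
    against the subject of the previous turn."""
    if chat_history and " it " in f" {question.lower()} ":
        last_subject = chat_history[-1]["subject"]
        return question.replace("it", last_subject)
    return question

def retrieve(standalone_question, knowledge_base):
    """Stand-in for vector retrieval: pick the document with the
    largest keyword overlap with the standalone question."""
    words = set(standalone_question.lower().split())
    return max(knowledge_base, key=lambda d: len(words & set(d.lower().split())))

kb = [
    "Samsung Galaxy S24: 6.2-inch display, 8GB RAM, supports 5G.",
    "ShopMax return policy: 30-day returns on electronics.",
]
history = [{"subject": "the Samsung Galaxy S24"}]

# "Does it support 5G?" is ambiguous on its own; after rewriting it
# becomes retrievable without the conversation context.
standalone = reformulate("Does it support 5G?", history)
doc = retrieve(standalone, kb)
```

In the real chain, both stand-ins are replaced by LLM and vector-store calls, but the shape of the pipeline (rewrite first, retrieve second, answer last) is the same.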

The example below builds a conversational product support bot for ShopMax India that answers product questions and handles follow-up queries using chat history.
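A sketch of that bot is shown below, using LangChain's `create_history_aware_retriever`, `create_stuff_documents_chain`, and `create_retrieval_chain`. It assumes the `langchain`, `langchain-openai`, `langchain-community`, and `faiss-cpu` packages are installed and `OPENAI_API_KEY` is set; the model name, prompts, and product text are illustrative choices, not fixed requirements:

```python
from langchain.chains import create_history_aware_retriever, create_retrieval_chain
from langchain.chains.combine_documents import create_stuff_documents_chain
from langchain_community.vectorstores import FAISS
from langchain_core.messages import AIMessage, HumanMessage
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)

# Illustrative ShopMax India product knowledge base
product_docs = [
    "Samsung Galaxy S24: 6.2-inch display, Exynos 2400 chip, 8GB RAM, "
    "128GB storage, 4000mAh battery with 25W charging, 50MP main camera "
    "with 3x optical zoom, 5G connectivity.",
]
vectorstore = FAISS.from_texts(product_docs, OpenAIEmbeddings())
retriever = vectorstore.as_retriever()

# Step 1: rewrite the latest question into a standalone question
contextualize_prompt = ChatPromptTemplate.from_messages([
    ("system", "Given the chat history and the latest user question, "
               "rewrite the question so it can be understood without "
               "the history. Do not answer it."),
    MessagesPlaceholder("chat_history"),
    ("human", "{input}"),
])
history_aware_retriever = create_history_aware_retriever(
    llm, retriever, contextualize_prompt)

# Step 2: answer from the retrieved documents
qa_prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a ShopMax India support assistant. "
               "Answer using only the context below.\n\n{context}"),
    MessagesPlaceholder("chat_history"),
    ("human", "{input}"),
])
qa_chain = create_stuff_documents_chain(llm, qa_prompt)
rag_chain = create_retrieval_chain(history_aware_retriever, qa_chain)

# Multi-turn conversation: the second question relies on the first
chat_history = []
q1 = "What are the specs of the Samsung Galaxy S24?"
a1 = rag_chain.invoke({"input": q1, "chat_history": chat_history})["answer"]
print("Q1:", a1)
chat_history.extend([HumanMessage(content=q1), AIMessage(content=a1)])

q2 = "Does it support 5G?"
a2 = rag_chain.invoke({"input": q2, "chat_history": chat_history})["answer"]
print("Q2:", a2)
```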


Running the example gives output along the following lines:

Q1: The Samsung Galaxy S24 has a 6.2-inch display, Exynos 2400 chip, 8GB RAM, 128GB storage, 4000mAh battery with 25W charging, and a 50MP main camera with 3x optical zoom.
Q2: Yes, the Samsung Galaxy S24 supports 5G connectivity.

In production, store chat history in a session-based store like Redis keyed by session ID rather than in-memory lists. Limit history length by trimming older messages to avoid exceeding the context window. The contextualize step adds one extra LLM call per turn, so monitor latency in high-volume scenarios. For ShopMax India support, partition knowledge bases by product category so retrieval is faster and more accurate.
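The trimming advice above can be as simple as a sliding window over the message list. This is a minimal sketch; the `max_messages` value and the message format are assumptions for illustration (in practice you would tune the limit to your model's context window, or trim by token count instead):

```python
def trim_history(chat_history, max_messages=10):
    """Keep only the most recent max_messages entries so the prompt
    stays within the model's context window."""
    return chat_history[-max_messages:]

# Simulate a long-running session of 25 turns
history = [{"role": "user", "content": f"question {i}"} for i in range(25)]
trimmed = trim_history(history, max_messages=10)
```

Dropping the oldest messages is the simplest policy; a common refinement is to keep a summary of the trimmed turns so long-range context is not lost entirely.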


 
  


  