In Browser
	StumbleUpon
	del.icio.us
	Google
	Google Buzz
	reddit
	LinkedIn

	Facebook
	Twitter
	Linkedin
	E-Mail

Generative AI > Prompt Engineering > Retrieval-Augmented Prompting - Dynamic Context Injection

Retrieval-Augmented Prompting - Dynamic Context Injection

Author: Venkata Sudhakar

Retrieval-augmented prompting dynamically injects relevant context into the prompt at request time, rather than embedding all possible knowledge in the system prompt. At ShopMax India, a customer asking about return policy for a specific product category gets a prompt that includes only the relevant policy section retrieved from a knowledge base - keeping prompts focused and token costs low.

The pattern works in three steps: retrieve relevant documents or chunks based on the query (using keyword search, vector similarity, or a lookup table), inject the retrieved content into the prompt as context, then ask the LLM to answer using only the provided context. Placing retrieved context immediately before the question gives the LLM the best signal. Adding an instruction to say unknown when context is insufficient improves reliability.

The example below shows ShopMax India dynamically injecting policy context into a customer support prompt. A simple keyword lookup retrieves the relevant policy section, which is injected into the prompt before the LLM answers the customer question.

import anthropic

client = anthropic.Anthropic()

POLICY_KB = {
    "return": "ShopMax India Return Policy: Items can be returned within 7 days of delivery for a full refund. Items must be in original packaging with all accessories. Large appliances require technician inspection before return approval. Refund is processed in 3-5 business days.",
    "delivery": "ShopMax India Delivery Policy: Standard delivery is 3-5 business days for metros (Mumbai, Delhi, Bangalore, Hyderabad, Chennai). Express delivery available for Rs 199 extra. Free delivery on orders above Rs 10000.",
    "warranty": "ShopMax India Warranty Policy: All products carry manufacturer warranty. Warranty claims require purchase invoice and product serial number. Physical damage and voltage fluctuation damage are not covered.",
    "payment": "ShopMax India Payment Policy: Accepts UPI, credit cards, debit cards, net banking, and EMI. No-cost EMI available on orders above Rs 15000 with partner banks."
}

def retrieve_context(query):
    query_lower = query.lower()
    for key, content in POLICY_KB.items():
        if key in query_lower:
            return content
    return "No specific policy found for this query."

questions = [
    "What is the return window for a Samsung TV?",
    "How long does delivery take to Bangalore?",
    "Is voltage damage covered under warranty?"
]

for question in questions:
    context = retrieve_context(question)
    prompt = "Context:\n" + context + "\n\nCustomer question: " + question + "\n\nAnswer using only the context above. Be concise."
    response = client.messages.create(
        model="claude-haiku-4-5", max_tokens=128,
        messages=[{"role": "user", "content": prompt}]
    )
    print("Q:", question)
    print("A:", response.content[0].text.strip())
    print()

It gives the following output,

Q: What is the return window for a Samsung TV?
A: You can return your Samsung TV within 7 days of delivery for a full refund.
Large appliances like TVs require a technician inspection before return approval.
Ensure the item is in original packaging with all accessories.

Q: How long does delivery take to Bangalore?
A: Bangalore is a metro city, so standard delivery takes 3-5 business days.
Express next-day delivery is available for Rs 199 extra.

Q: Is voltage damage covered under warranty?
A: No, voltage fluctuation damage is explicitly not covered under the ShopMax
India warranty policy. Only manufacturer defects are covered.

At ShopMax India, use vector embeddings instead of keyword lookup for production retrieval - it handles paraphrased queries like send it back matching the return policy section. Chunk policies at the section level (200-500 tokens) rather than the document level for more precise retrieval. Add a fallback: if no relevant context is retrieved, route the question to a human agent rather than letting the LLM answer from its training data, which may be outdated or inaccurate for your specific policies.

Send your comments, suggestions or queries regarding this site to [email protected].