
Claude with FastAPI - Building a Production REST API

Author: Venkata Sudhakar

Wrapping Claude in a FastAPI service gives ShopMax India a production-grade REST API for AI features. Customer-facing apps, internal dashboards, and mobile apps all call the same endpoint, while Claude access is centralized with rate limiting, authentication, and logging in one place.

FastAPI's async support pairs naturally with the Anthropic async client. A request hits the endpoint, the async Claude call runs without blocking other requests, and the complete response is returned as JSON once the call finishes. Pydantic models validate both request payloads and response shapes. An API key check, implemented as a header dependency, protects the endpoint from unauthorized use.
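To make the non-blocking behaviour concrete, here is a minimal standard-library sketch. It does not call Claude: `fake_claude_call` is a hypothetical stand-in that sleeps for a fixed delay, and `handle_many` is an illustrative helper, not part of FastAPI or the Anthropic SDK. It only shows that several awaited calls overlap instead of running one after another.

```python
import asyncio
import time


async def fake_claude_call(question: str, delay: float = 0.2) -> str:
    # Hypothetical stand-in for `await client.messages.create(...)`.
    # It sleeps instead of calling the API, yielding control to the event loop.
    await asyncio.sleep(delay)
    return f"answer to: {question}"


async def handle_many(questions: list[str]) -> tuple[list[str], float]:
    # Run all calls concurrently, the way an async server overlaps requests.
    start = time.perf_counter()
    answers = await asyncio.gather(*(fake_claude_call(q) for q in questions))
    return list(answers), time.perf_counter() - start


answers, elapsed = asyncio.run(handle_many(["q1", "q2", "q3", "q4", "q5"]))
print(f"{len(answers)} answers in {elapsed:.2f}s")
```

Run sequentially, five 0.2-second calls would take about one second. Because each call awaits, the total here should be close to the duration of a single call.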

The following example shows ShopMax India's product assistant endpoint. It accepts a customer query and product context, calls Claude with a ShopMax system prompt, and returns a structured JSON response including the answer and token usage.

from fastapi import Depends, FastAPI, Header, HTTPException
from pydantic import BaseModel
import anthropic

app = FastAPI(title="ShopMax India AI API")
# AsyncAnthropic reads ANTHROPIC_API_KEY from the environment by default
client = anthropic.AsyncAnthropic()

# Hardcoded for illustration only; load keys from a secrets store in production
VALID_API_KEYS = {"shopmax-internal-key-2024", "shopmax-mobile-key-2024"}

class QueryRequest(BaseModel):
    customer_id: str
    question: str
    product_context: str = ""

class QueryResponse(BaseModel):
    customer_id: str
    answer: str
    input_tokens: int
    output_tokens: int

def verify_api_key(x_api_key: str = Header(...)):
    # Reject unknown keys with 401 before the handler body runs
    if x_api_key not in VALID_API_KEYS:
        raise HTTPException(status_code=401, detail="Invalid API key")
    return x_api_key

@app.post("/api/v1/product-assistant", response_model=QueryResponse)
async def product_assistant(request: QueryRequest, api_key: str = Depends(verify_api_key)):
    system = (
        "You are ShopMax India product assistant. "
        "Help customers with product information, comparisons, and recommendations. "
        "Be concise and mention prices in Rs when known."
    )
    user_content = request.question
    if request.product_context:
        user_content = "Product context: " + request.product_context + "\n\nCustomer question: " + request.question
    response = await client.messages.create(
        model="claude-haiku-4-5",
        max_tokens=512,
        system=system,
        messages=[{"role": "user", "content": user_content}]
    )
    return QueryResponse(
        customer_id=request.customer_id,
        answer=response.content[0].text,
        input_tokens=response.usage.input_tokens,
        output_tokens=response.usage.output_tokens
    )

@app.get("/health")
async def health():
    return {"status": "ok", "service": "ShopMax AI API"}

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=8000)

Starting the server and sending a test request produces output like the following:

INFO:     Started server process [12345]
INFO:     Uvicorn running on http://0.0.0.0:8000

# Test with curl:
# curl -X POST http://localhost:8000/api/v1/product-assistant \
#   -H "x-api-key: shopmax-internal-key-2024" \
#   -H "Content-Type: application/json" \
#   -d '{"customer_id": "CUST-881", "question": "Which TV under Rs 50000 has best picture quality?"}'

{
  "customer_id": "CUST-881",
  "answer": "For TVs under Rs 50,000 at ShopMax India, the LG 43-inch 4K UHD offers excellent picture quality with IPS panel technology...",
  "input_tokens": 98,
  "output_tokens": 87
}
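The same test request can also be prepared from Python. The sketch below uses only the standard library and assumes the server from the example is listening on localhost:8000. `build_request` is a hypothetical helper name, and the actual send is left commented out because it needs a running server.

```python
import json
import urllib.request


def build_request(base_url: str, api_key: str, customer_id: str, question: str) -> urllib.request.Request:
    # Mirror the curl call: JSON body plus the x-api-key header.
    payload = {"customer_id": customer_id, "question": question}
    return urllib.request.Request(
        url=base_url + "/api/v1/product-assistant",
        data=json.dumps(payload).encode("utf-8"),
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


req = build_request(
    "http://localhost:8000",
    "shopmax-internal-key-2024",
    "CUST-881",
    "Which TV under Rs 50000 has best picture quality?",
)

# Sending requires the server to be running:
# with urllib.request.urlopen(req, timeout=30) as resp:
#     body = json.load(resp)
#     print(body["answer"], body["input_tokens"], body["output_tokens"])
```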

In production at ShopMax India, add request timeout middleware (typically 30 seconds for Claude calls) to prevent hung connections. Use a connection pool via AsyncAnthropic with httpx limits set to match your expected concurrency. Add structured logging with correlation IDs so you can trace a specific customer's AI request across logs. For high-traffic periods like Diwali sales, cache common product queries in Redis with a short TTL to avoid redundant Claude calls for the same popular questions.
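Two of these recommendations, the request timeout and the short-TTL cache, can be sketched with the standard library alone. This is an illustration under stated assumptions, not ShopMax India's implementation: `TTLCache` is a hypothetical in-memory stand-in for Redis, `answer_with_cache` is an illustrative helper, and `fake_fetch` replaces the real Claude call.

```python
import asyncio
import time
from typing import Awaitable, Callable, Optional


class TTLCache:
    """Hypothetical in-memory stand-in for Redis: values expire after ttl_seconds."""

    def __init__(self, ttl_seconds: float):
        self.ttl = ttl_seconds
        self._store: dict = {}

    def get(self, key: str) -> Optional[str]:
        entry = self._store.get(key)
        if entry is None:
            return None
        expires_at, value = entry
        if time.monotonic() >= expires_at:
            del self._store[key]  # drop the expired entry
            return None
        return value

    def set(self, key: str, value: str) -> None:
        self._store[key] = (time.monotonic() + self.ttl, value)


async def answer_with_cache(
    question: str,
    fetch: Callable[[str], Awaitable[str]],
    cache: TTLCache,
    timeout: float = 30.0,
) -> str:
    # Normalize case and whitespace so trivially different phrasings share a key.
    key = " ".join(question.lower().split())
    cached = cache.get(key)
    if cached is not None:
        return cached
    # Raises asyncio.TimeoutError if the upstream call exceeds the timeout.
    answer = await asyncio.wait_for(fetch(question), timeout=timeout)
    cache.set(key, answer)
    return answer


call_log = []  # records every question that actually reached the fake upstream


async def fake_fetch(question: str) -> str:
    # Stand-in for the real Claude call.
    call_log.append(question)
    await asyncio.sleep(0.01)
    return "answer for " + question


async def demo():
    cache = TTLCache(ttl_seconds=60)
    first = await answer_with_cache("Best TV under Rs 50000?", fake_fetch, cache)
    second = await answer_with_cache("best tv  under rs 50000?", fake_fetch, cache)
    return first, second


first, second = asyncio.run(demo())
print(first == second, len(call_log))
```

The second question differs only in case and spacing, so it is served from the cache and the fake upstream is called once. In a real deployment the key normalization and TTL would need tuning so that stale prices or stock information are not served for too long.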
