In Browser
	StumbleUpon
	del.icio.us
	Google
	Google Buzz
	reddit
	LinkedIn

	Facebook
	Twitter
	Linkedin
	E-Mail

Agentic AI > ADK Agent Testing > ADK Agent Token Cost Estimation Before Deployment

ADK Agent Token Cost Estimation Before Deployment

Author: Venkata Sudhakar

ShopMax India runs thousands of customer queries each day across order tracking, stock checks, and returns processing. Before deploying a new ADK agent to production, the engineering team needs to predict daily API costs using realistic token usage samples. Token cost estimation lets teams catch over-budget agents in pre-deployment testing and avoid surprise invoices.

The Google Generative AI SDK records input and output token counts in usage_metadata on every response. By sampling token usage across representative queries and multiplying by the per-token price, you can project daily and monthly costs. This approach lets you compare agent versions, set cost budgets as test assertions, and block deploys that exceed the acceptable spend threshold.

The example below builds a cost estimator using five representative ShopMax India queries. Two pytest tests validate that the projected daily cost stays within 15% of the baseline budget and that no single query exceeds the per-query cost ceiling.

import pytest

INPUT_PRICE_PER_1K = 0.00150
OUTPUT_PRICE_PER_1K = 0.00600
QUERY_VOLUME_PER_DAY = 5000
BASELINE_DAILY_COST = 5.50

def estimate_query_cost(input_tokens, output_tokens):
    input_cost = (input_tokens / 1000.0) * INPUT_PRICE_PER_1K
    output_cost = (output_tokens / 1000.0) * OUTPUT_PRICE_PER_1K
    return input_cost + output_cost

def estimate_daily_cost(token_samples):
    avg_input = sum(s["input"] for s in token_samples) / len(token_samples)
    avg_output = sum(s["output"] for s in token_samples) / len(token_samples)
    per_query = estimate_query_cost(avg_input, avg_output)
    return per_query * QUERY_VOLUME_PER_DAY

SAMPLE_TOKEN_USAGE = [
    {"input": 320, "output": 85, "query": "Track order ORD-7821 from Mumbai"},
    {"input": 280, "output": 60, "query": "Is Samsung TV in stock in Delhi?"},
    {"input": 650, "output": 210, "query": "Return policy for electronics?"},
    {"input": 410, "output": 130, "query": "Cancel order ORD-9012 from Bangalore"},
    {"input": 290, "output": 75, "query": "Top phones under Rs 20000"},
]

def test_deployment_cost_within_budget():
    daily = estimate_daily_cost(SAMPLE_TOKEN_USAGE)
    cost_increase_pct = ((daily - BASELINE_DAILY_COST) / BASELINE_DAILY_COST) * 100
    print("Estimated daily cost: Rs " + str(round(daily * 83, 2)))
    print("Cost vs baseline: " + str(round(cost_increase_pct, 1)) + "%")
    assert daily > 0, "Cost must be positive"
    assert cost_increase_pct <= 15.0, (
        "Projected cost " + str(round(daily, 4)) +
        " exceeds baseline by " + str(round(cost_increase_pct, 1)) + "%"
    )

def test_per_query_cost_ceiling():
    for sample in SAMPLE_TOKEN_USAGE:
        cost = estimate_query_cost(sample["input"], sample["output"])
        assert cost < 0.003, (
            "Single query cost " + str(round(cost, 6)) +
            " exceeds ceiling for: " + sample["query"]
        )

It gives the following output,

Estimated daily cost: Rs 521.66
Cost vs baseline: 14.3%
.. (2 passed in 0.04s)

In production, replace the hard-coded token samples with actual usage logs collected from a staging run. Store price constants in environment variables so they can be updated without code changes when the provider changes rates. Set the cost_increase_pct threshold in your CI pipeline as a required gate so that any prompt change that inflates token usage by more than 15% blocks the merge automatically.

Send your comments, suggestions or queries regarding this site to [email protected].