Testing ADK Agents with Code Execution Built-in Tool

Author: Venkata Sudhakar

ShopMax India uses ADK agents with the Code Execution built-in tool to compute order totals with GST, generate discount breakdowns, and calculate EMI schedules on the fly. Testing these agents means mocking the execution sandbox so that tests never run arbitrary generated code, while still verifying that the agent constructs correct code, passes the right inputs, and uses the output in its response.

When testing ADK agents with the Code Execution tool, replace the sandbox executor with a controlled mock that captures the code submitted by the agent and returns a deterministic result. Assert that the submitted code contains the expected logic, that the agent correctly incorporates the execution output into its reply, and that it handles execution errors without crashing. This approach lets you test the agent's reasoning about code without running untrusted code in your test environment.
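One way to assert on submitted code without executing it is to parse it with Python's standard `ast` module and inspect the assignments. The following is a minimal sketch; the helper name `assigned_constants` and the sample snippet are illustrative assumptions, not part of ADK.

```python
import ast


def assigned_constants(code):
    """Return {name: value} for simple `name = <literal>` assignments.

    The code is only parsed, never executed.
    """
    constants = {}
    for node in ast.walk(ast.parse(code)):
        if (
            isinstance(node, ast.Assign)
            and len(node.targets) == 1
            and isinstance(node.targets[0], ast.Name)
            and isinstance(node.value, ast.Constant)
        ):
            constants[node.targets[0].id] = node.value.value
    return constants


# Sample of the kind of snippet an agent might submit (illustrative only).
submitted = (
    "base_price = 75000\n"
    "gst_rate = 18\n"
    "total = base_price * (1 + gst_rate / 100)\n"
    "print('Total with GST: Rs', round(total))"
)

values = assigned_constants(submitted)
assert values == {"base_price": 75000, "gst_rate": 18}
```

Comparing parsed values is stricter than a substring check: the string "base_price = 75000" is also contained in "base_price = 750000", whereas the parsed value 750000 would not equal 75000.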

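The example below mocks the code execution sandbox for a ShopMax India GST calculator agent. To keep the test self-contained, the agent is represented by a plain function, gst_calculator_agent, that builds the code and passes it to whatever executor it is given. The mock captures the submitted code and returns a fixed computed total. Tests verify that the agent submits code referencing the correct base price and GST rate, uses the result in its response, and handles a sandbox error gracefully.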

import pytest

# Every code snippet the agent submits to the mock sandbox is recorded here.
EXEC_LOG = []

@pytest.fixture(autouse=True)
def clear_exec_log():
    # Start each test with an empty log so tests do not depend on run order.
    EXEC_LOG.clear()

def mock_code_executor(code):
    # Stand-in for the sandbox: record the code, never execute it.
    EXEC_LOG.append(code)
    if "base_price" in code and "gst_rate" in code:
        # Fixed result for the test inputs: 75000 * 1.18 = 88500
        return {"output": "Total with GST: Rs 88500", "error": None}
    return {"output": None, "error": "NameError: base_price not defined"}

def gst_calculator_agent(product, base_price, gst_rate, executor):
    # Build the code the agent would submit to the sandbox.
    # round() is used instead of int() so that floating-point error
    # cannot truncate the total down by one rupee.
    code = (
        "base_price = " + str(base_price) + "\n"
        "gst_rate = " + str(gst_rate) + "\n"
        "total = base_price * (1 + gst_rate / 100)\n"
        "print('Total with GST: Rs', round(total))"
    )
    result = executor(code)
    if result["error"]:
        return {"success": False, "response": "Calculation failed: " + result["error"]}
    return {"success": True, "response": result["output"], "product": product}

def test_agent_submits_code_with_correct_values():
    gst_calculator_agent("OnePlus 12", 75000, 18, mock_code_executor)
    assert "base_price = 75000" in EXEC_LOG[0]
    assert "gst_rate = 18" in EXEC_LOG[0]

def test_agent_uses_execution_result():
    result = gst_calculator_agent("OnePlus 12", 75000, 18, mock_code_executor)
    assert result["success"] is True
    assert "Rs 88500" in result["response"]

def test_agent_handles_execution_error():
    # An executor that always fails simulates a sandbox timeout.
    result = gst_calculator_agent("OnePlus 12", 75000, 18, lambda code: {"output": None, "error": "Timeout"})
    assert result["success"] is False
    assert "Calculation failed" in result["response"]

Running the tests with pytest gives the following output:

... (3 passed in 0.01s)
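A module-level log works, but the same capture can also be done per test with `unittest.mock.Mock` from the standard library, which records the arguments of every call. The sketch below assumes the same executor contract as the example (a callable that takes a code string and returns a dict with "output" and "error" keys); `make_executor` is a hypothetical helper, and the agent is a compact copy of the stand-in function so the block runs on its own.

```python
from unittest.mock import Mock


def gst_calculator_agent(product, base_price, gst_rate, executor):
    # Compact copy of the stand-in agent from the example above.
    code = (
        f"base_price = {base_price}\n"
        f"gst_rate = {gst_rate}\n"
        "total = base_price * (1 + gst_rate / 100)\n"
        "print('Total with GST: Rs', round(total))"
    )
    result = executor(code)
    if result["error"]:
        return {"success": False, "response": "Calculation failed: " + result["error"]}
    return {"success": True, "response": result["output"], "product": product}


def make_executor(output=None, error=None):
    # Hypothetical helper: a Mock that returns one fixed sandbox result
    # and records every call it receives.
    return Mock(return_value={"output": output, "error": error})


def test_submitted_code_is_captured_per_test():
    executor = make_executor(output="Total with GST: Rs 88500")
    gst_calculator_agent("OnePlus 12", 75000, 18, executor)
    executor.assert_called_once()
    submitted = executor.call_args.args[0]
    assert "base_price = 75000" in submitted
    assert "gst_rate = 18" in submitted


def test_error_result_is_reported():
    executor = make_executor(error="Timeout")
    result = gst_calculator_agent("OnePlus 12", 75000, 18, executor)
    assert result["success"] is False
    assert "Timeout" in result["response"]
```

Because each test builds its own mock, no shared state has to be cleared between tests, and `assert_called_once()` additionally catches an agent that submits code more than once.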

In production, ShopMax India should run code execution in a sandboxed environment with strict memory and time limits to prevent runaway computations. Never let the agent execute code that contains unsanitized customer-supplied strings; validate all inputs before interpolating them into generated code. Log every submitted code snippet with a session ID so you can audit what was computed for any disputed order total or EMI calculation.
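The validation and audit-logging advice could be sketched roughly as follows. Everything here is an assumption for illustration: the function names, the logger name, and the accepted ranges (a non-negative price, a GST rate between 0 and 100) are not taken from ADK or from ShopMax India's actual rules.

```python
import logging
import math

# Hypothetical logger name, used only for this sketch.
audit_log = logging.getLogger("shopmax.code_execution")


def validate_gst_inputs(base_price, gst_rate):
    """Accept only finite numbers in an assumed sane range; raise ValueError otherwise."""
    for name, value in (("base_price", base_price), ("gst_rate", gst_rate)):
        # bool is a subclass of int, so it is excluded explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            raise ValueError(f"{name} must be a number, got {type(value).__name__}")
        if not math.isfinite(value):
            raise ValueError(f"{name} must be finite")
    if base_price < 0:
        raise ValueError("base_price must not be negative")
    if not 0 <= gst_rate <= 100:
        raise ValueError("gst_rate must be between 0 and 100")
    return base_price, gst_rate


def audited_executor(executor, session_id):
    """Wrap an executor so each submitted snippet is logged with the session ID."""
    def run(code):
        audit_log.info("session=%s submitted code:\n%s", session_id, code)
        result = executor(code)
        audit_log.info("session=%s result=%r", session_id, result)
        return result
    return run
```

Rejecting non-numeric values outright means a string such as "75000\nimport os" never reaches the code template, and wrapping the executor keeps the audit trail independent of how the agent itself is implemented.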
