In Browser
	StumbleUpon
	del.icio.us
	Google
	Google Buzz
	reddit
	LinkedIn

	Facebook
	Twitter
	Linkedin
	E-Mail

Agentic AI > ADK Agent Testing > ADK Agent CI/CD Pipeline with GitHub Actions

ADK Agent CI/CD Pipeline with GitHub Actions

Author: Venkata Sudhakar

ShopMax India deploys ADK agent updates frequently - prompt tuning, new tools, model upgrades. Without a CI/CD pipeline, regressions slip into production undetected. A well-structured pipeline runs all test layers automatically on every pull request: unit tests first (fast, cheap), then integration tests, then regression and contract tests. Only code that passes every layer gets merged and deployed, giving the team confidence to ship agent updates quickly.

The pipeline is organised in four stages with a fail-fast strategy: if unit tests fail, the remaining stages are skipped to save time. Each stage has a time budget - unit tests must complete in under 30 seconds, integration tests in under 2 minutes. The pipeline script is a plain Python file that can be run locally or triggered by GitHub Actions, GitLab CI, or any CI system. The same script runs in both environments, making local debugging straightforward.

The example shows ShopMax India's CI pipeline orchestrator. It runs four test stages in sequence, reports per-stage results, and exits with a non-zero code on any failure so the CI system marks the build as failed.

import subprocess
import sys
import time

PIPELINE_STAGES = [
    {"name": "Unit Tests", "cmd": ["python", "-m", "pytest", "tests/unit/", "-q"], "timeout": 30},
    {"name": "Integration Tests", "cmd": ["python", "-m", "pytest", "tests/integration/", "-q"], "timeout": 120},
    {"name": "Regression Tests", "cmd": ["python", "-m", "pytest", "tests/regression/", "-q"], "timeout": 120},
    {"name": "Contract Tests", "cmd": ["python", "-m", "pytest", "tests/contracts/", "-q"], "timeout": 60},
]

def run_stage(stage):
    start = time.time()
    result = subprocess.run(
        stage["cmd"],
        capture_output=True,
        text=True,
        timeout=stage["timeout"]
    )
    elapsed = round(time.time() - start, 1)
    passed = result.returncode == 0
    status = "PASSED" if passed else "FAILED"
    print(stage["name"] + ": " + status + " (" + str(elapsed) + "s)")
    if not passed:
        print(result.stdout[-300:])
    return passed

def run_pipeline():
    print("ShopMax India - ADK Agent CI Pipeline")
    print("=" * 40)
    for stage in PIPELINE_STAGES:
        if not run_stage(stage):
            print("Pipeline failed at: " + stage["name"])
            sys.exit(1)
    print("=" * 40)
    print("All stages passed. Ready to deploy.")

run_pipeline()

It gives the following output,

ShopMax India - ADK Agent CI Pipeline
========================================
Unit Tests: PASSED (2.1s)
Integration Tests: PASSED (18.4s)
Regression Tests: PASSED (31.2s)
Contract Tests: PASSED (4.7s)
========================================
All stages passed. Ready to deploy.

Set per-stage time budgets and fail the pipeline if a stage exceeds them - slow tests are a warning sign of missing mocks or real network calls in unit tests. Run the pipeline on every pull request and block merges on failure. Cache pip dependencies in CI to keep pipeline startup under 30 seconds. Add a nightly pipeline variant that runs the full suite with higher example counts (Hypothesis max_examples=1000) and real LLM calls against a staging endpoint, separate from the PR pipeline which always uses mocks.

Send your comments, suggestions or queries regarding this site to [email protected].