tl  tr
  Home | Tutorials | Articles | Videos | Products | Tools | Search
Interviews | Open Source | Tag Cloud | Follow Us | Bookmark | Contact   
 Cloud Platforms > Google Cloud Platform (GCP) > Cloud Composer

Cloud Composer

Author: Venkata Sudhakar

Google Cloud Composer is a fully managed workflow orchestration service built on Apache Airflow. It allows you to create, schedule, monitor, and manage data pipelines across cloud and on-premises environments.

Key Features:

1. Apache Airflow - Built on open-source Airflow with full compatibility.

2. Fully managed - Google manages the Airflow infrastructure including upgrades and scaling.

3. GCP integration - Pre-built operators for BigQuery, Dataflow, Dataproc, GCS, and more.

4. Monitoring - Integrated with Cloud Logging and Cloud Monitoring for full observability.

5. Multi-cloud - Orchestrate workflows across GCP, AWS, Azure, and on-premises systems.

The below example shows an Apache Airflow DAG that orchestrates a data pipeline using Cloud Composer.


It gives the following output in the Airflow UI,

DAG: daily_sales_pipeline
Run: 2024-01-15 06:00:00
Tasks:
  load_sales_to_bigquery   -> SUCCESS (45s)
  aggregate_daily_sales    -> SUCCESS (12s)
  cleanup_raw_files        -> SUCCESS (3s)
Total Duration: 60s

Cloud Composer Environments:

Composer 1 - Based on GKE, uses Airflow 1.x and 2.x. Suitable for existing Airflow workloads.

Composer 2 - Improved performance, auto-scaling, and faster environment creation. Recommended for new deployments.


 
  


  
bl  br