Overview

Welcome to the AWS + Apache Iceberg Data Stack Template!

This template provides everything you need to build a modern data lakehouse on AWS using Apache Iceberg.

Iceberg Stack Overview

The infrastructure is fully automated with Terraform and includes:

  • An example ingestion pipeline: dlt running on AWS Lambda

  • A dbt project that transforms Iceberg data with Athena

  • AWS Step Functions to orchestrate ingestion + transformation

  • A GitHub CI workflow

The template is designed to be fully modular and customizable. You can easily swap out any component with your preferred tools:

  • Replace dlt with any other ingestion framework

  • Use an alternative to dbt for transformations

  • Switch to a different orchestration service instead of Step Functions


Next Steps

  1. Key Concepts - Understand the core architecture and components

  2. Get Started - Set up your environment and run your first deployment
