Overview

Welcome to the AWS + Apache Iceberg Data Stack Template!

This template provides everything you need to build a modern data lakehouse on AWS using Apache Iceberg.

The infrastructure is fully automated with Terraform and includes:

An example ingestion pipeline: dlt + lambda
A dbt project transformation Iceberg data with Athena
AWS Step Functions to orchestrate ingestion + transformation
A Github CI workflow

The template is designed to be fully modular and customizable. You can easily swap out any component with your preferred tools:

Replace dlt with any other ingestion framework
Use an alternative to dbt for transformations
Switch to a different orchestration service instead of Step Functions

Next Steps

Key Concepts - Understand the core architecture and components
Get Started - Set up your environment and run your first deployment

NextKey Concepts

Last updated 11 months ago