Dagster & ETL
Learn how to ingest data to power your assets. You’ll build custom pipelines and see how to use Embedded ETL and Dagster Components to build out your data platform.
This course is geared towards those who have some familiarity with Dagster and have ETL solutions.
You'll need to know the basics of Dagster to complete this course. We recommend completing check out the Dagster Essentials course if you've never used Dagster before or want a refresher before getting started.
While you don’t need to be a Python expert to get started, you do need some Python familiarity to complete this course and use Dagster.
About this course
What is ETL?
ETL and Dagster
Project preview
Prerequisites and installation
Set up local
Set up codespaces
Github Codespace
Overview
File import
Data integrity
Partitions
Complex partitions
Triggering partitions
Knowledge check
Cloud storage
Overview
APIs
API resource
ETL with API
API Dagster assets
Knowledge check
Triggering API job
Backfilling from APIs
Overview
dlt
Basic dlt
Dagster and dlt
Knowledge check
Refactoring static data with dlt
Refactoring APIs with dlt
Overview
Database replication
Knowledge check
Sling
Sling database replication set up
Dagster and Sling
Managing Sling assets