Welcome to Data Engineering 101

Master the Art of Data Engineering

From fundamentals to advanced concepts, build your expertise in modern data infrastructure and practices.

Profile Photo

Hi, I'm Victor Elexpe 👋
Software Engineer specialized in Data Engineering and I'm building this guide to help you get started in the field of Data Engineering.

Essential Knowledge

A carefully curated guide focusing on the core concepts and practical skills needed in modern data engineering. No fluff, just the essentials.

High-Demand Career

Data Engineering is one of the fastest-growing tech careers. Companies across industries need skilled professionals to handle their data infrastructure.

Real-World Impact

Learn how to enable data-driven decisions, build scalable pipelines, design data warehouses, and power analytics and AI/ML applications.

Always Current

This guide is continuously updated to reflect the latest industry practices, tools, and technologies in data engineering.

Perfect for Beginners

Whether you're starting your journey or transitioning careers, this guide provides a clear path to understanding data engineering fundamentals.

Request new content

Have a suggestion or feedback for what to add? Send me a message on X and I'll do my best to add it.

Your Learning Journey

1

Foundations & Core Concepts

Start your journey with the essential building blocks of data engineering. Learn how modern data stacks work, understand basic data operations, and get hands-on with SQL fundamentals that power data manipulation.

2-3 hours
Data Engineering OverviewBasic SQLData Types & StructuresETL Fundamentals
2

Data Storage & Modeling

Dive into the world of data warehousing and learn how to design efficient data models. Master different schema types and understand how to structure data for optimal performance and accessibility.

4-5 hours
Data ModelingStar SchemaData Warehouse DesignDimensional ModelingData Lakehouse
3

Data Integration & Pipelines

Build robust data pipelines and learn modern ETL/ELT practices. Get hands-on experience with industry-standard tools and understand how to ensure reliable data flows in your infrastructure.

4-5 hours
ETL vs ELTApache AirflowTransformations FundamentalsData Quality ChecksPipeline Monitoring
4

Cloud & Modern Platforms

Explore the major cloud platforms and modern data tools that power today's data infrastructure. Learn how to work with cloud-native services and implement scalable data solutions.

5-6 hours
AWS ServicesGCP PlatformVersion Control SystemsAzure Data StackModern Data Stack Tools