ACADEMY

Academy / Courses

Understanding Datasets, Databases, and Data Lakes

32 lessons✓ Certificate of completion

About this course

A comprehensive technical course covering the full data engineering stack: relational databases, data lakes, cloud platforms, file formats, security, AI integration, and hands-on labs. Designed for engineers who want to build production-grade data platforms.

Course content

The Modern Data Landscape

  • The Modern Data Landscape
  • Datasets in Depth

Database Fundamentals

  • What Is a Database?
  • The 15 Major Types of Databases

Relational Databases and SQL

  • Relational Databases and SQL Foundations
  • Oracle Database — Technical Deep Dive
  • Microsoft SQL Server — Technical Deep Dive
  • PostgreSQL — Technical Deep Dive
  • MariaDB and MySQL — Technical Deep Dive

Database Administration

  • Database Administration Fundamentals

Data Warehouses, Lakes, and Lakehouses

  • Data Warehouses, Data Lakes, and Lakehouses
  • Data Lake Architecture
  • Dataset Sources and Access Methods

Data Engineering and Processing

  • Data Ingestion and Pipeline Design
  • Data Processing with Python, SQL, DuckDB, and Spark

File Formats and Cloud Platforms

  • File Formats for Analytics and AI
  • Cloud Data Platforms

Security, Operations, and AI Integration

  • Security, Governance, and Compliance
  • Backup, Restore, Retention, and Disaster Recovery
  • Databases and AI Integration
  • Observability and Troubleshooting

Hands-On Labs

  • Lab Environment Setup
  • Lab 1 — Build a Database Source
  • Lab 2 — Build the Data Lake Folder Structure
  • Lab 3 — Ingest Data into Bronze
  • Lab 4 — Clean and Standardize Data into Silver
  • Lab 5 — Build Gold Analytics Tables
  • Lab 6 — Create a Dashboard
  • Lab 7 — AI Integration Lab

Capstone, Career, and Reference

  • Capstone Project
  • Job Preparation and Career Roadmap
  • Glossary and References