← The catalogue
Learning path
The 5 core skills of a Data Engineer
A Data Engineer's job is to move reliable data, on time, at scale. These are the five skills that hold it up — and the open-source tools in the kiosk to master them.
Take the self-assessment →01
SQL and data modeling
The language of data. Query, transform and model schemas others can rely on without breaking.
02
Programming with Python
The glue of the whole stack: automate, connect APIs and write the logic tools don't ship with.
In the kioskPython
03
Orchestration and pipelines
Chain steps, schedule runs and recover from failures with no manual babysitting.
04
Processing at scale (batch & streaming)
When data won't fit on one machine: process it distributed, in batches or in real time.
05
Storage, formats and lakehouse
Where and how to store data so it's cheap to keep and fast to query.