Role Roadmap
Data Engineer
Design and run distributed data pipelines, lakehouses, and streaming systems.
7 technologies in this stack — ordered Core → Edge → Hybrid bonus. Click any tile to start practicing.
Core — lands the role. Baseline skills hiring managers screen for. Master these and you can land the role at market CTC.
Edge — lifts your CTC band. Rarer skills that strengthen negotiation — they can push the offer toward the top of the band (e.g. ₹30–40L+ for senior data roles).
Hybrid bonus — broadens your profile. Cross-domain extras (e.g. Power BI for a Data Engineer) that open hybrid roles and raise your profile weightage with recruiters.
- Core1
Databricks
Unity Catalog, Delta Live Tables, Workflows, MLflow, and Lakehouse architecture.
Open practice tracks → - Core2
Apache PySpark
Distributed data processing with Python and Apache Spark — the most asked skill in data engineering interviews.
Open practice tracks → - Core3
SQL & Advanced SQL
Query optimization, window functions, CTEs, and complex joins. Asked in every data role.
Open practice tracks → - Edge4
Apache Kafka
Event streaming, producer/consumer patterns, partitioning, exactly-once semantics.
Open practice tracks → - Edge5
Apache Airflow
DAG design, operators, sensors, TaskFlow API, scheduling, and production debugging.
Open practice tracks → - Hybrid bonus6
dbt (Data Build Tool)
Models, tests, macros, incremental strategies, and CI/CD for data transformation.
Open practice tracks → - Hybrid bonus7
Snowflake
Virtual warehouses, clustering, Time Travel, Streams, Tasks, and cost optimization.
Open practice tracks →