A list of useful resources to learn Data Engineering from scratch
$ npx skills add adilkhash/Data-Engineering-HowTo
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Practical Data Engineering: A Hands-On Real-Estate Project Guide
A list of useful resources to learn Data Engineering from scratch
$ npx skills add adilkhash/Data-Engineering-HowTo
Privacy and Security focused Segment-alternative, in Golang and React
$ npx skills add rudderlabs/rudder-server
🔥🔥🔥 Open source Reverse ETL - alternative to hightouch and census.
$ npx skills add Multiwoven/multiwoven
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
$ npx skills add apache/airflow
Memphis.dev is a highly scalable and effortless data streaming platform
$ npx skills add superstreamlabs/memphis
10 Weeks, 20 Lessons, Data Science for All!
$ npx skills add microsoft/Data-Science-For-Beginners
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
$ npx skills add debezium/debezium
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
$ npx skills add ConduitIO/conduit
Flink CDC is a streaming data integration tool
$ npx skills add apache/flink-cdc
The leader in Customer Data Infrastructure
$ npx skills add snowplow/snowplow
🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, by managing your data flows with Estuary. 🌊
$ npx skills add estuary/flow
Modern SeaTunnel Web UI with visual DAG pipelines, batch & streaming sync, connector management, built-in metrics, and runtime logs.
$ npx skills add weifuwan/seatunnel-web
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
$ npx skills add whylabs/whylogs
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
$ npx skills add bruin-data/ingestr
Example end to end data engineering project.
$ npx skills add damklis/DataEngineeringProject
OLake - Fastest Databases, Kafka & S3 Replication to Apache Iceberg with Table optimization (Called OLake Fusion). ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supported sources: Postgres, MongoDB, MySQL, Oracle, MSSql, DB2, Kafka, S3.
$ npx skills add datazip-inc/olake
How to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Practical Data Engineering if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.
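One way to make the repository check less manual is to turn each install command into a link you can open and review. The following is a minimal sketch, not part of the skills tooling: the `repo_urls` helper is hypothetical, and it assumes that every `owner/repo` slug in a `$ npx skills add` line corresponds to a GitHub repository at `https://github.com/owner/repo`.

```python
import re

# Matches lines of the form "$ npx skills add owner/repo".
INSTALL_RE = re.compile(r"^\$ npx skills add (?P<owner>[\w.-]+)/(?P<repo>[\w.-]+)$")


def repo_urls(lines):
    """Return a review URL for every install command found in `lines`.

    Assumption: each owner/repo slug maps to https://github.com/owner/repo.
    Lines that are not install commands (descriptions, headings) are skipped.
    """
    urls = []
    for line in lines:
        match = INSTALL_RE.match(line.strip())
        if match:
            urls.append(f"https://github.com/{match['owner']}/{match['repo']}")
    return urls


if __name__ == "__main__":
    listing = [
        "Apache Airflow - A platform to programmatically author, schedule, and monitor workflows",
        "$ npx skills add apache/airflow",
        "Change data capture for a variety of databases.",
        "$ npx skills add debezium/debezium",
    ]
    for url in repo_urls(listing):
        print(url)
```

The script only builds links; it does not install anything, so it is safe to run outside a sandbox before you decide which commands to test.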