Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
$ npx skills add apache/airflowAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
$ npx skills add apache/airflowExample end to end data engineering project.
$ npx skills add damklis/DataEngineeringProjectCode for "Efficient Data Processing in Spark" Course
$ npx skills add josephmachado/efficient_data_processing_sparkChange data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
$ npx skills add debezium/debeziumDrop-in replacement for Apache Spark UI
$ npx skills add dataflint/sparkPrivacy and Security focused Segment-alternative, in Golang and React
$ npx skills add rudderlabs/rudder-server🔥🔥🔥 Open source Reverse ETL - alternative to hightouch and census.
$ npx skills add Multiwoven/multiwovenMemphis.dev is a highly scalable and effortless data streaming platform
$ npx skills add superstreamlabs/memphisEnd-to-end Data Lakehouse project built on Databricks, following the Medallion Architecture (Bronze, Silver, Gold). Covers real-world data engineering and analytics workflows using Spark, PySpark, SQL, Delta Lake, and Unity Catalog. Designed for learning, portfolio building, and job interviews.
$ npx skills add DataWithBaraa/databricks_bootcamp_2026Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
$ npx skills add apache/shardingsphereOpen-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.
$ npx skills add airbytehq/airbyteCLI task management & automation tool
$ npx skills add pydoit/doitDrop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.
$ npx skills add lakehq/sailFlink CDC is a streaming data integration tool
$ npx skills add apache/flink-cdcConduit streams data between data stores. Kafka Connect replacement. No JVM required.
$ npx skills add ConduitIO/conduitThe leader in Customer Data Infrastructure
$ npx skills add snowplow/snowplowHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep E2e Data Engineering if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.