Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
$ npx skills add apache/airflowAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Code for "Efficient Data Processing in Spark" Course
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
$ npx skills add apache/airflowCLI task management & automation tool
$ npx skills add pydoit/doitA list of useful resources to learn Data Engineering from scratch
$ npx skills add adilkhash/Data-Engineering-HowToSmarter data pipelines for audio.
$ npx skills add spotify/klioPrivacy and Security focused Segment-alternative, in Golang and React
$ npx skills add rudderlabs/rudder-server🔥🔥🔥 Open source Reverse ETL - alternative to hightouch and census.
$ npx skills add Multiwoven/multiwovenStreaming reactive and dataflow graphs in Python
$ npx skills add 1kbgz/tributaryExample end to end data engineering project.
$ npx skills add damklis/DataEngineeringProjectPractical Data Engineering: A Hands-On Real-Estate Project Guide
$ npx skills add ssp-data/practical-data-engineeringMemphis.dev is a highly scalable and effortless data streaming platform
$ npx skills add superstreamlabs/memphisFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
$ npx skills add pandas-dev/pandasEnd-to-end Data Lakehouse project built on Databricks, following the Medallion Architecture (Bronze, Silver, Gold). Covers real-world data engineering and analytics workflows using Spark, PySpark, SQL, Delta Lake, and Unity Catalog. Designed for learning, portfolio building, and job interviews.
$ npx skills add DataWithBaraa/databricks_bootcamp_2026scikit-learn: machine learning in Python
$ npx skills add scikit-learn/scikit-learnEmpowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
$ npx skills add apache/shardingsphereAn end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
$ npx skills add airscholar/e2e-data-engineeringOpen-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.
$ npx skills add airbytehq/airbyteHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Efficient Data Processing Spark if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.