Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
$ npx skills add apache/airflowAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
$ npx skills add apache/airflowOpen-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.
$ npx skills add airbytehq/airbyteConduit streams data between data stores. Kafka Connect replacement. No JVM required.
$ npx skills add ConduitIO/conduitFlink CDC is a streaming data integration tool
$ npx skills add apache/flink-cdcThe leader in Customer Data Infrastructure
$ npx skills add snowplow/snowplow🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, by managing your data flows with Estuary. 🌊
$ npx skills add estuary/flowModern SeaTunnel Web UI with visual DAG pipelines, batch & streaming sync, connector management, built-in metrics, and runtime logs.
$ npx skills add weifuwan/seatunnel-webPrivacy and Security focused Segment-alternative, in Golang and React
$ npx skills add rudderlabs/rudder-serveringestr is a CLI tool to copy data between any databases with a single command seamlessly.
$ npx skills add bruin-data/ingestr🔥🔥🔥 Open source Reverse ETL - alternative to hightouch and census.
$ npx skills add Multiwoven/multiwovenMemphis.dev is a highly scalable and effortless data streaming platform
$ npx skills add superstreamlabs/memphisEnd-to-end Data Lakehouse project built on Databricks, following the Medallion Architecture (Bronze, Silver, Gold). Covers real-world data engineering and analytics workflows using Spark, PySpark, SQL, Delta Lake, and Unity Catalog. Designed for learning, portfolio building, and job interviews.
$ npx skills add DataWithBaraa/databricks_bootcamp_2026Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
$ npx skills add debezium/debeziumLong list of geospatial tools and resources
$ npx skills add sacridini/Awesome-GeospatialApache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
$ npx skills add apache/devlakeProbably the best curated list of data science software in Python.
$ npx skills add krzjoa/awesome-python-data-scienceHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Awesome Open Source Data Engineering if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.