Alternatives

Awesome Open Source Data Engineering alternatives for AI agents.

Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.

Current skill

Awesome Open Source Data Engineering

A curated list of open source tools used in analytics platforms and data engineering ecosystem

49
Quality
74
Trust
569
Stars
#1

Airflow

Similarity 130Trust 98Excellent 100

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

46K starsJun 16, 2026 pushdata-analysisPythonETL
$ npx skills add apache/airflow
#2

Airbyte

Similarity 129Trust 94Excellent 100

Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

21K starsJun 14, 2026 pushdata-analysisPythonData Analysis
$ npx skills add airbytehq/airbyte
#3

Conduit

Similarity 125Trust 89Excellent 85

Conduit streams data between data stores. Kafka Connect replacement. No JVM required.

600 starsJun 13, 2026 pushdata-analysisGoData Pipeline
$ npx skills add ConduitIO/conduit
#4

Flink Cdc

Similarity 125Trust 94Excellent 100

Flink CDC is a streaming data integration tool

6.4K starsJun 3, 2026 pushdata-analysisJavaData Pipeline
$ npx skills add apache/flink-cdc
#5

Snowplow

Similarity 125Trust 91Excellent 100

The leader in Customer Data Infrastructure

7.0K starsJun 15, 2026 pushdata-analysisScalaData Pipeline
$ npx skills add snowplow/snowplow
#6

Flow

Similarity 124Trust 85Strong 82

🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, by managing your data flows with Estuary. 🌊

936 starsJun 12, 2026 pushdata-analysisRustData Pipeline
$ npx skills add estuary/flow
#7

Seatunnel Web

Similarity 124Trust 89Strong 84

Modern SeaTunnel Web UI with visual DAG pipelines, batch & streaming sync, connector management, built-in metrics, and runtime logs.

505 starsJun 14, 2026 pushdata-analysisTypeScriptData Pipeline
$ npx skills add weifuwan/seatunnel-web
#8

Rudder Server

Similarity 123Trust 89Excellent 100

Privacy and Security focused Segment-alternative, in Golang and React

4.4K starsJun 12, 2026 pushdata-analysisGoData Pipeline
$ npx skills add rudderlabs/rudder-server
#9

Ingestr

Similarity 123Trust 88Excellent 100

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

3.7K starsJun 13, 2026 pushdata-analysisGoData Pipeline
$ npx skills add bruin-data/ingestr
#10

Multiwoven

Similarity 122Trust 93Excellent 100

🔥🔥🔥 Open source Reverse ETL - alternative to hightouch and census.

1.7K starsJun 10, 2026 pushdata-analysisRubyData Pipeline
$ npx skills add Multiwoven/multiwoven
#11

Memphis

Similarity 121Trust 84Excellent 92

Memphis.dev is a highly scalable and effortless data streaming platform

3.4K starsMar 2, 2026 pushdata-analysisGoData Pipeline
$ npx skills add superstreamlabs/memphis
#12

Databricks Bootcamp 2026

Similarity 120Trust 81Strong 73

End-to-end Data Lakehouse project built on Databricks, following the Medallion Architecture (Bronze, Silver, Gold). Covers real-world data engineering and analytics workflows using Spark, PySpark, SQL, Delta Lake, and Unity Catalog. Designed for learning, portfolio building, and job interviews.

344 starsJan 19, 2026 pushdata-analysisJupyter NotebookData Pipeline
$ npx skills add DataWithBaraa/databricks_bootcamp_2026
#13

Debezium

Similarity 119Trust 97Excellent 100

Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.

13K starsJun 12, 2026 pushdata-analysisJavaData Pipeline
$ npx skills add debezium/debezium
#14

Awesome Geospatial

Similarity 118Trust 94Excellent 100

Long list of geospatial tools and resources

5.1K starsJun 14, 2026 pushdata-analysisData AnalysisClaude Code
$ npx skills add sacridini/Awesome-Geospatial
#15

Devlake

Similarity 118Trust 96Excellent 100

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.

3.0K starsJun 12, 2026 pushdata-analysisGoData Analysis
$ npx skills add apache/devlake
#16

Awesome Python Data Science

Similarity 117Trust 91Excellent 100

Probably the best curated list of data science software in Python.

3.5K starsApr 13, 2026 pushdata-analysisData AnalysisClaude Code
$ npx skills add krzjoa/awesome-python-data-science

How to choose

When should you switch?

Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Awesome Open Source Data Engineering if it already passes your workflow test and repository review.

Next step

Compare top candidates side by side

Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.