Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.
$ npx skills add airbytehq/airbyteAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.
$ npx skills add airbytehq/airbyteClickHouse® is a real-time analytics database management system
$ npx skills add ClickHouse/ClickHouseOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
$ npx skills add trinodb/trinoFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
$ npx skills add pandas-dev/pandasThe world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
$ npx skills add StarRocks/starrocksThe Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
$ npx skills add gchq/CyberChefTurn any AI agent into an AI Scientist. The #1 Agent Skills library for science, used by 160,000+ scientists worldwide. 140 ready-to-use skills plus 100+ scientific databases covering biology, chemistry, medicine, and drug discovery. Compatible with Cursor, Claude Code, Codex, Pi, Antigravity, and the open Agent Skills standard.
$ npx skills add K-Dense-AI/scientific-agent-skillsStreamlit — A faster way to build and share data apps.
$ npx skills add streamlit/streamlit10 Weeks, 20 Lessons, Data Science for All!
$ npx skills add microsoft/Data-Science-For-Beginners🕸️ Web apps in pure Python 🐍
$ npx skills add reflex-dev/reflex微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。
$ npx skills add 666ghj/BettaFish🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
$ npx skills add lukasmasuch/best-of-ml-pythonAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
$ npx skills add akfamily/akshareGoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
$ npx skills add allinurl/goaccessOpenRefine is a free, open source power tool for working with messy data and improving it
$ npx skills add OpenRefine/OpenRefineStatsmodels: statistical modeling and econometrics in Python
$ npx skills add statsmodels/statsmodelsHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep Big Data Mapreduce Course if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.