Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
$ npx skills add HKUSTDial/awesome-data-agents
Alternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
Machine learning with dataframes
Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
$ npx skills add HKUSTDial/awesome-data-agents
Easy to use Python library of customized functions for cleaning and analyzing data.
$ npx skills add akanz1/klib
OpenRefine is a free, open source power tool for working with messy data and improving it.
$ npx skills add OpenRefine/OpenRefine
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more.
$ npx skills add pandas-dev/pandas
Streamlit: A faster way to build and share data apps.
$ npx skills add streamlit/streamlit
Web apps in pure Python.
$ npx skills add reflex-dev/reflex
AKShare is an elegant and simple financial data interface library for Python, built for human beings! (An open-source financial data interface library.)
$ npx skills add akfamily/akshare
Danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.
$ npx skills add javascriptdata/danfojs
Statsmodels: statistical modeling and econometrics in Python.
$ npx skills add statsmodels/statsmodels
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning.
$ npx skills add scikit-learn-contrib/imbalanced-learn
C++ DataFrame for statistical, financial, and ML analysis in modern C++.
$ npx skills add hosseinmoein/DataFrame
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
$ npx skills add apache/hamilton
Data processing for and with foundation models!
$ npx skills add datajuicer/data-juicer
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
$ npx skills add sinaptik-ai/pandas-ai
Java dataframe and visualization library.
$ npx skills add jtablesaw/tablesaw
The Universal Storage Engine.
$ npx skills add TileDB-Inc/TileDB
How to choose
Use an alternative when it has a clearer install path, a higher trust score, fresher maintenance, or a better platform fit for your current agent stack. Keep the current skill if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.