SkillAgentSearch skills...

DeepAnalyze

DeepAnalyze is the first agentic LLM for autonomous data science. ๐ŸŽˆไฝ ็š„AIๆ•ฐๆฎๅˆ†ๆžๅธˆ๏ผŒ่‡ชๅŠจๅˆ†ๆžๅคง้‡ๆ•ฐๆฎ๏ผŒไธ€้”ฎ็”Ÿๆˆไธ“ไธšๅˆ†ๆžๆŠฅๅ‘Š๏ผ

Install / Use

/learn @ruc-datalab/DeepAnalyze

README

<p align="center" width="100%"> <img src="assets/logo.png" alt="DeepAnalyze" style="width: 60%; min-width: 300px; display: block; margin: auto;"> </p>

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

arXiv homepage model data star Badge wechat

twitter twitter twitter twitter twitter twitter

Authors: Shaolei Zhang, Ju Fan*, Meihao Fan, Guoliang Li, Xiaoyong Du

Renmin University of China, Tsinghua University

DeepAnalyze is the first agentic LLM for autonomous data science. It can autonomously complete a wide range of data-centric tasks without human intervention, supporting:

  • ๐Ÿ›  Entire data science pipeline: Automatically perform any data science tasks such as data preparation, analysis, modeling, visualization, and report generation.
  • ๐Ÿ” Open-ended data research: Conduct deep research on diverse data sources, including structured data (Databases, CSV, Excel), semi-structured data (JSON, XML, YAML), and unstructured data (TXT, Markdown), and finally produce analyst-grade research reports.
  • ๐Ÿ“Š Fully open-source: The model, code, training data, and demo of DeepAnalyze are all open-sourced, allowing you to deploy or extend your own data analysis assistant.
<p align="center" width="100%"> <img src="./assets/deepanalyze.jpg" alt="deepanalyze" style="width: 70%; min-width: 300px; display: block; margin: auto;"> </p>

๐Ÿ”ฅ News

  • [2026.03.16] Updated WebUI v2, featuring a smoother UI, support for the HeyWhale API, and support for Docker-based sandboxed code execution. More details in Readme .

  • [2025.12.28] ANNOUNCEMENT: DeepAnalyze API Keys Are Now Available ๐ŸŽ‰๐ŸŽ‰๐ŸŽ‰ You can now apply for your API key via this Google Form or this Feishu Form. For full details and usage instructions, please refer to the Guide or the Feishu Wiki.

  • [2025.11.13]: DeepAnalyze now supports OpenAI-style API endpointsis and is accessible through the Command Line Terminal UI. Thanks to the contributor @LIUyizheSDU

  • [2025.11.08]: DeepAnalyze is now accessible through the JupyterUI, building based on jupyter-mcp-server. Thanks to the contributor @ChengJiale150.

  • [2025.10.28]: We welcome all contributions, including improving the DeepAnalyze and sharing use cases (see CONTRIBUTION.md). All merged PRs will be listed as contributors.

  • [2025.10.27]: DeepAnalyze has attracted widespread attention, gaining 1K+ GitHub stars and 200K+ Twitter views within a week.

  • [2025.10.21]: DeepAnalyze's paper, code, model, training data are released!

๐Ÿ–ฅ Demo

WebUI

https://github.com/user-attachments/assets/04184975-7ee7-4ae0-8761-7a7550c5c8fe

<p align="center" width="100%"> Upload the data, DeepAnalyze can perform data-oriented deep research ๐Ÿ” and any data-centric tasks ๐Ÿ›  </p>
  • Clone this repo and download DeepAnalyze-8B.
  • Deploy DeepAnalyze-8B via vllm: vllm serve DeepAnalyze-8B
  • Run these scripts to launch the API and interface, and then interact through the browser (http://localhost:4000):
    cd demo/chat/frontend
    npm install
    cd ..
    bash start.sh
    
    # stop the api and interface
    bash stop.sh
    
    # stop vllm if needed
    
  • If you want to deploy under a specific IP, please replace localhost with your IP address in ./demo/chat/backend.py and ./demo/chat/frontend/lib/config.ts

WebUI v2

https://github.com/user-attachments/assets/2dd1d2aa-6fb9-4202-bc8d-cbe874844725

<p align="center" width="100%"> Upload the data, DeepAnalyze can perform data-oriented deep research ๐Ÿ” and any data-centric tasks ๐Ÿ›  </p>
  • A more streamlined UI

  • Added support for HeyWhale API keys

  • Added support for a Docker-based sandbox code execution environment.

  • The usage method is the same as WebUI.

    cd demo/chat_v2/frontendย  
    npm install
    cd ..
    cp .env.example .env 
    bash start.sh
    # stop the api and interface
    bash stop.sh
    
    # stop vllm if needed
    

JupyterUI

https://github.com/user-attachments/assets/a2335f45-be0e-4787-a4c1-e93192891c5f

<p align="center" width="100%"> Familiar with Jupyter Notebook? Try DeepAnalyze through the JupyterUI! </p>
  • This Demo runs Jupyter Lab as frontend, creating a new notebook, converting <Analyze|Understand|Answer> to Markdown cells, converting <Code> to Code cells and executing them as <Execute>.
  • Go to demo/jupyter to see more and try!
  • ๐Ÿ‘Thanks a lot to the contributor @ChengJiale150.

CLI

https://github.com/user-attachments/assets/018acae5-b979-4143-ae1e-5b74da453c1d

<p align="center" width="100%"> Try DeepAnalyze through the command-line interface </p>
  • Deploy DeepAnalyze-8B via vllm: vllm serve DeepAnalyze-8B

  • Start the API server and launch the CLI interface:

    cd API
    python start_server.py  # In one terminal
    
    cd demo/cli
    python api_cli.py       # In another terminal (English)
    # or
    python api_cli_ZH.py    # In another terminal (Chinese)
    
  • The CLI provides a Rich-based beautiful interface with file upload support and real-time streaming responses.

  • Supports both English and Chinese interfaces .

[!TIP]

Clone this repository to deploy DeepAnalyze locally as your data analyst, completing any data science tasks without any workflow or closed-source APIs.

๐Ÿ”ฅ The UI of the demo is an initial version. Welcome to further develop it, and we will include you as a contributor.

๐Ÿš€ Quick Start

๐Ÿ”‘ Use the DeepAnalyze API

API keys are now available!

To request your key, please fill out one of the following application forms:

๐Ÿ“š For comprehensive usage instructions, please refer to the API guide:

Model Download

Download model in RUC-DataLab/DeepAnalyze-8B ยท Hugging Face or DeepAnalyze-8B ยท ๆจกๅž‹ๅบ“

๐Ÿ“Š Memory Configuration Recommended Parameters Table

| GPU Memory | Model Type | Recommended max-model-len | Use FP8 KV Cache | |------------|------------|--------------------------|-----------------------| | 16GB | 8-bit Quantized | 8192 | โœ“ | | 16GB | 4-bit Quantized | 49152 | โœ“ | | 24GB | Original Model | 16384 | โœ“ | | 24GB | 8-bit Quantized | 98304 | โœ“ | | 24GB | 4-bit Quantized | 131072 | โœ“ | | 40GB | Original Model | 131072 | โœ“ | | 40GB | 8-bit Quantized | 131072 | | | 80GB | Original Model | 131072 |

Related Skills

View on GitHub
GitHub Stars3.8k
CategoryData
Updated3h ago
Forks552

Languages

Python

Security Score

100/100

Audited on Mar 22, 2026

No findings