EdaSQL
edaSQL is a Python library that bridges SQL and Exploratory Data Analysis: you connect to a database, run your queries, and pass the query results to the EDA tool to get further insight into the data.
<img src="https://img.shields.io/pypi/v/edaSQL"> <img src="https://img.shields.io/readthedocs/edasql"> <img src="https://img.shields.io/static/v1?label=license&message=MIT&color=green"> <img src="https://img.shields.io/pypi/wheel/edaSQL"> <img src = "https://img.shields.io/pypi/pyversions/edaSQL"> <img src = "https://img.shields.io/github/commit-activity/w/selva221724/edaSQL"> <img src = "https://img.shields.io/github/languages/code-size/selva221724/edaSQL">
SQL Bridge Tool to Exploratory Data Analysis
edaSQL is a library that links SQL to Exploratory Data Analysis and, further, to Data Engineering. It aims to address many limitations of the SQL studios available on the market. Use the SQL query language to get your table results.
Installation
Install the dependency packages before installing edaSQL:
pip install pyodbc
pip install ipython
Optional dependency for better visualization - Jupyter Notebook
pip install notebook
Now install edaSQL using pip (the official Python package is on PyPI):
pip install edaSQL
(OR)
Clone this repository and run the following from its root directory to install:
python setup.py install
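To confirm that the installation steps above took effect, a small check along these lines can help. This is a minimal sketch, not part of edaSQL: the helper name `check_installed` is hypothetical, and the import names are assumptions based on the packages listed above (the `ipython` package is imported as `IPython`).

```python
from importlib.util import find_spec

# Import names are assumptions derived from the pip packages named above;
# the "ipython" distribution is imported as "IPython".
REQUIRED = ["pyodbc", "IPython", "edaSQL"]
OPTIONAL = ["notebook"]


def check_installed(names):
    """Return {module_name: True/False} without importing the modules."""
    return {name: find_spec(name) is not None for name in names}


status = check_installed(REQUIRED + OPTIONAL)
for name, found in status.items():
    print(f"{name:10s} {'found' if found else 'missing'}")
```

Any module reported as missing can be installed with the matching `pip install` command from this section.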
Documentation
<img src="https://blog.readthedocs.com/_static/logo-opengraph.png" width="20%" height="20%">Read the detailed documentation on readthedocs.io (still under development).
License
edaSQL is released under the MIT license.
Need help?
Stuck on your edaSQL code or problem? Any other questions? Don't hesitate to send me an email (selva221724@gmail.com).
edaSQL Jupyter NoteBook Tutorial
Access the sample Jupyter Notebook here!!
Access the Sample Data Used in this Repo
- CSV
- Database backup (you can restore the DB in SQL Studio)
edaSQL for DataFrame: If you are using a CSV or Excel file as the source, read it with Pandas and start from 3. Data Overview.
Import Packages
import edaSQL
import pandas as pd
1. Connect to the DataBase
edasql = edaSQL.SQL()
edasql.connectToDataBase(server='your server name',
                         database='your database',
                         user='username',
                         password='password',
                         sqlDriver='ODBC Driver 17 for SQL Server')
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/db_connected.png">
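For orientation, the parameters passed to `connectToDataBase` map naturally onto a standard ODBC connection string of the kind `pyodbc` accepts. The sketch below only assembles that string; the helper `build_connection_string` is hypothetical, and whether edaSQL builds its connection exactly this way internally is an assumption.

```python
def build_connection_string(server, database, user, password, sql_driver):
    """Assemble an ODBC connection string from the connection parameters."""
    parts = {
        # Braces keep the spaces in the driver name intact.
        "DRIVER": "{" + sql_driver + "}",
        "SERVER": server,
        "DATABASE": database,
        "UID": user,
        "PWD": password,
    }
    return ";".join(f"{key}={value}" for key, value in parts.items())


conn_str = build_connection_string(
    server="your server name",
    database="your database",
    user="username",
    password="password",
    sql_driver="ODBC Driver 17 for SQL Server",
)
print(conn_str)

# With pyodbc installed and a reachable server, the string could be used as:
#   import pyodbc
#   connection = pyodbc.connect(conn_str)
```

If the connection fails, the driver name is a common culprit: it has to match an ODBC driver that is actually installed on the machine.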
2. Query Data
sampleQuery = "select * from INX"
data = pd.read_sql(sampleQuery, edasql.dbConnection)
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/data_sample.png">
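To try the query step without a SQL Server instance, the same SQL can be run against an in-memory SQLite database from the Python standard library. This is only a stand-in sketch: the `INX` table contents and column names below are invented for illustration and are not the sample data from this repository.

```python
import sqlite3

# In-memory database as a stand-in for the real server (assumption).
connection = sqlite3.connect(":memory:")
connection.row_factory = sqlite3.Row  # rows behave like mappings

# Hypothetical columns and values, for illustration only.
connection.executescript("""
    CREATE TABLE INX (EmpNumber TEXT, Age INTEGER, Rating INTEGER);
    INSERT INTO INX VALUES ('E1', 32, 3), ('E2', 47, 4), ('E3', 40, 2);
""")

sampleQuery = "select * from INX"
rows = [dict(row) for row in connection.execute(sampleQuery)]
connection.close()

for row in rows:
    print(row)
```

`pd.read_sql` also accepts a `sqlite3` connection, so the same stand-in can feed a DataFrame for the steps that follow.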
<div id="Chapter1"></div>
3. Data Overview
insights = edaSQL.EDA(dataFrame=data, HTMLDisplay=True)
dataInsights = insights.dataInsights()
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/1.png">
deepInsights = insights.deepInsights()
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/2.png">
4. Correlation
eda = edaSQL.EDA(dataFrame=data)
eda.pearsonCorrelation()
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/3.png">
eda.spearmanCorrelation()
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/4.png">
eda.kendallCorrelation()
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/5.png">
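To make the difference between these coefficients concrete, here is a minimal standard-library sketch of Pearson and Spearman correlation on toy data (Kendall is not covered). It illustrates the textbook definitions and is not edaSQL's implementation; the function names and data are made up for the example.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]  # monotonic but non-linear (y = x ** 3)

pearson_r = pearson(x, y)
spearman_r = spearman(x, y)
print(f"pearson={pearson_r:.3f} spearman={spearman_r:.3f}")
```

Because `y` rises monotonically with `x`, the rank-based Spearman coefficient is essentially 1, while Pearson is lower because the relationship is not linear.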
5. Missing Values
eda.missingValuesPlot(plot='matrix')
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/6.png">
eda.missingValuesPlot(plot='bar')
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/7.png">
eda.missingValuesPlot(plot='heatmap')
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/8.png">
eda.missingValuesPlot(plot='dendrogram')
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/9.png">
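The numbers behind plots like these are per-column counts of missing cells. A minimal standard-library sketch on made-up rows is shown here; the column names and values are hypothetical, and `None` stands in for a missing value.

```python
# Hypothetical rows; None marks a missing value.
rows = [
    {"age": 32, "salary": 4200, "city": "Pune"},
    {"age": None, "salary": 3900, "city": None},
    {"age": 45, "salary": None, "city": "Chennai"},
    {"age": 29, "salary": 5100, "city": None},
]
columns = list(rows[0])

# Count and share of missing cells per column (what a bar plot summarizes).
missing_counts = {c: sum(row[c] is None for row in rows) for c in columns}
missing_share = {c: missing_counts[c] / len(rows) for c in columns}

# Text version of a nullity matrix: '#' = present, '.' = missing.
matrix_lines = [
    "".join("#" if row[c] is not None else "." for c in columns)
    for row in rows
]

print(missing_counts)
print("\n".join(matrix_lines))
```

With a Pandas DataFrame, the same counts are available through `data.isna().sum()`.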
6. Outliers
eda.outliersVisualization(plot='box')
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/10.png">
eda.outliersVisualization(plot='scatter')
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/11.png">
outliers = eda.getOutliers()
<img src="https://raw.githubusercontent.com/selva221724/edaSQL/main/readme_src/notebook_results/12.png">
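One widely used way to flag outliers, and the rule behind the whiskers of a typical box plot, is the 1.5 × IQR criterion. The sketch below applies it to toy data with the standard library only. Whether `getOutliers` uses this rule is an assumption; the helper name `iqr_outliers` and the data are hypothetical.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical measurements with one obvious outlier.
values = [10, 12, 11, 13, 12, 11, 95]
outliers = iqr_outliers(values)
print(outliers)
```

Raising `k` makes the rule more tolerant; `k=3` is sometimes used to flag only extreme outliers.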
