91 skills found · Page 1 of 4
OpenRefine / OpenRefine: OpenRefine is a free, open source power tool for working with messy data and improving it
great-expectations / Great Expectations: Always know what to expect from your data.
sfu-db / Dataprep: Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
yobulkdev / Yobulkdev: 🔥 🔥 🔥 Open Source & AI driven Data Onboarding Platform: Free flatfile.com alternative
DataCanvasIO / HyperGBM: A full pipeline AutoML tool for tabular data
sharmaroshan / Twitter Sentiment Analysis: A natural language processing problem in which sentiment analysis is performed by classifying positive and negative tweets with machine learning models, using text mining, text analysis, data analysis, and data visualization.
DataKitchen / Data Observability Installer: Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
imdevskp / Covid 19 Jhu Data Web Scrap And Cleaning: This repository contains data and code used to get and clean data from https://github.com/CSSEGISandData/COVID-19 and https://www.worldometers.info/coronavirus/
prasanthg3 / Cleantext: An open-source package for Python to clean raw text data
benchopt / Benchmark Bilevel: Benchmark for bi-level optimization solvers
imdevskp / Covid 19 India Data: Data and code for scraping and cleaning data on COVID-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/
sjyk / Datacleaning Benchmark: No description available
data-cleaning / Validatedb: Validate on a table in a DB, using dbplyr
irworkshop / Accountability Datacleaning: A collection of scripts and processing notes used for The Accountability Project.
DemonDamon / Tdxfinder Futures Dataclearer: Deduplicates and cleans Tongdaxin (通达信) data and stores it in MongoDB to make later research easier
sayaliwalke30 / Kaggle Projects: This repo contains 4 different projects. Various machine learning models were built for Kaggle competitions, along with exploratory data analysis, data cleaning, data visualization, data munging, feature selection, etc.
RonKG / Machine Learning Projects 2: No description available
hoshigan / Supply Chain Analytic Just In Time Company: The project provides a real-world dataset focusing on supply chain analytics
mne-tools / Mne Denoise: mne-denoise provides narrow-band artefact removal tailored to MNE-Python workflows. It wraps harmonic regression techniques to suppress power-line noise and other oscillatory contaminants while preserving signal rank and interpretability.
weismanm12 / Finances Database: Personal finance database creation, SQL analysis, and Power BI dashboard