996 skills found · Page 1 of 34
pandas-dev / PandasFlexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
mwaskom / SeabornStatistical data visualization in Python
openvenues / LibpostalA C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
unionai-oss / PanderaA light-weight, flexible, and expressive statistical data testing library
minimaxir / Facebook Page Post ScraperData scraper for Facebook Pages, and also code accompanying the blog post How to Scrape Data From Facebook Page Posts for Statistical Analysis
fonnesbeck / Statistical Analysis Python TutorialStatistical Data Analysis in Python
RainCloudPlots / RainCloudPlotsCode and tutorials to visualise your data that is both beautiful *and* statistically valid
piermorel / GrammGramm is a complete data visualization toolbox for Matlab. It provides an easy to use and high-level interface to produce publication-quality plots of complex data with varied statistical visualizations. Gramm is inspired by R's ggplot2 library.
erikaduan / R TipsR programming tips for data cleaning, data visualisation, statistical modelling and machine learning
moderndive / ModernDive BookStatistical Inference via Data Science: A ModernDive into R and the Tidyverse
privacytrustlab / Ml Privacy MeterPrivacy Meter: An open-source library to audit data privacy in statistical and machine learning algorithms.
datalib / LibextractExtract data from websites using basic statistical magic
julkaar9 / PynimatePython package for statistical data animations
IndrajeetPatil / StatsExpressionsTidy data frames and expressions with statistical summaries 📜
fastverse / FastverseAn Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
kolaczyk / SandStatistical Analysis of Network Data with R, 2nd Edition
WinVector / Vtreatvtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.
narunlifescience / AlphaPlot:chart_with_upwards_trend: Application for statistical analysis and data visualization which can generate different types of publication quality 2D and 3D plots with extensive visual customization.
royalosyin / Python Practical Application On Climate Variability StudiesThis tutorial is a companion volume of Matlab versionm but add more. Main objective is the transference of know-how in practical applications and management of statistical tools commonly used to explore meteorological time series, focusing on applications to study issues related with the climate variability and climate change. This tutorial starts with some basic statistic for time series analysis as estimation of means, anomalies, standard deviation, correlations, arriving the estimation of particular climate indexes (Niño 3), detrending single time series and decomposition of time series, filtering, interpolation of climate variables on regular or irregular grids, leading modes of climate variability (EOF or HHT), signal processing in the climate system (spectral and wavelet analysis). In addition, this tutorial also deals with different data formats such as CSV, NetCDF, Binary, and matlab'mat, etc. It is assumed that you have basic knowledge and understanding of statistics and Python.
neomatrix369 / Nlp ProfilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.