12,478 skills found · Page 3 of 416
VerifyTests / VerifyVerify is a snapshot testing tool that simplifies the assertion of complex data models and documents.
QData / TextAttackTextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
limix-ldm-ai / LimiXLimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505
deta / SurfPersonal AI Notebooks. Organize files & webpages and generate notes from them. Open source, local & open data, open model choice (incl. local).
jupyter-naas / Awesome Notebooks[Legacy] Data & AI Notebook templates catalog organized by tools, following the IMO (input, model, output) framework for easy usage and discovery..
FreedomIntelligence / LLMZoo⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
microsoft / MCPCatalog of official Microsoft MCP (Model Context Protocol) server implementations for AI-powered data access and tool integration
thunlp / UltraChatLarge-scale, Informative, and Diverse Multi-round Chat Data (and Models)
whylabs / WhylogsAn open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
skytable / SkytableSkytable is a modern scalable NoSQL database with BlueQL, designed for performance, scalability and flexibility. Skytable gives you spaces, models, data types, complex collections and more to build powerful experiences
instill-ai / Instill Core🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
iver56 / AudiomentationsA Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
open-thoughts / Open ThoughtsFully open data curation for reasoning models
OpenGVLab / InternVideo[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Tencent / MedicalNetMany studies have shown that the performance on deep learning is significantly affected by volume of training data. The MedicalNet project provides a series of 3D-ResNet pre-trained models and relative code.
globalize / GlobalizeRails I18n de-facto standard library for ActiveRecord model/data translation.
ChristianKienle / Core Data EditorCore Data Editor lets you easily view, edit and analyze applications‘ data. Core Data Editor is compatible with Mac and iOS applications and supports XML, SQLite and binary stores, visualizes all relationships and is able to edit the data and generate Objective-C code for the data model.
tensorflow / PrivacyLibrary for training machine learning models with privacy for training data
WenjieDu / PyPOTSA Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/classification/clustering/forecasting/anomaly detection/cleaning on incomplete industrial (irregularly-sampled) multivariate TS with NaN missing values
facebookresearch / DenoiserReal Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.