129 skills found · Page 1 of 5
NVIDIA-Merlin / NVTabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
NVIDIA-Merlin / MerlinNVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.
google / TemporianTemporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
pytorch / TorcharrowHigh performance model preprocessing library on PyTorch
eltonlaw / ImpyuteData imputations library to preprocess datasets with missing data
arkworks-rs / MarlinA Rust library for the Marlin preprocessing zkSNARK
facebookresearch / StopesA library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.
kaniblu / Hangul UtilsAn integrated library for Korean language preprocessing.
NIHOPA / NLPrePython library for Natural Language Preprocessing (NLPre)
FuSiry / OpenSAAiming at the common training datsets split, spectrum preprocessing, wavelength select and calibration models algorithm involved in the spectral analysis process, a complete algorithm library is established, which is named opensa (openspectrum analysis).
lyeoni / PrenlpPreprocessing Library for Natural Language Processing
wang-fujin / Battery Dataset Preprocessing Code LibraryA code library for reading and preprocessing public battery dataset
NadavIs56 / Skin Disease AIA Python-based computer vision and AI system for skin disease recognition and diagnosis. Led end-to-end project pipeline, including data gathering, preprocessing, and training models. Utilized Keras, TensorFlow, OpenCV, and other libraries for image processing and CNN models, showcasing expertise in deep learning and machine learning techniques.
aboutmydreams / PycaptThe python verification code processing library pycapt supports extremely convenient verification code preprocessing and generation, and assists machine learning in automatically generating training sets.
NoorBayan / TanqeehTanqeeh is a Python library designed to preprocess and clean Arabic text efficiently. It provides a comprehensive set of functions to normalize, remove unwanted characters, fix spacing issues, and enhance text quality for NLP applications.
ARBML / TnkeehArabic cleaning, normalization and segmentation library.
TakeLab / PodiumPodium: a framework agnostic Python NLP library for data loading and preprocessing
ffengc / Boost Search EngineThis search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced search functionalities. It uses C++, HTML, CSS, and JavaScript, enhancing access to Boost technical documents for developers.
helloooideeeeea / RealTimeCutVADLibraryA real-time Voice Activity Detection (VAD) library for iOS and macOS using Silero models powered by ONNX Runtime. Includes advanced noise suppression and audio preprocessing with WebRTC APM, supporting seamless WAV data output with header metadata.
lucasrla / Wsi PreprocessingSimple library for preprocessing histopathological whole-slide images (WSI) into tiles (a.k.a. patches) towards deep learning