google-research / Tuning Playbook: A playbook for systematically maximizing the performance of deep learning models.
sgl-project / Sglang: SGLang is a high-performance serving framework for large language models and multimodal models.
NVIDIA / DeepLearningExamples: State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
arangodb / Arangodb: 🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.
seerge / G Helper: Lightweight, open-source control tool for ASUS laptops and ROG Ally. Manage performance modes, fans, GPU, battery, and RGB lighting across Zephyrus, Flow, TUF, Strix, Scar, and other models.
Const-me / Whisper: High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
nebuly-ai / Optimate: A collection of libraries to optimise AI model performance
tensorflow / Serving: A flexible, high-performance serving system for machine learning models
ponylang / Ponyc: Pony is an open-source, actor-model, capabilities-secure, high performance programming language
winfunc / Deepreasoning: A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.
gpustack / Gpustack: A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
Tencent / TNN: A uniform deep learning inference framework for mobile, desktop, and server, developed by Tencent Youtu Lab and Guangying Lab. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens support and performance optimization for mobile devices, and also draws on the good extensibility and high performance of existing open source efforts. TNN has been deployed in multiple Tencent apps, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome: collaborate with us to make TNN a better framework.
ibireme / YYModel: High performance model framework for iOS/OSX.
ModelTC / LightLLM: LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
thu-pacman / Chitu: High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
NVIDIA / TransformerEngine: A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
aksnzhy / Xlearn: High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM), with Python and CLI interfaces.
whylabs / Whylogs: An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
skytable / Skytable: Skytable is a modern scalable NoSQL database with BlueQL, designed for performance, scalability and flexibility. Skytable gives you spaces, models, data types, complex collections and more to build powerful experiences
modelscope / Evalscope: A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.