237 skills found · Page 1 of 8
plasma-umass / ScaleneScalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
AdnanHodzic / Auto CpufreqAutomatic CPU speed & power optimizer for Linux
Diorser / LiteMonitor一款轻量级、高度可定制的 Windows桌面和任务栏硬件性能监控工具,支持监测 CPU、GPU、内存、磁盘、网速、FPS 计数、插件扩展及内存清理。A lightweight, customizable hardware monitor for the Windows desktop & taskbar. Features CPU/GPU/RAM/Network monitoring, FPS counter, plugin support, and memory optimization.
YosysHQ / Picorv32PicoRV32 - A Size-Optimized RISC-V CPU
microsoft / OliveOlive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
facebookresearch / DenoiserReal Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
symisc / SodAn Embedded Computer Vision & Machine Learning Library (CPU Optimized & IoT Capable)
GPUOpen-Tools / CompressonatorTool suite for Texture and 3D Model Compression, Optimization and Analysis using CPUs, GPUs and APUs
minio / HighwayhashNative Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
intel / Auto RoundSOTA rounding-based quantization for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.
Geekgineer / YOLOs CPPCross-Platform Production-ready C++ inference engine for YOLO models (v5-v12, YOLO26). Unified API for detection, segmentation, pose estimation, OBB, and classification. Built on ONNX Runtime and OpenCV. Optimized for CPU/GPU with quantization support.
segmentio / AsmGo library providing algorithms optimized to leverage the characteristics of modern CPUs
JayDDee / Cpuminer OptOptimized multi algo CPU miner
amphp / ParallelAn advanced parallelization library for PHP, enabling efficient multitasking, optimizing resource use, and application responsiveness through multiple CPU threads.
graysky2 / Kernel Compiler PatchKernel patch enables compiler optimizations for additional CPUs.
byronknoll / Cmixcmix is a lossless data compression program aimed at optimizing compression ratio at the cost of high CPU/memory usage.
SimonvBez / CPUSetSetterMake your games and apps run on the right CPU cores - for smoother performance on AMD Dual-CCD and Intel Hybrid processors.
robertcprice / NCPUnCPU: model-native and tensor-optimized CPU research runtimes with organized workloads, tools, and docs
realopslabs / KubeledgerThe System of Record for Kubernetes Accounting. Tracks CPU/RAM/GPU usage per namespace. Reveals hidden overhead. Get Insights for cost optimization. (Formerly Kube-Opex-Analytics).
microsoft / AntaresAntares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, Android CPU/GPU backends.