2,085 skills found · Page 1 of 70
huggingface / Datasets🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
NVIDIA / TensorRT LLMTensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
TheR1D / Shell GptA command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.
apple / Ml FastvlmThis repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
mit-han-lab / Streaming Llm[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
justlovemaki / AIClient 2 APISimulates Gemini CLI, Antigravity, Qwen Code, and Kiro client requests, compatible with the OpenAI API. It supports thousands of Gemini model requests per day and offers free use of the built-in Claude model in Kiro. Easily connect to any client via the API, making AI development more efficient!
deepseek-ai / DeepSeek V2DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
fla-org / Flash Linear Attention🚀 Efficient implementations for emerging model architectures
microsoft / FaraFara-7B: An Efficient Agentic Model for Computer Use
TencentARC / InstantMeshInstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
vllm-project / Vllm OmniA framework for efficient model inference with omni-modality models
hustvl / Vim[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
mit-han-lab / EfficientvitEfficient vision foundation models for high-resolution generation and perception.
jy0205 / Pyramid Flow[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
z-x-yang / Segment And Track AnythingAn open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
alibaba / ROLLAn Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
henrywoo / PyllamaLLaMA: Open and Efficient Foundation Language Models
modelscope / EvalscopeA streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
666DZY666 / Micronetmicronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape
mit-han-lab / Temporal Shift Module[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding