Results for "efficient-model"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

2,085 skills found · Page 1 of 70

huggingface / Datasets

21.4k

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

universal

aiartificial-intelligencecomputer-vision+13

Updated 27m ago

NVIDIA / TensorRT LLM

13.2k

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

universal

blackwellcudallm-serving+2

Updated 13m ago

TheR1D / Shell Gpt

11.9k

A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.

universal

chatgptcheat-sheetcli+13

Updated 2h ago

apple / Ml Fastvlm

7.3k

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

universal

Updated 14h ago

mit-han-lab / Streaming Llm

7.2k

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

universal

Updated 10h ago

justlovemaki / AIClient 2 API

6.5k

Simulates Gemini CLI, Antigravity, Qwen Code, and Kiro client requests, compatible with the OpenAI API. It supports thousands of Gemini model requests per day and offers free use of the built-in Claude model in Kiro. Easily connect to any client via the API, making AI development more efficient!

claude codeclaude desktop+1

aicodingfree

Updated 5m ago

deepseek-ai / DeepSeek V2

5.0k

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

universal

Updated 2h ago

fla-org / Flash Linear Attention

4.8k

🚀 Efficient implementations for emerging model architectures

universal

large-language-modelsmachine-learning-systemsnatural-language-processing+1

Updated 1h ago

microsoft / Fara

4.7k

Fara-7B: An Efficient Agentic Model for Computer Use

universal

agentbrowser-usecomputer-use+2

Updated 1h ago

TencentARC / InstantMesh

4.3k

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

universal

Updated 5h ago

vllm-project / Vllm Omni

4.1k

A framework for efficient model inference with omni-modality models

universal

audio-generationdiffusionimage-generation+6

Updated 17m ago

hustvl / Vim

3.8k

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

universal

Updated 4h ago

mit-han-lab / Efficientvit

3.3k

Efficient vision foundation models for high-resolution generation and perception.

universal

deep-compression-autoencoderefficient-diffusion-modelefficientvit+5

Updated 1d ago

jy0205 / Pyramid Flow

3.2k

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

universal

diffusion-modelsflow-matchingvideo-generation

Updated 1d ago

z-x-yang / Segment And Track Anything

3.1k

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

zed

interactive-segmentationsegment-anythingsegment-anything-model+2

Updated 3h ago

alibaba / ROLL

3.0k

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

universal

agenticrlhfrlvr

Updated 2h ago

henrywoo / Pyllama

2.8k

LLaMA: Open and Efficient Foundation Language Models

universal

Updated 6d ago

modelscope / Evalscope

2.6k

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

universal

evaluationllmperformance+2

Updated 33m ago

666DZY666 / Micronet

2.3k

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

universal

batch-normalization-fusebnnconvolutional-networks+17

Updated 14d ago

mit-han-lab / Temporal Shift Module

2.2k

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

universal

accelerationefficient-modellow-latency+4

Updated 2d ago