vllm-project / Vllm Omni
A framework for efficient model inference with omni-modality models.

QwenLM / Qwen2.5 Omni
Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and of performing real-time speech generation.

ictnlp / LLaMA Omni
LLaMA-Omni is a low-latency, high-quality end-to-end speech interaction model built on Llama-3.1-8B-Instruct, aiming for GPT-4o-level speech capabilities.

VITA-MLLM / VITA
✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
A real-time interactive Omni Avatar built on LiveKit, which lets you seamlessly integrate any open-source avatar components (real-time model, vision, voice, memory, search, etc.).
Ola-Omni / Ola
Ola: Pushing the Frontiers of Omni-Modal Language Model

VITA-MLLM / Freeze Omni
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

CASIA-IVA-Lab / VALOR
[TPAMI 2024] Codes and models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

CASIA-IVA-Lab / VAST
[NeurIPS 2023] Code and model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

opendilab / LightRFT
LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework

yunxiangfu2001 / SegMAN
[CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
MooreThreads / MooER
MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models with training and inference code, covering end-to-end speech interaction, end-to-end speech translation, and speech recognition, among other tasks.
SOTAMak1r / VINO Code
A Unified Visual Generator with Interleaved OmniModal Context

sgl-project / Sglang Omni
SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models

ddlBoJack / Omni Captioner
[ICLR 2026] Data pipeline, models, and benchmark for Omni-Captioner.

THU-BPM / Omni SafetyBench
Code for the paper "Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models".

bytedance / OmniScient Model
This repo contains the code for the paper "Towards Open-Ended Visual Recognition with Large Language Model".

Zplusdragon / ReID5o ORBench
[NeurIPS 2025] ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model
meituan-longcat / UNO Bench
An omni-model benchmark with high quality and diversity that reveals the Compositional Law. Currently focused on Chinese scenarios, with partners actively sought to co-build English and multilingual versions.
maxencefaldor / Omni Epic
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025)