11 skills found
inclusionAI / UI Venus: UI-Venus is a native UI agent designed to perform precise GUI element grounding and effective navigation using only screenshots as input.
FlagOpen / RoboBrain2.5: Advanced version of RoboBrain. Depth in Sight, Time in Mind.
JIA-Lab-research / Seg Zero: Project page for "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement".
jqtangust / Robust R1: [AAAI 2026 Oral] Official implementation of Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding.
Atomic-man007 / Awesome Multimodel LLM: A curated repository providing a comprehensive collection of resources for Multimodal Large Language Models (MLLMs), covering datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancements.
sun-hailong / TVC: [ACL 2025] Code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning", in PyTorch.
theboringhumane / EchoOLlama: 🦙 A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice interactions, and OpenAI API compatibility. Built with FastAPI, Redis, and PostgreSQL. Suited to private AI conversations and custom voice assistants.
BIGBALLON / UME Search: Toward Universal Multimodal Embedding.
xinyanghuang7 / Basic Visual Language Model: Build a simple, basic multimodal large model from scratch.
zhangguanghao523 / CMMCoT: [AAAI'26] Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation.
SufyanDanish / VLM Survey: A comprehensive survey of Vision–Language Models: pretrained models, fine-tuning, prompt engineering, adapters, and benchmark datasets.