Results for "visual-context"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

202 skills found · Page 3 of 7

CraftJarvis / ROCKET 1

Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR'25)

universal

agentminecraftvision-language-model

Updated 4mo ago

daqingliu / CAVP

Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)

universal

image-captioningpolicy-network

Updated 1y ago

sirkosophia / DIP

Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations

universal

Updated 1mo ago

UKPLab / 5pils

Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformation.

universal

fact-checkingfake-newslarge-language-models+4

Updated 3mo ago

ZJU-REAL / GSM8K V

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

universal

Updated 20d ago

edward3862 / Analogist

Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)

universal

Updated 2mo ago

llnl / Paraview MCP

ParaView-MCP integrates multimodal LLMs with ParaView via Model Context Protocol, enabling natural language control of scientific visualizations. The agent observes the viewport for visual feedback, making complex visualization tool accessible to all users while providing intelligent automation for experts.

claude codecursor

Updated 5d ago

huawei-lin / VTBench

This repository provides the official implementation of VTBench, a benchmark designed to evaluate the performance of visual tokenizers (VTs) in the context of autoregressive (AR) image generation.

universal

Updated 1mo ago

KaihuaTang / VCTree Visual Question Answering

Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"

universal

Updated 1y ago

hjrPhoebus / X Dub

Official project page for "From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing" (X-Dub).

universal

Updated 20d ago

devizor / MacOS Notification MCP

macOS Notification MCP enables AI assistants to trigger native macOS sounds, visual notifications, and text-to-speech. Built for Claude and other AI models using the Model Context Protocol.

claude codeclaude desktop+1

aiclaudellm+6

Updated 1mo ago