202 skills found · Page 3 of 7
CraftJarvis / ROCKET 1Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR'25)
daqingliu / CAVPCode release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)
sirkosophia / DIPOfficial implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations
UKPLab / 5pilsCode associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformation.
ZJU-REAL / GSM8K VGSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts
edward3862 / AnalogistAnalogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)
llnl / Paraview MCPParaView-MCP integrates multimodal LLMs with ParaView via Model Context Protocol, enabling natural language control of scientific visualizations. The agent observes the viewport for visual feedback, making complex visualization tool accessible to all users while providing intelligent automation for experts.
huawei-lin / VTBenchThis repository provides the official implementation of VTBench, a benchmark designed to evaluate the performance of visual tokenizers (VTs) in the context of autoregressive (AR) image generation.
KaihuaTang / VCTree Visual Question AnsweringCode for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contexts"
hjrPhoebus / X DubOfficial project page for "From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing" (X-Dub).
devizor / MacOS Notification MCPmacOS Notification MCP enables AI assistants to trigger native macOS sounds, visual notifications, and text-to-speech. Built for Claude and other AI models using the Model Context Protocol.
JanghoonChoi / TACTVisual Tracking by TridenAlign and Context Embedding
matthias-jaeger-net / P5 ToolkitA collection of effects and other visual trickery in the context of p5 sketches.
heyzgj / LumiLumi is a Chrome extension that turns your visual edits and annotations into high‑fidelity context for coding agents
julielerman / TEE14DemoVisual Studio Solution for TEE14 Session EF Model Partioning in Domain-Driven Design Bounded Contexts
Jiahao000 / VICT[CVPR 2025] Test-Time Visual In-Context Tuning
machine / Machine.specifications.runner.visualstudioA test adapter for Visual Studio and dotnet test for the Context/Specification framework Machine.Specifications
showlab / VisInContextOfficial implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
WayneTomas / TransCP[TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding".
autollama / AutollamaAnthropic's Contextual Retrieval implementation with visual chunk comparison. Preview context enrichment before/after embedding.