202 skills found · Page 4 of 7
shaadclt / Qwen2 VL OCR VQAThis project demonstrates how to use the Qwen2-VL model from Hugging Face for Optical Character Recognition (OCR) and Visual Question Answering (VQA). The model combines vision and language capabilities, enabling users to analyze images and generate context-based responses.
ms-dot-k / Visual Context Attentional GANPyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)
TencentYoutuResearch / VisualRecognition NomMerCode for CVPR 2022 paper "NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition"
ShiftMediaProject / NettleUnofficial Nettle with added custom native Visual Studio project build tools. Nettle: Nettle is a cryptographic library that is designed to fit easily in more or less any context.
CodingWithCalvin / VS MCPServerVS MCP Server exposes Visual Studio features through the Model Context Protocol (MCP), enabling AI assistants like Claude to interact with your IDE programmatically. Open files, read code, build projects, and more - all through natural conversation!
abhisheksambyal / Self Supervised Learning By Context PredictionImplementation of "Unsupervised Visual Representation Learning by Context Prediction" by C. Doersh, A. Gupta and A. A. Efros
sudraj2002 / AWRaCLePyTorch code for AWRaCLe: All-Weather Image Restoration using Visual In-Context Learning
ForJadeForest / LIVE Learnable In Context Vector【NeurIPS 2024】The implementation of LIVE: Learnable In-Context Vector for Visual Question Answering https://arxiv.org/abs/2406.13185
mondweep / Youtube Music MCP ServerThis is a MCP (Model Context Protocol) server that you can use with Cline through Visual Studio Code and ask songs to be played using Youtube Music
ms-dot-k / AVSRPyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR2023) and "Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition" (Interspeech 2022)
GaryJiajia / OFv2 ICL VQA[CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering
ContextKeeper / ContextKeeper.VisualStudioSession Manager for Visual Studio
syp2ysy / Prompt SelF[TIP] Exploring Effective Factors for Improving Visual In-Context Learning
sailro / RoslynMcpExtensionA Visual Studio extension that exposes semantic C# code analysis via the Model Context Protocol (MCP), powered by the live Roslyn workspace inside Visual Studio.
jaguadoromero / Vscode Php Create ClassA Visual Studio Code extension for create Class / Interface / Trait / Enum from context menu in file explorer
applecore56 / Shopping CartThis simple shopping cart prototype shows how React with Typescript, React hooks, react Context and Styled Components can be used to build a friendly user experience with instant visual updates and scaleable code in ecommerce applications.
AmirhosseinHonardoust / Mobile AI Satisfaction Behavior AanalysisDeep behavioral and machine learning analysis explaining why mobile users systematically report lower satisfaction with AI systems. Includes SHAP explainability, cognitive load modeling, device-context effects, interaction metadata analysis, and end-to-end reproducible research code and visuals.
kreimanlab / Put In ContextPutting Visual Object Recognition in Context
GXNU-ZhongLab / RSTrackExplicit Context Reasoning with Supervision for Visual Tracking (ACM MM 25)
ShmuelRonen / ComfyUI Pixtral VisionThe `ComfyUI_pixtral_vision` node is a powerful ComfyUI node designed to integrate seamlessly with the Mistral Pixtral API. It facilitates the analysis of images through deep learning models, interpreting and describing the visual content. Users can input an image directly and provide prompts for context, utilizing an API key for authentication.