119 skills found · Page 1 of 4
MarkMoHR / Awesome Referring Image Segmentation:books: A collection of papers about Referring Image Segmentation.
henghuiding / ReLA[CVPR 2023 Highlight & IJCV 2026] GRES: Generalized Referring Expression Segmentation
henghuiding / Vision Language Transformer[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation
PolyU-ChenLab / UniPixel🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)
henghuiding / Awesome Multimodal Referring SegmentationMultimodal Referring Segmentation
Lsan2401 / RMSINRotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
heshuting555 / ReferSplat[ICML2025 Oral] ReferSplat: Referring Segmentation in 3D Gaussian Splatting
luogen1996 / MCN[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)
kkakkkka / ETRIS[ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
iSEE-Laboratory / ReferDINO(ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
Seonghoon-Yu / Zero Shot RIS[CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"
bo-miao / SgMg[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.
sosppxo / Mvggt[CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation
SketchyScene / SketchySceneColorization[SIGGRAPH Asia 2019] Language-based Colorization of Scene Sketches
FudanCVL / OmniAVS[ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
JerryX1110 / Awesome RvosReferring Video Object Segmentation / Multi-Object Tracking Repo
xmz111 / FlowRVS[ICLR 2026] Deforming Videos to Masks: Flow Matching for Referring Video Segmentation (FlowRVS)
Lavreniuk / EVP[ECCV 2024] EVP model for metric depth estimation from a single image and referring segmentation
heshuting555 / DsHmp[CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
OpenGVLab / MUTR「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation