Results for "3d-visual-grounding"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

44 skills found · Page 1 of 2

liudaizong / Awesome 3D Visual Grounding

273

😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.

universal

Updated 9d ago

GWxuan / TSP3D

248

[CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding

universal

Updated 7d ago

iris0329 / SeeGround

216

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

universal

3d-scene-understanding3d-visual-groundingembodied-agent+4

Updated 5d ago

worldbench / 3EED

208

[NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D

universal

3d3d-grounding3d-visual-grounding+11

Updated 6d ago

be2rlab / Gsplatloc

136

[IROS 2025] GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization

universal

Updated 2d ago

yanmin-wu / EDA

133

[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding

universal

3d-vision-and-language3d-visual-groundingvision-and-language+1

Updated 18d ago

InternRobotics / VLM Grounder

130

[CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding

universal

3d-scene-understandingagentgpt-4o+6

Updated 2d ago

CognitiveAISystems / 3DGraphLLM

111

[ICCV 2025] 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.

universal

3d-scene-graph3d-scene-understanding3d-visual-grounding+1

Updated 11d ago

jianghaojun / Awesome 3D Vision And Language

101

A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.

universal

3d-deep-learning3d-vision-and-languageawesome+5

Updated 15d ago

ZCMax / ScanReason

[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities

universal

Updated 5d ago

sega-hsj / MVT 3DVG

[CVPR 2022] Multi-View Transformer for 3D Visual Grounding

universal

Updated 21d ago

WHU-USI3DV / CityAnchor

[ICLR'25] City-scale 3D Visual Grounding with Multi-modality LLMs

universal

Updated 1d ago

ZhanYang-nwpu / Mono3DVG

[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024

universal

Updated 5d ago

CurryYuan / ZSVG3D

[CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding

universal

3dvision-and-languagevisual-grounding+1

Updated 2mo ago

Ivan-Tang-3D / ViewRefer3D

(ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'

universal

Updated 5mo ago

zlccccc / 3DVL Codebase

[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds

universal

3d-vision3d-vision-and-language3d-visual-grounding+8

Updated 2mo ago

pqh22 / ProxyTransformation

[CVPR2025] ProxyTransformation : Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding

universal

3d-visual-groundingembodied-aimachine-learning

Updated 8d ago

daveredrum / D3Net

[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

universal

3dcaption-generationcomputer-vision+7

Updated 8mo ago

zyang-ur / SAT

SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)

universal

Updated 1y ago

Leon1207 / 3DRefTR

This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"

universal

Updated 2mo ago