SCORE
[ICCV 2025 Highlight] Official PyTorch implementation of "SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation"
This repository contains code for our ICCV 2025 Highlight✨ paper:
<div align="left"> <img src="imgs/teaser.png" width="70%" height="100%"/> </div><br/>
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation<br>
Shiqi Huang, Shuting He, Huaiyuan Qin, Bihan Wen<br>
ICCV 2025 (Highlight)
Framework
<div align="center"> <img src="imgs/framework.png" width="100%" height="100%"/> </div><br/>
Installation
Please see Installation Instructions.
Datasets
Download datasets for open-vocabulary remote sensing instance segmentation from Hugging Face <img src="https://huggingface.co/front/assets/huggingface_logo-noborder.svg" alt="Hugging Face Logo" width="20"/>.
Getting Started with SCORE
Please see Getting Started with Detectron2 for full usage.
Training
Download the RS-CLIP checkpoint from RemoteCLIP and place it at SCORE/RemoteCLIP/RemoteCLIP-ViT-L-14.pt.
python train_net.py --num-gpus 1 \
--config-file configs/score_isaid_instances.yaml
python train_net.py --num-gpus 1 \
--config-file configs/score_sior_instances.yaml
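Before launching either training command, it can help to confirm that the RemoteCLIP checkpoint sits at the path given above. The following is a minimal sketch, not part of the repository: the helper name `checkpoint_ready` is hypothetical, and the script assumes it is run from the directory that contains the `SCORE/` folder.

```python
from pathlib import Path

# Path as stated in the Training section. Assumption: the script is run from
# the directory that contains the SCORE/ folder; adjust if run elsewhere.
EXPECTED_CKPT = Path("SCORE/RemoteCLIP/RemoteCLIP-ViT-L-14.pt")


def checkpoint_ready(path: Path = EXPECTED_CKPT) -> bool:
    """Hypothetical helper: True if `path` is a non-empty regular file."""
    return path.is_file() and path.stat().st_size > 0


if __name__ == "__main__":
    if checkpoint_ready():
        print(f"Found checkpoint: {EXPECTED_CKPT}")
    else:
        print(f"Missing or empty checkpoint: {EXPECTED_CKPT}")
```

This only checks that the file exists and is non-empty, which catches a missing or interrupted download. It does not verify that the weights are valid or match the expected model.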
Inference
bash eval_isaid.sh
bash eval_sior.sh
Acknowledgement
This project is based on FC-CLIP. Many thanks to the authors for their great work!
BibTeX
Please consider citing SCORE if it helps your research.
@inproceedings{SCORE,
title={SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation},
author={Huang, Shiqi and He, Shuting and Qin, Huaiyuan and Wen, Bihan},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
pages={12559--12569},
year={2025}
}