SCORE

[ICCV 2025 Highlight] Official PyTorch implementation of "SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation"

Generate Convert Improve

Install / Use

/learn @HuangShiqi128/SCORE

About this skill

Quality Score

0/100

README

<h2 align="center">SCORE: Scene Context Matters <br> in Open-Vocabulary Remote Sensing Instance Segmentation </h2> <div align="center"> <p> <a href="https://arxiv.org/abs/2507.12857"><img src="https://img.shields.io/badge/arXiv-SCORE-b31b1b.svg"></a> <a href="https://arxiv.org/pdf/2507.12857"><img src="https://img.shields.io/badge/PDF-8A2BE2"></a> </p> </div>

This repository contains code for our ICCV2025 Highlight✨ paper:

SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation<br> Shiqi Huang, Shuting He, Huaiyuan Qin, Bihan Wen<br> ICCV 2025 (Highlight)

Framework

Installation

Please see Installation Instructions.

Datasets

Download datasets for open-vocabulary remote sensing instance segmentation from Hugging Face <img src="https://huggingface.co/front/assets/huggingface_logo-noborder.svg" alt="Hugging Face Logo" width="20"/>.

Getting Started with SCORE

Please see Getting Started with Detectron2 for full usage.

Training

Download RS-CLIP ckpt from RemoteCLIP and put it under SCORE/RemoteCLIP/RemoteCLIP-ViT-L-14.pt.

python train_net.py --num-gpus 1 \
  --config-file configs/score_isaid_instances.yaml

python train_net.py --num-gpus 1 \
  --config-file configs/score_sior_instances.yaml

Inference

bash eval_isaid.sh

bash eval_sior.sh

Acknowledgement

This project is based on FC-CLIP. Many thanks to the authors for their great work!

BibTeX

Please consider to cite SCORE if it helps your research.

@inproceedings{SCORE,
  title={SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation},
  author={Huang, Shiqi and He, Shuting and Qin, Huaiyuan and Wen, Bihan},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={12559--12569},
  year={2025}
}

Related Skills

node-connect

351.8k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

110.9k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

351.8k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

351.8k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。