GNN4CMR
PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-22 paper "Integrating Multi-Label Contrastive Learning with Dual Adversarial Graph Neural Networks for Cross-Modal Retrieval".
Integrating Multi-Label Contrastive Learning with Dual Adversarial Graph Neural Networks for Cross-Modal Retrieval
This repository contains the author's implementation in PyTorch for the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-22 paper "Integrating Multi-Label Contrastive Learning with Dual Adversarial Graph Neural Networks for Cross-Modal Retrieval".
Dependencies
- Python (>=3.8)
- PyTorch (>=1.7.1)
- SciPy (>=1.5.2)
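Before training, it can help to confirm that the environment meets the minimum versions listed above. The following is a minimal sketch, not part of the repository: the helper names `parse_version` and `check_environment` are hypothetical, and it assumes PyTorch and SciPy are installed under the distribution names `torch` and `scipy`.

```python
import sys
from importlib import metadata

# Minimum versions as listed in the Dependencies section.
MIN_PYTHON = (3, 8)
MIN_PACKAGES = {"torch": (1, 7, 1), "scipy": (1, 5, 2)}


def parse_version(text):
    """Turn '1.7.1+cu110' into (1, 7, 1), ignoring local and pre-release suffixes."""
    parts = []
    for piece in text.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def check_environment():
    """Return a list of problems; an empty list means every minimum is met."""
    problems = []
    if sys.version_info[:2] < MIN_PYTHON:
        found = ".".join(str(n) for n in sys.version_info[:3])
        problems.append(f"Python {found} is older than 3.8")
    for name, minimum in MIN_PACKAGES.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems.append(f"{name} is not installed")
            continue
        if parse_version(installed) < minimum:
            wanted = ".".join(str(n) for n in minimum)
            problems.append(f"{name} {installed} is older than {wanted}")
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["All listed minimum versions are satisfied."]:
        print(line)
```

The check only compares numeric version components, so a build such as `1.7.1+cu110` is treated as `1.7.1`.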
Datasets
You can download the features of the datasets from:
- MIRFlickr: OneDrive, BaiduPan (password: b04z)
- NUS-WIDE (top-21 concepts): BaiduPan (password: tjvo)
- MS-COCO: BaiduPan (password: 5uvp)
Implementation
Here we provide the implementation of our proposed models, along with datasets. The repository is organised as follows:
- data/ contains the necessary dataset files for NUS-WIDE, MIRFlickr, and MS-COCO;
- models.py contains the implementations of P-GNN-CON and I-GNN-CON;
Finally, main.py puts all of the above together and can be used to execute a full training run on MIRFlickr, NUS-WIDE, or MS-COCO.
Process
- Place the datasets in data/
- Set the experiment parameters in main.py.
- Train a model:
python main.py
- Set the parameter EVAL = True in main.py for evaluation:
python main.py
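The steps above can also be driven from a small wrapper script. This is a sketch under stated assumptions rather than anything shipped with the repository: `dataset_dir_ready`, `build_command`, and `run` are hypothetical helpers, and the only check made on data/ is that it exists and is not empty, since the exact file layout is not described here.

```python
import subprocess
import sys
from pathlib import Path


def dataset_dir_ready(data_dir):
    """True if data_dir exists and contains at least one entry."""
    path = Path(data_dir)
    return path.is_dir() and any(path.iterdir())


def build_command(script="main.py"):
    """Command equivalent to `python main.py`, using the current interpreter."""
    return [sys.executable, script]


def run(repo_root="."):
    """Launch main.py from repo_root after a basic check on data/."""
    root = Path(repo_root)
    if not dataset_dir_ready(root / "data"):
        raise FileNotFoundError(
            "data/ is missing or empty; place the downloaded features there first"
        )
    # Whether this trains or evaluates depends on the EVAL value set in main.py.
    return subprocess.run(build_command(), cwd=root, check=True)


# Example (run from inside a checkout of the repository):
# run(".")
```

Because the wrapper passes no arguments, switching between training and evaluation still means editing EVAL in main.py, as described in the steps.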
Citation
If you find our work or the code useful, please consider citing our papers using:
@article{Qian_Xue_Zhang_Fang_Xu_2021,
title={Dual Adversarial Graph Neural Networks for Multi-label Cross-modal Retrieval},
volume={35},
number={3},
journal={Proceedings of the AAAI Conference on Artificial Intelligence},
author={Qian, Shengsheng and Xue, Dizhan and Zhang, Huaiwen and Fang, Quan and Xu, Changsheng},
year={2021},
pages={2440-2448}
}
@article{9815553,
title={Integrating Multi-Label Contrastive Learning With Dual Adversarial Graph Neural Networks for Cross-Modal Retrieval},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
author={Qian, Shengsheng and Xue, Dizhan and Fang, Quan and Xu, Changsheng},
year={2022},
pages={1-18},
doi={10.1109/TPAMI.2022.3188547}
}
