OpenHGNN
This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL.
Install / Use
/learn @BUPT-GAMMA/OpenHGNNREADME
OpenHGNN
启智社区(中文版) | OpenHGNN [CIKM2022] | Space4HGNN [SIGIR2022] | Benchmark&Leaderboard | Slack Channel
This is an open-source toolkit for Heterogeneous Graph Neural Network based on DGL [Deep Graph Library] and PyTorch. We integrate SOTA models of heterogeneous graph.
News
<details> <summary> 2024-07-23 release v0.7 </summary> <br/>We release the latest version v0.7.0
- New models and datasets.
- Graph Prompt pipeline
- Data process frame: dgl.graphBolt
- New GNN aggregator: dgl.sparse
- Distributed training
We release the latest version v0.5.0
- New models and datasets.
- 4 New tasks: pretrain, recommendation, graph attacks and defenses, abnorm_event detection.
- TensorBoard visualization.
- Maintenance and test module.
OpenHGNN won the Excellent Incubation Program Award of OpenI Community! For more details:https://mp.weixin.qq.com/s/PpbwEdP0-8wG9dsvRvRDaA
</details> <details> <summary> 2023-02-21 First Prize of CIE </summary> <br/>The algorithm library supports the project of "Intelligent Analysis Technology and Scale Application of Large Scale Complex Heterogeneous Graph Data" led by BUPT and participated by ANT GROUP, China Mobile, Haizhi Technology, etc. This project won the first prize of the 2022 Chinese Intitute of Electronics "Science and Technology Progress Award".
</details> <details> <summary> 2023-01-13 release v0.4 </summary> <br/>We release the latest version v0.4.
- New models
- Provide pipelines for applications
- More models supporting mini-batch training
- Benchmark for million-scale graphs
We release the latest version v0.3.
- New models
- API Usage
- Simply customization of user-defined datasets and models
- Visualization tools of heterogeneous graphs
We release the latest version v0.2.
- New Models
- Space4HGNN [SIGIR2022]
- Benchmark&Leaderboard
- 全新的中文文档
- 免费的计算资源—— 云脑使用教程
- OpenHGNN最新功能
- 新增模型:【KDD2017】Metapath2vec、【TKDE2018】HERec、【KDD2021】HeCo、【KDD2021】SimpleHGN、【TKDE2021】HPN、【ICDM2021】HDE、fastGTN
- 新增日志功能
- 新增美团外卖数据集
Key Features
- Easy-to-Use: OpenHGNN provides easy-to-use interfaces for running experiments with the given models and dataset. Besides, we also integrate optuna to get hyperparameter optimization.
- Extensibility: User can define customized task/model/dataset to apply new models to new scenarios.
- Efficiency: The backend dgl provides efficient APIs.
Get Started
Requirements and Installation
1. Python environment (Optional): We recommend using Conda package manager
conda create -n openhgnn python=3.6
source activate openhgnn
2. Install Pytorch: Follow their tutorial to run the proper command according to your OS and CUDA version. For example:
pip install torch torchvision torchaudio
3. Install DGL: Follow their tutorial to run the proper command according to your OS and CUDA version. For example:
pip install dgl -f https://data.dgl.ai/wheels/repo.html
4. Install openhgnn:
- install from pypi
pip install openhgnn
- install from source
git clone https://github.com/BUPT-GAMMA/OpenHGNN
# If you encounter a network error, try git clone from openi as following.
# git clone https://git.openi.org.cn/GAMMALab/OpenHGNN.git
cd OpenHGNN
pip install .
5. Install gdbi(Optional):
- install gdbi from git
pip install git+https://github.com/xy-Ji/gdbi.git
- install graph database from pypi
pip install neo4j==5.16.0
pip install nebula3-python==3.4.0
Running an existing baseline model on an existing benchmark dataset
python main.py -m model_name -d dataset_name -t task_name -g 0 --use_best_config --load_from_pretrained
usage: main.py [-h] [--model MODEL] [--task TASK] [--dataset DATASET] [--gpu GPU] [--use_best_config][--use_database]
optional arguments:
-h, --help show this help message and exit
--model -m name of models
--task -t name of task
--dataset -d name of datasets
--gpu -g controls which gpu you will use. If you do not have gpu, set -g -1.
--use_best_config use_best_config means you can use the best config in the dataset with the model. If you want to
set the different hyper-parameter, modify the openhgnn.config.ini manually. The best_config
will override the parameter in config.ini.
--load_from_pretrained will load the model from a default checkpoint.
--use_database get dataset from database
---mini_batch_flag train model with mini-batchs
---graphbolt mini-batch training with dgl.graphbolt
---use_distributed train model with distributed way
e.g.:
python main.py -m GTN -d imdb4GTN -t node_classification -g 0 --use_best_config
python main.py -m RGCN -d imdb4GTN -t node_classification -g 0 --mini_batch_flag --graphbolt
Note: If you are interested in some model, you can refer to the below models list.
Refer to the docs to get more basic and depth usage.
Use TensorBoard to visualize your train result
tensorboard --logdir=./openhgnn/output/{model_name}/
e.g.:
tensorboard --logdir=./openhgnn/output/RGCN/
Note: To visualize results, you need to train the model first.
Use gdbi to get grpah dataset
take neo4j and imdb dataset for example
- construct csv file for dataset(node-level:A.csv,edge-level:A_P.csv)
- import csv file to database
LOAD CSV WITH HEADERS FROM "file:///data.csv" AS row
CREATE (:graphname_labelname {ID: row.ID, ... });
- add user information to access database in config.py file
self.graph_address = [graph_address]
self.user_name = [user_name]
self.password = [password]
- e.g.:
python main.py -m MAGNN -d imdb4MAGNN -t node_classification -g 0 --use_best_config --use_database
Models
Supported Models with specific task
The link will give some basic usage.
| Model | Node classification | Link prediction | Recommendation | |-----------------------------------------------------------|---------------------|--------------------|--------------------| | TransE[NIPS 2013] | | :heavy_check_mark: | | | TransH[AAAI 2014] | | :heavy_check_mark: | | | TransR[AAAI 2015] | | :heavy_check_mark: | | | TransD[ACL 2015] | | :heavy_check_mark: | | | Metapath2vec[KDD 2017] | :heavy_check_mark: | | | | RGCN[ESWC 2018] | :heavy_check_mark: | :heavy_check_mark: | | | HERec[TKDE 2018] | :heavy_check_mark: | | | | HAN[WWW 2019] | :heavy_check_mark: | :heavy_check_mark: | | | KGCN[WWW 2019] | | | :heavy_check_mark: | | HetGNN[KDD 2019] | :heavy_check_mark: | :heavy_check_mark: | | | HeGAN[KDD 2019] | :heavy_check_mark: | | | | HGAT[EMNLP 2019] | | | | | GTN[NeurIPS 2019] & fastGTN | :heavy_check_mark: | | | | RSHN[ICDM 2019] | :heavy_check_mark: | :heavy_check_mark: | | | GATNE-T[KDD 20
Related Skills
node-connect
338.0kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
83.4kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
338.0kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
83.4kCommit, push, and open a PR
