ERDiff
[NeurIPS 2023 Spotlight] Official Repo for "Extraction and Recovery of Dpatio-temporal Structure in Latent Dynamics Alignment with Diffusion Models"
Install / Use
/learn @yulewang97/ERDiffREADME
<img src="images/GTVertical_RGB.png" alt="GTVertical_RGB" width="140" /><img src="images/127633222.png" alt="GTVertical_RGB" width="120" />
<div align=center><img src="images/ERDiff_main_github.png", width="650"></div>March 8, 2025 Update
A new tag v1.0.1 has been created.
Changes:
- Initialized linear probing layers with an identity matrix to enhance alignment stability.
- Improved diffusion model stability using data augmentation and
cosine_beta_schedule. - Resolved NaN issues for better numerical stability.
Environment Setup
To install the required dependancies using conda, run:
conda create --name erdiff --file requirements.txt
To install the required dependancies using Python virtual environment, run:
python3 -m venv erdiff
source erdiff/bin/activate
python3 -m pip install --upgrade pip
python3 -m pip install -e .
To train the diffusion model on the source session, run:
cd scripts/ && sbatch run_diffusion_train.sh
To perform the diffusion-guided maximum likelihood alignment, run:
cd scripts/ && sbatch run_mla.sh
The alignment process across epochs can be viewed in scripts/mla_erdiff_398637.out.
Neural Latent Trajectories and their Dynamics Visualization
Cited as
If you find the code useful for your research, please consider citing our work:
@article{wang2024extraction,
title={Extraction and recovery of spatio-temporal structure in latent dynamics alignment with diffusion model},
author={Wang, Yule and Wu, Zijing and Li, Chengrui and Wu, Anqi},
journal={Advances in Neural Information Processing Systems},
volume={36},
year={2024}
}
Poster for NeurIPS 2023
Related Skills
node-connect
343.1kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
90.0kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
343.1kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
343.1kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。


