ERDiff

[NeurIPS 2023 Spotlight] Official Repo for "Extraction and Recovery of Dpatio-temporal Structure in Latent Dynamics Alignment with Diffusion Models"

Generate Convert Improve

Install / Use

/learn @yulewang97/ERDiff

About this skill

Quality Score

0/100

README

<h2>Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics Alignment with Diffusion Models [NeurIPS'2023 Spotlight]</h2> <div align='center' ><font size='4'>Yule Wang, Zijing Wu, Chengrui Li, and Anqi Wu</font></div> <div align='center' ><font size='5'>Georgia Institute of Technology</font></div> <div align='center' ><font size='5'>Atlanta, GA, USA</font></div>

March 8, 2025 Update

A new tag v1.0.1 has been created.

Changes:

Initialized linear probing layers with an identity matrix to enhance alignment stability.
Improved diffusion model stability using data augmentation and cosine_beta_schedule.
Resolved NaN issues for better numerical stability.

Environment Setup

To install the required dependancies using conda, run:

conda create --name erdiff --file requirements.txt

To install the required dependancies using Python virtual environment, run:

python3 -m venv erdiff
source erdiff/bin/activate
python3 -m pip install --upgrade pip
python3 -m pip install -e .

To train the diffusion model on the source session, run:

cd scripts/ && sbatch run_diffusion_train.sh

To perform the diffusion-guided maximum likelihood alignment, run:

cd scripts/ && sbatch run_mla.sh

The alignment process across epochs can be viewed in scripts/mla_erdiff_398637.out.

Neural Latent Trajectories and their Dynamics Visualization

Cited as

If you find the code useful for your research, please consider citing our work:

@article{wang2024extraction,
  title={Extraction and recovery of spatio-temporal structure in latent dynamics alignment with diffusion model},
  author={Wang, Yule and Wu, Zijing and Li, Chengrui and Wu, Anqi},
  journal={Advances in Neural Information Processing Systems},
  volume={36},
  year={2024}
}

Poster for NeurIPS 2023

Related Skills

node-connect

343.1k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

90.0k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

343.1k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

343.1k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。