DisentangledSSL
Code for paper: "An Information Criterion for Controlled Disentanglement of Multimodal Data"
Install / Use
/learn @uhlerlab/DisentangledSSLREADME
An Information Criterion for Controlled Disentanglement of Multimodal Data
The repository contains the code for the DisentangledSSL method presented in the paper: An Information Criterion for Controlled Disentanglement of Multimodal Data (ICLR 2025). DisentangledSSL is a novel self-supervised approach for learning disentangled representations, separating information shared across different modalities and modality-specific information.
An example in vision-language domain:

An example in the biological domain:

Set up the environment
conda create -n multimodal python=3.10.9
conda activate multimodal
bash env.sh
Simulation Study
The code is in the synthetic_task/ folder. The command for DisentangledSSL (both step 1 and step 2) and baselines can be found in scripts/.
bash scripts/run_step1.sh
bash scripts/run_step2.sh

Citation
If you find this work useful in your research, please cite:
@article{wang2024information,
title={An Information Criterion for Controlled Disentanglement of Multimodal Data},
author={Wang, Chenyu and Gupta, Sharut and Zhang, Xinyi and Tonekaboni, Sana and Jegelka, Stefanie and Jaakkola, Tommi and Uhler, Caroline},
journal={arXiv preprint arXiv:2410.23996},
year={2024}
}
Related Skills
node-connect
352.5kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
111.3kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
352.5kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
352.5kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
