OTDD
Python implementation of Geometric Dataset Distances via Optimal Transport
Install / Use
/learn @kheyer/OTDDREADME
Optimal Transport Dataset Distances
This repo is a python implementation of Geometric Dataset Distances via Optimal Transport and Robust Optimal Transport. Routines are implemented in numpy with Python Optimal Transport and CVXPY, as well as in Pytorch using KeOps and GeomLoss.
The OTDD algorithm allows us to incorporate label information into the optimal transport problem.

• Algorithm Overview • API • Examples
Installing
Core dependencies can be installed from the environment.yml file
conda env create -f environment.yml
To use the Pytorch implementation, install Pytorch, KeOps and GeomLoss
conda install pytorch torchvision torchaudio -c pytorch
pip install pykeops
pip install geomloss
Then validate the KeOps installation
import pykeops
pykeops.clean_pykeops()
pykeops.test_torch_bindings()
To use the cheminformatics functions in chem.py, install RDKit
conda install -c rdkit rdkit
Related Skills
node-connect
349.9kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
109.8kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
349.9kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
349.9kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
