Results for "multimodal-alignment"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

86 skills found · Page 1 of 3

Tencent-Hunyuan / HunyuanVideo Foley

1.3k

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.

universal

aigc-audiofoley-artfoley-sound-synthesis+5

Updated 16h ago

HorizonWind2004 / Reconstruction Alignment

383

[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

universal

aigcbagelcomfy+9

Updated 11m ago

shalfun / DriVerse

220

[ACMMM 2025] Officially implement of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment"

universal

Updated 2d ago

GradientSpaces / CrossOver

209

[CVPR 2025, Highlight] CrossOver: 3D Scene Cross-Modal Alignment

universal

3d-scene-understandingmultimodal-alignment

Updated 3d ago

Kwai-YuanQi / MM RLHF

200

The Next Step Forward in Multimodal LLM Alignment

universal

Updated 3d ago

RainBowLuoCS / OpenOmni

134

(NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis

universal

imagelarge-language-modellarge-multimodal-models+4

Updated 3d ago

chenllliang / DreamEngine

122

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!

universal

Updated 15d ago

thecharm / Mega

110

Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".

universal

Updated 6d ago

Max-Fu / Tvl

[ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment

universal

Updated 29d ago

kangverse / DALR

The implementation of our ACL 2025 paper "DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning"

universal

Updated 1d ago

YuanLi95 / EEGA For JMERE

This is code for Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Tagging (AAAI 2023)

universal

Updated 4d ago