PoPo
Pose and animate MMD model with LLM and MPL code
Install / Use
/learn @AmyangXYZ/PoPoREADME
PoPo
AI-powered MMD pose generator - Transform natural language into expressive 3D character animations
PoPo uses LLMs to generate MMD character poses from natural language descriptions. Instead of raw rotation quaternions, we use MPL (MMD Pose Language) - a semantic, MMD-specific pose description language that helps AI understand and generate anatomically correct poses.
🌐 Live demo: popo.love
Powered by Reze Engine


✨ Features
- Natural Language Input: "wave right hand with big laugh, inviting me for dinner"
- LLM-Generated Poses: Output semantic MPL code for precise pose control
- Real-time Rendering: Instant pose creation with smooth bone animations
- MMD-Specific: Built for anime characters with proper bone constraints and physics
🎯 LLMs with MPL
PoPo fine-tunes LLMs with MPL: MPL is a semantic pose description language designed specifically for MMD. This approach provides:
- Better training convergence - Structured, human-readable pose descriptions
- Consistent outputs - Same prompt generates reliable pose code
- Anatomically correct - Built-in constraints prevent impossible movements
- Debuggable results - Generated MPL code can be read and modified
🛠️ Technology
- Frontend: Next.js, shadcn/ui, TypeScript
- 3D Engine: Reze Engine
- Pose Language: MPL (MMD Pose Language) for semantic pose description
- AI Model: GPT-4o-mini for natural language → MPL generation
- Deployment: Vercel
🎭 Evolution
- MiKaPo: Camera → MediaPipe → MMD bones (real-time capture)
- PoPo: Text → Fine-tuned LLM → MPL code → MMD bones (AI-generated poses)
By using semantic MPL as the training target instead of raw quaternions, we achieve better consistency and allow the AI to learn the "grammar" of human movement.
📄 License
GPL-3.0 License - see LICENSE for details.
Related Skills
node-connect
349.0kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
109.4kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
349.0kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
349.0kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
