PoPo

Pose and animate MMD model with LLM and MPL code

Generate Convert Improve

Install / Use

/learn @AmyangXYZ/PoPo

About this skill

Quality Score

0/100

README

PoPo

AI-powered MMD pose generator - Transform natural language into expressive 3D character animations

PoPo uses LLMs to generate MMD character poses from natural language descriptions. Instead of raw rotation quaternions, we use MPL (MMD Pose Language) - a semantic, MMD-specific pose description language that helps AI understand and generate anatomically correct poses.

🌐 Live demo: popo.love

Powered by Reze Engine

✨ Features

Natural Language Input: "wave right hand with big laugh, inviting me for dinner"
LLM-Generated Poses: Output semantic MPL code for precise pose control
Real-time Rendering: Instant pose creation with smooth bone animations
MMD-Specific: Built for anime characters with proper bone constraints and physics

🎯 LLMs with MPL

PoPo fine-tunes LLMs with MPL: MPL is a semantic pose description language designed specifically for MMD. This approach provides:

Better training convergence - Structured, human-readable pose descriptions
Consistent outputs - Same prompt generates reliable pose code
Anatomically correct - Built-in constraints prevent impossible movements
Debuggable results - Generated MPL code can be read and modified

🛠️ Technology

Frontend: Next.js, shadcn/ui, TypeScript
3D Engine: Reze Engine
Pose Language: MPL (MMD Pose Language) for semantic pose description
AI Model: GPT-4o-mini for natural language → MPL generation
Deployment: Vercel

🎭 Evolution

MiKaPo: Camera → MediaPipe → MMD bones (real-time capture)
PoPo: Text → Fine-tuned LLM → MPL code → MMD bones (AI-generated poses)

By using semantic MPL as the training target instead of raw quaternions, we achieve better consistency and allow the AI to learn the "grammar" of human movement.

📄 License

GPL-3.0 License - see LICENSE for details.

Related Skills

node-connect

349.0k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

109.4k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

349.0k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

349.0k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。