Cad3dify
2D to 3D CAD Conversion Using VLM
Install / Use
/learn @neka-nat/Cad3difyREADME
cad3dify
Using GPT-5 (or Claude 4.5 opus, Gemini 3 pro, Llama 3.2 on Vertex AI), generate a 3D CAD model (STEP file) from a 2D CAD image.
[!NOTE] We developed an agent-based version of this project called agent3dify. You can use it instead of this project.
Getting started
Installation.
git clone git@github.com:neka-nat/cad3dify.git
cd cad3dify
poetry install
Run script. A STEP`file ("output.step") will be generated.
cd scripts
export OPENAI_API_KEY=<YOUR API KEY>
python cli.py <2D CAD Image File>
Or run streamlit spp
streamlit run scripts/app.py
streamlit run scripts/app.py -- --model_type claude # Use Claude 4.5 sonnet
streamlit run scripts/app.py -- --model_type gemini # Use Gemini 3.0 pro preview
streamlit run scripts/app.py -- --model_type llama # Use Llama 3.2 on Vertex AI
Architecture
graph TD
Input((Input Image)) --> CodeGenerator(CAD Code Generator AI Agent)
CodeGenerator --> PythonDebugger(Python Execution and Debugging AI Agent)
PythonDebugger --> StepFile((STEP File))
StepFile --> Rendering(Rendering 3D CAD Model Image from STEP File)
Rendering --> Refiner(CAD Code Refiner AI Agent)
Refiner --> PythonDebugger
Demo
We will use the sample file here.
Input image

Generated 3D CAD model

Related Skills
node-connect
351.4kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
110.7kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
351.4kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
351.4kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
