Nxml
LLaMA from First Principles
Install / Use
/learn @lachlansneff/NxmlREADME
LLaMA from First Principles
I wanted to learn more about how transfomers worked, so I spent a night hacking at an implementation of LLaMA from scratch.
No ML frameworks. No BLAS. Minimal abstractions. Clarity over performance.
I haven't finished it, but it's probably more than half way finished.
Remaining pieces
- Rotary Position Embedding
- Finish adding all the layer operations
- Fix all the bugs in the matrix multiplication implementations
- Token decoding
Related Skills
node-connect
337.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
83.2kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
337.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
83.2kCommit, push, and open a PR
