Gokin
A powerful CLI tool that brings AI assistance directly to your terminal. Gokin understands your codebase and helps with file operations, code search, shell commands, git workflows, task management, and more - all through natural language.
Install / Use
/learn @ginkida/GokinQuality Score
Category
Development & EngineeringSupported Platforms
README

✨ Why Gokin? <a id="why-gokin"></a>
Most AI coding tools are closed-source, route your code through third-party servers, and give you zero control over what gets sent to the model. Gokin was built with a different goal: a fast, secure, zero-telemetry CLI where your code goes directly to the provider you chose — and nothing else leaves your machine.
This matters especially when you work with multiple LLM providers across different jurisdictions (DeepSeek, GLM, Kimi, MiniMax, Gemini, Claude, OpenAI). Gokin ensures that secrets, credentials, and sensitive code are automatically redacted before reaching any model, TLS is enforced on every connection, and no proxy or middleware ever touches your data. You pick the provider — Gokin handles the rest.
| Feature | Gokin | Claude Code | Cursor | |---------|-------|-------------|--------| | Price | Free → Pay-per-use | $20+/month | $20+/month | | Providers | 8 (Gemini, Claude, OpenAI, DeepSeek, GLM, Kimi, MiniMax, Ollama) | 1 (Claude) | Multi | | Offline | ✅ Ollama | ❌ | ❌ | | 54 Tools | ✅ | ~30 | ~30 | | Multi-agent | ✅ 5 parallel | Basic | ❌ | | Direct API | ✅ Zero proxies | ✅ | ❌ Routes through Cursor servers | | Security | ✅ TLS 1.2+, secret redaction (24 patterns), sandbox, 3-level permissions | Basic | Basic | | Open Source | ✅ | ❌ | ❌ | | Self-hosting | ✅ | ❌ | ❌ |
Choose your price tier:
| Stack | Cost | Best For | |-------|------|----------| | Gokin + Ollama | 🆓 Free | Privacy, offline, no API costs | | Gokin + Gemini Flash | 🆓 Free tier | Fast iterations, prototyping | | Gokin + DeepSeek | ~$1/month | Daily coding, best value | | Gokin + Kimi | Pay-per-use | Fast reasoning, 256K context | | Gokin + MiniMax | Pay-per-use | 1M context, strong coding | | Gokin + Claude | Pay-per-use | Complex reasoning | | Gokin + OpenAI | Pay-per-use | Codex models, o3 reasoning |
🚀 Installation <a id="installation"></a>
One-liner (recommended)
curl -fsSL https://raw.githubusercontent.com/ginkida/gokin/main/install.sh | sh
From source
git clone https://github.com/ginkida/gokin.git
cd gokin
go build -o gokin ./cmd/gokin
./gokin --setup
Requirements
- Go 1.25+ (build from source)
- One AI provider (see Providers below)
⚡ Quick Start <a id="quick-start"></a>
# Launch with interactive setup
gokin --setup
# Or set API key and run
export GEMINI_API_KEY="your-key"
gokin
Then just talk naturally:
> Explain how auth works in this project
> Add user registration endpoint with validation
> Run the tests and fix any failures
> Refactor this module to use dependency injection
> Create a PR for these changes
🎯 Key Features <a id="features"></a>
🧠 Smart Code Understanding
- Semantic Search — Find code by meaning, not just keywords
- Code Graph — Dependency visualization
- Multi-file Analysis — Understand entire modules
- Session Memory — Auto-summarizes your session (files, tools, errors, decisions). Survives context compaction. Optional LLM-based summarization every 3rd extraction.
📝 Multi-Layer Project Instructions
Priority: Low ──────────────────────────────── High
Global → User → Project → Local
Global: ~/.config/gokin/GOKIN.md
User: ~/.gokin/GOKIN.md
Project: ./GOKIN.md, .gokin/rules/*.md
Local: ./GOKIN.local.md (git-ignored)
- All layers merged automatically
@includedirective:@./path,@~/path,@/absolute/path- File watching with auto-reload on changes
⚒️ 54 Built-in Tools
- Files: read, write, edit, diff, batch, copy, move, delete
- Search: glob, grep, semantic_search, tree, code_graph
- Git: status, commit, diff, branch, log, blame, PR
- Run: bash, run_tests, ssh, env
- Plan: todo, task, enter_plan_mode, coordinate
- Memory: memorize, shared_memory, pin_context
- Parallel execution: Read-only tools (read, grep, glob) run in parallel when model calls multiple
🤝 Multi-Agent System
┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ Explore │────▶│ General │────▶│ Bash │
│ (read) │ │ (write) │ │ (execute) │
└─────────────┘ └─────────────┘ └─────────────┘
│ │ │
└───────────────────┴───────────────────┘
│
[Progress UI]
- Up to 5 parallel agents
- Shared memory between agents
- Automatic task decomposition
- API retry with exponential backoff — agents survive transient API errors (rate limits, timeouts, 500s)
- Provider failover — agents automatically try fallback providers when primary fails
- Real-time streaming — agent output streamed to UI as it's generated
- Git worktree support — parallel branch work with isolated sessions
💰 Cost Tracking
- Per-model pricing for all 20+ models across 8 providers
- Real-time cost in status bar (
$0.0243) - Per-response cost in message footer
/costand/statscommands with accurate model-specific pricing
🔄 Prompt Caching
- Explicit
cache_controlbreakpoints for Anthropic, MiniMax, and Kimi - System prompt, tools, and conversation prefix cached — up to 90% input cost savings
- Cache break detection with efficiency tracking
🛡️ Safety & Permissions
- 3-level permissions: Low (auto), Medium (ask once), High (always ask)
- Sandbox mode for bash commands
- Diff preview before applying changes (single-file and multi-file)
- Undo/Redo for all file operations
- Audit logging
- Proactive context compaction — predicts token growth and compacts before hitting model limits
🔒 Security & Privacy <a id="security"></a>
Zero Proxies — Your Code Goes Nowhere Except the LLM
┌──────────┐ ┌──────────────────┐
│ Gokin │ ──TLS──▶ │ Provider API │
│ (local) │ │ (Gemini/Claude/ │
│ │ ◀──TLS── │ DeepSeek/etc.) │
└──────────┘ └──────────────────┘
No middle servers. No Vercel. No telemetry proxies.
Your API key, your code, your conversation — direct.
Some CLI tools route requests through their own proxy servers (Vercel Edge, custom gateways) for telemetry, analytics, or API key management. Gokin does none of this. Every API call goes directly from your machine to the provider's endpoint. You can verify this — it's open source.
Secret Redaction in Terminal Output
LLM tool calls can accidentally expose secrets found in your codebase. Gokin automatically redacts them before they reach the model or your terminal:
| Category | Examples |
|----------|----------|
| API keys | AKIA..., ghp_..., sk_live_..., AIza... |
| Tokens | Bearer tokens, JWT (eyJ...), Slack/Discord tokens |
| Credentials | Database URIs (postgres://user:pass@...), Redis, MongoDB |
| Crypto material | PEM private keys, SSH keys |
24 regex patterns, applied to every tool result and audit log. Handles any data type — strings, maps, typed slices, structs. Custom patterns supported via API.
Defense in Depth
| Layer | What it does |
|-------|-------------|
| TLS 1.2+ enforced | No weak ciphers, certificate verification always on |
| Sandbox mode | Bash runs in isolated namespace (Linux), safe env whitelist (~35 vars) — API keys never leak to subprocesses |
| Command validation | 50+ blocked patterns: fork bombs, reverse shells, rm -rf /, credential theft, env injection |
| SSH validation | Host allowlist, loopback blocked, username injection prevention |
| Path validation | Symlink resolution, directory traversal blocked, TOCTOU prevention |
| SSRF protection | Private IPs, loopback, link-local blocked; all DNS results checked |
| Audit trail | Every tool call logged with sanitized args |
Keys Stay Local
- API keys loaded from env vars or local config (
~/.config/gokin/config.yaml) - Keys are masked in all UI displays (
sk-12****cdef) - Keys are never included in conversation history or tool results
- Ollama mode: zero network calls — fully airgapped
💾 Memory That Persists
> Remember we use PostgreSQL with pgx driver
> What were our database conventions?
- Project-specific memories
- Auto-inject relevant context
- Stored locally (your data stays yours)
☁️ Providers <a id="providers"></a>
[!IMPORTANT] Recommended providers: Gokin supports many providers, but testing every model combination is not feasible. I am confident in stable operation with these two:
- MiniMax Coding Plan (M2.7 / M2.5) — best choice for coding
- GLM Coding Plan (GL
