llmfit

<p align="center"> <img src="assets/icon.svg" alt="llmfit icon" width="128" height="128"> </p> <p align="center"> <b>English</b> · <a href="README.zh.md">中文</a> </p> <p align="center"> <a href="https://github.com/AlexsJones/llmfit/actions/workflows/ci.yml"><img src="https://github.com/AlexsJones/llmfit/actions/workflows/ci.yml/badge.svg" alt="CI"></a> <a href="https://crates.io/crates/llmfit"><img src="https://img.shields.io/crates/v/llmfit.svg" alt="Crates.io"></a> <a href="LICENSE"><img src="https://img.shields.io/badge/license-MIT-blue.svg" alt="License"></a> </p>

Hundreds of models & providers. One command to find what runs on your hardware.

A terminal tool that right-sizes LLM models to your system's RAM, CPU, and GPU. Detects your hardware, scores each model across quality, speed, fit, and context dimensions, and tells you which ones will actually run well on your machine.

Ships with an interactive TUI (default) and a classic CLI mode. Supports multi-GPU setups, MoE architectures, dynamic quantization selection, speed estimation, and local runtime providers (Ollama, llama.cpp, MLX, Docker Model Runner, LM Studio).

Sister projects:

  • sympozium — managing agents in Kubernetes.
  • llmserve — a simple TUI for serving local LLM models. Pick a model, pick a backend, serve it.



Install

Windows

scoop install llmfit

If Scoop is not installed, follow the Scoop installation guide.

macOS / Linux

Homebrew

brew install llmfit

MacPorts

port install llmfit

Quick install

curl -fsSL https://llmfit.axjns.dev/install.sh | sh

Downloads the latest release binary from GitHub and installs it to /usr/local/bin (or to ~/.local/bin if sudo is unavailable).

Install to ~/.local/bin without sudo:

curl -fsSL https://llmfit.axjns.dev/install.sh | sh -s -- --local

Docker / Podman

docker run ghcr.io/alexsjones/llmfit

This prints the JSON output of the llmfit recommend command. The JSON can be queried further with jq:

podman run ghcr.io/alexsjones/llmfit recommend --use-case coding | jq '.models[].name'
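The same extraction can be scripted when jq is not available. The sketch below assumes only what the jq filter above implies: a top-level `models` array whose entries carry a `name` field. The sample payload and its model names are placeholders, not real llmfit output.

```python
import json


def model_names(payload: str) -> list[str]:
    """Return the name of every entry in the top-level "models" array."""
    data = json.loads(payload)
    # Mirrors the jq filter '.models[].name'; any other fields are ignored.
    return [model["name"] for model in data["models"]]


# Placeholder payload for illustration; real output carries more fields.
sample = '{"models": [{"name": "example-model-a"}, {"name": "example-model-b"}]}'
print(model_names(sample))
```

To use it on real output, pipe the container's stdout into the script and pass `sys.stdin.read()` in place of `sample`.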

From source

git clone https://github.com/AlexsJones/llmfit.git
cd llmfit
cargo build --release
# binary is at target/release/llmfit

Usage

TUI (default)

llmfit

Launches the interactive terminal UI. Your system specs (CPU, RAM, GPU name, VRAM, backend) are shown at the top. Models are listed in a scrollable table sorted by composite score. Each row shows the model's score, estimated tok/s, best quantization for your hardware, run mode, memory usage, and use-case category.

| Key | Action |
|---|---|
| Up / Down or j / k | Navigate models |
| / | Enter search mode (partial match on name, provider, params, use case) |
| Esc or Enter | Exit search mode |
| Ctrl-U | Clear search |
| f | Cycle fit filter: All, Runnable, Perfect, Good, Marginal |
| a | Cycle availability filter: All, GGUF Avail, Installed |
| s | Cycle sort column: Score, Params, Mem%, Ctx, Date, Use Case |
| v | Enter Visual mode (select multiple models) |
| V | Enter Select mode (column-based filtering) |
| t | Cycle color theme (saved automatically) |
| p | Open Plan mode for selected model (hardware planning) |
| P | Open provider filter popup |
| U | Open use-case filter popup |
| C | Open capability filter popup |
| L | Open license filter popup |
| R | Open runtime/backend filter popup (llama.cpp, MLX, vLLM) |
| h | Open help popup (all key bindings) |
| m | Mark selected model for compare |
| c | Open compare view (marked vs selected) |
| x | Clear compare mark |
| i | Toggle installed-first sorting (any detected runtime provider) |
| d | Download selected model (provider picker when multiple are available) |
| r | Refresh installed models from runtime providers |
| Enter | Toggle detail view for selected model |
| PgUp / PgDn | Scroll by 10 |
| g / G | Jump to top / bottom |
| q | Quit |
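The model list is sorted by composite score, but this README does not give llmfit's scoring formula. As a rough illustration of how the four dimensions (quality, speed, fit, context) could combine into one sortable number, the sketch below uses an invented weighted average. The weights, the 0-100 scale, and all names are assumptions, not llmfit's implementation.

```python
from dataclasses import dataclass

# Hypothetical weights; llmfit's real weighting is not documented here.
WEIGHTS = {"quality": 0.4, "speed": 0.3, "fit": 0.2, "context": 0.1}


@dataclass
class ModelScores:
    name: str
    quality: float  # each dimension on a 0-100 scale (assumed)
    speed: float
    fit: float
    context: float

    def composite(self) -> float:
        """Weighted average of the four dimension scores."""
        return sum(getattr(self, dim) * w for dim, w in WEIGHTS.items())


def rank(models: list[ModelScores]) -> list[ModelScores]:
    """Sort models by composite score, best first."""
    return sorted(models, key=lambda m: m.composite(), reverse=True)
```

With weights like these, a model that is strong on speed and fit can outrank one that is strong on quality alone, which is the kind of trade-off a composite score is meant to capture.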

Vim-like modes

The TUI uses Vim-inspired modes shown in the bottom-left status bar. The current mode determines which keys are active.

Normal mode

The default mode. Navigate, search, filter, and open views. All keys in the table above apply here.

Visual mode (v)

Select a contiguous range of models for bulk comparison. Press v to anchor at the current row, then navigate with j/k or arrow keys to extend the selection. Selected rows are highlighted.

| Key | Action |
|---|---|
| j / k or arrows | Extend selection up/down |
| c | Compare all selected models (opens multi-compare view) |
| m | Mark current model for two-model compare |
| Esc or v | Exit Visual mode |
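Anchoring with v and then moving the cursor selects every row between the two positions, whichever direction you move. A minimal sketch of that range logic, assuming zero-based row indices and a selection that includes both ends (the function name is illustrative and not taken from llmfit's source):

```python
def visual_selection(anchor: int, cursor: int) -> range:
    """Rows covered by a Visual-mode selection, inclusive of both ends."""
    start, end = sorted((anchor, cursor))
    return range(start, end + 1)


# Anchor at row 5, then move up two rows to row 3.
print(list(visual_selection(5, 3)))  # → [3, 4, 5]
```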

The multi-compare view displays a table where rows are attributes (Score, tok/s, Fit, Mem%, Params, Mode, Context, Quant, etc.) and columns are models. Best values are highlighted. Use h/l or arrow keys to scroll horizontally if more models are selected than fit on screen.
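This layout is a pivot of the usual model list: one row per attribute, one column per model. The sketch below shows the transposition and the best-value marking. It assumes higher is better for Score and tok/s and lower is better for Mem%; those directions, the field names, and the sample figures are assumptions for illustration only.

```python
# Attribute -> True if a higher value is better (assumed directions).
HIGHER_IS_BETTER = {"Score": True, "tok/s": True, "Mem%": False}


def compare_table(models: dict[str, dict[str, float]]) -> dict[str, dict[str, str]]:
    """Pivot {model: {attr: value}} into {attr: {model: cell}}, starring the best."""
    table = {}
    for attr, higher in HIGHER_IS_BETTER.items():
        values = {name: attrs[attr] for name, attrs in models.items()}
        best = max(values.values()) if higher else min(values.values())
        table[attr] = {
            name: f"{value}*" if value == best else f"{value}"
            for name, value in values.items()
        }
    return table


# Made-up numbers for two placeholder models.
models = {
    "model-a": {"Score": 82, "tok/s": 35, "Mem%": 60},
    "model-b": {"Score": 75, "tok/s": 50, "Mem%": 45},
}
for attr, row in compare_table(models).items():
    print(attr, row)
```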

Select mode (V)

Column-based filtering. Press V (shift-v) to enter Select mode, then use h/l or arrow keys to move between column headers. The active column is visually highlighted. Press Enter or Space to activate the appropriate filter for that column:

| Column | Filter action |
|---|---|
| Inst | Cycle availability filter |
| Model | Enter search mode |
| Provider | Open provider popup |
| Params | Open parameter-size bucket popup (<3B, 3-7B, 7-14B, 14-30B, 30-70B, 70B+) |
| Score, tok/s, Mem%, Ctx, Date | Sort by that column |
| Quant | Open quantization popup |
| Mode | Open run-mode popup (GPU, MoE, CPU+GPU, CPU) |
| Fit | Cycle fit filter |
| Use Case | Open use-case popup |

Row navigation (j/k) still works in Select mode so you can see the effect of filters as you apply them. Press Esc to return to Normal mode.
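The Params popup groups models into six size buckets. The sketch below maps a parameter count to one of those labels. The README does not say which bucket a model sitting exactly on a boundary (for example 7B) falls into, so treating each upper bound as exclusive is an assumption.

```python
# Upper bounds (in billions of parameters) and labels from the Params popup.
BUCKETS = [(3, "<3B"), (7, "3-7B"), (14, "7-14B"), (30, "14-30B"), (70, "30-70B")]


def param_bucket(billions: float) -> str:
    """Map a parameter count to its bucket; upper bounds are exclusive (assumed)."""
    for upper, label in BUCKETS:
        if billions < upper:
            return label
    return "70B+"


for size in (1.5, 7, 13, 70):
    print(size, param_bucket(size))
```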

TUI Plan mode (p)

Plan mode inverts normal fit analysis: instead of asking "what fits my hardware?", it estimates "what hardware is needed for this model config?".

Use p on a selected row, then:

| Key | Action |
|---|---|
| Tab / j / k | Move between editable fields (Context, Quant, Target TPS) |
| Left / Right | Move cursor in current field |
| Type | Edit current field |
| Backspace / Delete | Remove characters |
| Ctrl-U | Clear current field |
| Esc or q | Exit Plan mode |

Plan mode shows estimates for:

  • minimum and recommended VRAM/RAM/CPU cores
  • feasible run paths (GPU, CPU offload, CPU-only)
  • upgrade deltas to reach better fit targets
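llmfit's estimation model is not spelled out in this README. A common back-of-envelope approach is to size the weights as parameters × bits per weight / 8 and add a KV cache that grows linearly with context length. The sketch below follows that approach to show why the Context and Quant fields change the memory estimate. Every function name, parameter, and figure in it is an assumption for illustration, and real runtimes add overhead that it ignores.

```python
def weights_gb(params_billions: float, bits_per_weight: float) -> float:
    """Memory for model weights in GB (1 GB = 1e9 bytes)."""
    return params_billions * bits_per_weight / 8


def kv_cache_gb(layers: int, kv_heads: int, head_dim: int,
                context: int, bytes_per_value: int = 2) -> float:
    """KV cache size in GB: keys and values for every layer and token."""
    return 2 * layers * kv_heads * head_dim * context * bytes_per_value / 1e9


def min_memory_gb(params_billions, bits_per_weight, layers, kv_heads,
                  head_dim, context):
    """Weights plus KV cache; activations and runtime overhead are ignored."""
    return (weights_gb(params_billions, bits_per_weight)
            + kv_cache_gb(layers, kv_heads, head_dim, context))


# Hypothetical 8B model, 4-bit quantization, 32 layers, 8 KV heads of 128 dims.
for ctx in (4096, 32768):
    print(ctx, round(min_memory_gb(8, 4, 32, 8, 128, ctx), 2))
```

Under these assumptions the weights stay fixed at 4 GB while the KV cache grows eightfold between the two context lengths, which is why a longer context can push a model from a GPU-only path to CPU offload.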

Themes

Press t to cycle through 10 built-in color themes. Your selection is saved automatically to ~/.config/llmfit/theme and restored on next launch.

| Theme | Description | |-----------------------
