Cortex
Cortex is a private, secure, and highly responsive desktop AI assistant designed for seamless interaction with local Large Language Models (LLMs) through the Ollama framework. All models and data stay on your device: no cloud, no third parties.
More info -> https://dovvnloading.github.io/Cortex/
Cortex is a desktop AI assistant for running local large language models through Ollama, with a native PySide6 interface and persistent local data storage.
The project is focused on local-first operation: conversation processing, memory, translation, and chat state all run on your machine.
Table of Contents
- Overview
- Core Capabilities
- System Architecture
- Repository Layout
- Requirements
- Quick Start
- Configuration and Runtime Behavior
- Data and Persistence
- Troubleshooting
- Security and Privacy Notes
- Development
- License
Overview
Cortex combines:
- A Qt-based desktop application (PySide6) for a native UI.
- Ollama-backed model orchestration for chat, title generation, translation, and embeddings.
- Persistent conversation and memory systems (SQLite + vector and memo layers).
- Multi-threaded workers to keep the UI responsive during long-running model operations.
Core Capabilities
- Local chat with Ollama models: configurable generation model, host, and generation parameters.
- Threaded conversation management: new chat creation, title generation, and chat history handling.
- Translation pipeline: optional post-generation translation using a dedicated model.
- Suggestion generation: optional context-aware follow-up suggestions.
- Vector memory retrieval: semantic context lookup with embedding support.
- Permanent memo memory: persistent user/project memory used to improve response relevance.
- Theme and UX controls: light/dark theme support and UI state persisted via QSettings.
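The vector memory retrieval listed above can be illustrated with a plain cosine-similarity lookup. This is a hedged sketch, not the project's implementation: the `retrieve` helper and the toy 3-dimensional vectors are invented for the example (Cortex obtains real embeddings from an Ollama model such as `nomic-embed-text`).

```python
import math

def cosine(a, b):
    # Cosine similarity between two equal-length embedding vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query_vec, memory, top_k=2):
    # Rank stored (text, vector) pairs by similarity to the query vector.
    ranked = sorted(memory, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [text for text, _ in ranked[:top_k]]

# Toy "embeddings" stand in for real model output.
memory = [
    ("user prefers dark theme", [0.9, 0.1, 0.0]),
    ("project uses PySide6",    [0.1, 0.9, 0.0]),
    ("likes concise answers",   [0.8, 0.2, 0.1]),
]
print(retrieve([1.0, 0.0, 0.0], memory))
```

Real embedding vectors have hundreds of dimensions, but the ranking logic is the same.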
System Architecture
Cortex is organized around three layers:
- Presentation Layer
  - Built with PySide6 widgets and custom UI components.
  - Main window and dialogs manage chat, settings, memory controls, and translation/suggestion toggles.
- Orchestration Layer
  - The Orchestrator in Chat_LLM.py coordinates model calls, thread lifecycle, and feature toggles.
  - Worker objects and QThread usage isolate blocking operations (query execution, title generation, update checks, model connection checks).
- Data + Model Layer
  - Ollama client interaction for inference and embeddings.
  - Persistent storage for chat records and memory data.
  - Prompt-building and synthesis logic in the synthesis agent.
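The worker pattern in the orchestration layer can be sketched with the standard library (Cortex itself uses QThread via PySide6); `slow_model_call` here is a stand-in for a blocking Ollama request:

```python
import threading
import queue

def slow_model_call(prompt):
    # Placeholder for a blocking inference call (e.g. an Ollama chat request).
    return f"response to: {prompt}"

def worker(jobs, results):
    # Drain the job queue off the main (UI) thread so the UI stays responsive.
    while True:
        prompt = jobs.get()
        if prompt is None:  # sentinel: shut the worker down
            break
        results.put(slow_model_call(prompt))

jobs, results = queue.Queue(), queue.Queue()
t = threading.Thread(target=worker, args=(jobs, results), daemon=True)
t.start()
jobs.put("hello")
print(results.get())  # a Qt app would deliver this via signals, not a blocking get
jobs.put(None)
t.join()
```

In the Qt version, the worker emits a signal when the result is ready instead of the main thread blocking on a queue.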
Repository Layout
.
├── Chat_LLM/
│ ├── assets/ # Icons and prompt assets
│ └── Chat_LLM/
│ ├── Chat_LLM.py # Main application entry point + orchestrator
│ ├── main_window.py # Primary UI window
│ ├── synthesis_agent.py # Prompting + generation/translation/suggestions
│ ├── memory.py # Memory/database managers
│ ├── ui_*.py # UI components/styles/dialogs
│ └── ...
├── Cortex_Startup.py # Startup utility for Ollama setup/model pulling
├── requirements.txt # Root Python dependencies
├── index.html # Landing page
└── README.md
Requirements
Runtime
- Python 3.10+
- Ollama installed and running (default host: http://127.0.0.1:11434)
Python Dependencies
Install from the repository root:
pip install -r requirements.txt
Root dependencies currently include:
- PySide6
- markdown
- ollama
Quick Start
1) Install Ollama
Install Ollama for your platform from the official site.
2) Pull at least one chat model
Example:
ollama pull qwen3:8b
Optional models used by advanced features:
# Chat title generation
ollama pull granite4:tiny-h
# Translation
ollama pull translategemma:4b
# Embeddings for vector memory
ollama pull nomic-embed-text
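A small helper can check which of the models above are still missing before launch. This is a hypothetical sketch, not part of Cortex: it assumes the tabular output format of `ollama list` (model name in the first column, one header line).

```python
def missing_models(ollama_list_output, required):
    # `ollama list` prints a header line, then one model per line,
    # with the model name in the first column.
    installed = {line.split()[0]
                 for line in ollama_list_output.splitlines()[1:]
                 if line.strip()}
    return [m for m in required if m not in installed]

# Sample output as it might appear on a machine with two models pulled.
sample = """NAME              ID      SIZE    MODIFIED
qwen3:8b          abc123  5.2 GB  2 days ago
nomic-embed-text  def456  274 MB  2 days ago"""

required = ["qwen3:8b", "granite4:tiny-h", "translategemma:4b", "nomic-embed-text"]
print(missing_models(sample, required))
```

You would feed it the real output of `ollama list` (e.g. via `subprocess.check_output`) and pull whatever it reports missing.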
3) Launch Cortex
From repository root:
python Chat_LLM/Chat_LLM/Chat_LLM.py
4) Optional: run the setup utility
The startup utility can help install/pull models with a GUI workflow:
python Cortex_Startup.py
Configuration and Runtime Behavior
Default runtime configuration is defined in Chat_LLM/Chat_LLM/Chat_LLM.py (CONFIG dictionary), including:
- Ollama host URL
- default generation/title/translation/embedding models
- generation parameters (temperature, num_ctx, seed)
- available chat model list
- update check URL
User-specific settings (theme, feature toggles, selected models, and related UI preferences) are persisted with QSettings.
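A CONFIG dictionary of the kind described might look like the following. The keys and defaults here are illustrative guesses assembled from this README's model names, not copied from Chat_LLM.py:

```python
CONFIG = {
    "host": "http://127.0.0.1:11434",       # Ollama endpoint (README default)
    "model": "qwen3:8b",                    # generation model
    "title_model": "granite4:tiny-h",       # chat title generation
    "translation_model": "translategemma:4b",
    "embedding_model": "nomic-embed-text",  # vector memory
    "options": {"temperature": 0.7, "num_ctx": 8192, "seed": 0},
}
```

The actual dictionary in Chat_LLM.py also carries the available chat model list and the update check URL.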
Data and Persistence
Cortex uses local persistence for conversation state and memory systems. In practice, this includes:
- chat/thread records and related metadata
- vector memory embeddings and semantic retrieval context
- permanent memo-style memory for personalization
- local user settings via QSettings
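A minimal sketch of the chat-record side of this persistence, using an in-memory SQLite database (the real schema in memory.py will differ; the table and column names here are assumptions):

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # Cortex would use an on-disk database file

conn.execute("""CREATE TABLE messages (
    chat_id INTEGER,
    role    TEXT,   -- 'user' or 'assistant'
    content TEXT,
    created TEXT DEFAULT CURRENT_TIMESTAMP)""")

conn.execute("INSERT INTO messages (chat_id, role, content) VALUES (?, ?, ?)",
             (1, "user", "hello"))
conn.execute("INSERT INTO messages (chat_id, role, content) VALUES (?, ?, ?)",
             (1, "assistant", "hi there"))
conn.commit()

# Reload a chat's history in insertion order, as a UI would on startup.
history = conn.execute(
    "SELECT role, content FROM messages WHERE chat_id = ? ORDER BY rowid", (1,)
).fetchall()
print(history)
```

Keeping messages keyed by `chat_id` is what lets the chat list reload any prior thread without re-running the model.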
Troubleshooting
Ollama connection errors
- Verify Ollama is installed and running.
- Confirm the host in configuration/settings matches your local Ollama endpoint.
No response or slow response
- Ensure your selected model exists locally (ollama list).
- Reduce model size if hardware resources are limited.
- Check RAM/CPU/GPU load while generating.
Missing model errors for optional features
- Pull the required specialized model (translation/title/embedding) or disable that feature in settings.
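The feature-to-model relationship above can be expressed as a small lookup table. The mapping mirrors the optional models in Quick Start; the helper itself is hypothetical and not part of Cortex:

```python
# Optional features and the specialized model each one requires.
FEATURE_MODELS = {
    "title_generation": "granite4:tiny-h",
    "translation": "translategemma:4b",
    "vector_memory": "nomic-embed-text",
}

def required_model(feature):
    # Returns the model to pull, or None if the feature needs no extra model.
    return FEATURE_MODELS.get(feature)

print(required_model("translation"))
```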
UI startup issues
- Run from a terminal to inspect logs.
- Confirm Python dependencies are installed in the active environment.
Security and Privacy Notes
Cortex is designed for local usage, but your privacy posture still depends on local environment configuration:
- Keep Ollama bound to local interfaces unless remote access is intentionally configured.
- Review any custom model endpoints before use.
- Protect your local machine and account, since all data is stored locally.
Development
- Contribution process: see CONTRIBUTING.md.
- Security disclosures: see SECURITY.md.
- Project change history: see Change_Log.md.
License
This project is licensed under the terms in LICENSE.
