SkillAgentSearch skills...

Bagofwords

Chat with your data - with memory, rules, and observability built in. Deploy in 2 minutes

Install / Use

/learn @bagofwords1/Bagofwords
About this skill

Quality Score

0/100

Category

Operations

Supported Platforms

Universal

README

<div> <div> <img src="./media/logo-128.png" style="width:128px;" /> </div>

The open-source agentic analytics platform

Chat, visualize, and run deep analysis on your data - with memory, rules management, and observability built in.

Website Docs Docker e2e tests

</div>

Features:

  • Memory: Stores context, preferences, and usage patterns down to table/column level. AI learns over time and surfaces semantic-layer updates, review-gated.

  • Rules: Define instructions, terms, and guardrails with versioning and approval workflows. Git sync with auto-indexing of dbt, markdown, code, and more.

  • Observability: Full visibility into agent traces, plans, guardrails, LLM judges, evals, and feedback — debug and improve the loop.

  • Dashboards & sharing: Build and share dashboards instantly. Share conversations, saved queries, and analysis with your team.

  • Any LLM ↔ Any warehouse: OpenAI, Anthropic, Gemini, Ollama ↔ Snowflake, BigQuery, Postgres, Redshift. Swap or mix without breaking workflows.

  • MCP: Use from Cursor, Claude Desktop, and other AI clients while reliably tracking requests and data operations.

Deploy anywhere

Demo


Quick Start 🚀

# runs with SQLite (default)
docker run -p 3000:3000 bagofwords/bagofwords

Or, run with a ready PostgreSQL instance

docker run -p 3000:3000 \
  -e BOW_DATABASE_URL=postgresql://user:password@localhost:5432/dbname \
  bagofwords/bagofwords

Other deployment options

docker compose and Kubernete deployments are also available (and recommended for servers). See the docs.

Product Overview

Chat with any data

Create reports, deep analysis or quick visuals with an AI interface powered by an agentic-loop with tools, reasoning and reflection built in.

<div style="text-align: center; margin: 20px 0;"> <img src="./media/chat.png" alt="Bag of words" style="width: 100%; max-width: 1200px;"> <i></i> </div>

Instructions Registry

Manage AI rules and instructions with review process, full versioning and tracking. Sync with git to auto-index dbt, Tableau, AGENTS.md and more.

<div style="text-align: center; margin: 20px 0;"> <img src="./media/instructions-table.png" alt="Bag of words" style="width: 100%; max-width: 1200px;"> <i></i> </div>

Save data and queries to the Catalog

Leverage the catalog to store, share, and explore reusable queries and datasets. This feature also improves discoverability and searchability for AI, contributing to smarter AI decisions.

<div style="text-align: center; margin: 20px 0;"> <img src="./media/catalog.png" alt="Bag of words" style="width: 100%; max-width: 1200px;"> </div>

Monitor AI and data operations

Full observability into queries, feedback, and context — powering self-learning and high quality AI results

<div style="text-align: center; margin: 20px 0;"> <img src="./media/monitoring.png" alt="Bag of words" style="width: 100%; max-width: 1200px;"> </div>

Architecture

Bag of words acts as a context-aware analytics layer that connects to any database or service, works with any LLM, and enriches queries with docs, BI models, or code.

The architecture is fully flexible: plug in any data source, any model, and any interface — giving your team maximum freedom of choice, without sacrificing governance or reliability.

<div style="text-align: center; margin: 20px 0;"> <img src="./media/arch.png" alt="Bag of words" style="width: 100%; max-width: 1200px;"> </div>

Integrations

Supported LLM Integrations

Bag of words supports a wide range of LLM providers out of the box. You can bring your own API key for any of the following:

| Provider | Supported Models / APIs | Notes | |------------------|---------------------------------|-----------------------------------------------------------------------| | OpenAI | GPT models, o-models, etc. | Any OpenAI-compatible endpoint (including self-hosted, vLLM, etc.) | | Azure OpenAI | GPT models, o-models, etc. | Azure resource/endpoint support, including model deployment names | | Google Gemini| Gemini, Flash versions, etc. | Requires Google Cloud API key | | Anthropic | Claude, Sonnet, Haiku | Just provide the API key | | Any OpenAI-compatible | vLLM, LM Studio, Ollama, etc. | Just provide the base URL and API key |

Tip: You can configure multiple providers and models, set defaults, and more.

Data Sources

Supported Data Sources

Below is a list of all data sources supported by Bag of words, as defined in the data source registry. Each entry is marked as either a Database/Warehouse or a Service.

| Title | Kind | |--------------------------|---------------------| | PostgreSQL | Database/Warehouse | | Snowflake | Database/Warehouse | | Google BigQuery | Database/Warehouse | | Databricks SQL | Database/Warehouse | | Microsoft Fabric | Database/Warehouse | | NetSuite | Service | | MySQL | Database/Warehouse | | AWS Athena | Database/Warehouse | | MariaDB | Database/Warehouse | | DuckDB | Database/Warehouse | | Salesforce | Service | | Microsoft SQL Server | Database/Warehouse | | ClickHouse | Database/Warehouse | | Azure Data Explorer | Database/Warehouse | | AWS Cost Explorer | Service | | Vertica | Database/Warehouse | | AWS Redshift | Database/Warehouse | | Tableau | Service | | Presto | Database/Warehouse | | Apache Pinot | Database/Warehouse | | Oracle DB | Database/Warehouse | | MongoDB | Database/Warehouse | | PostHog | Service | | PowerBI | Service | | QlikView | Service | | Sybase SQL Anywhere | Database/Warehouse | | Sisense BI | Service |

Note: Some data sources (like NetSuite) may be marked as inactive or beta in the registry. "Service" refers to APIs or SaaS platforms, while "Database/Warehouse" refers to systems that store and query structured data.

Enterprise

For teams requiring advanced security, compliance, and governance:

  • SSO: Google Workspace and OIDC-compatible identity providers
  • Self-hosted: Deploy on your own infrastructure with full data control
  • RBAC: Role-based access control for fine-grained permissions

🔒 Security & Privacy

We take data security and privacy seriously.

Telemetry

By default, Bag of words captures basic usage stats of self-hosted instances to a centralized server. The data helps us improve the product.

You can disable by setting in bow-config.yaml

telemetry
  enabled: false

You can also disable the Intercom chat for support

intercom
  enabled: false

Related Skills

View on GitHub
GitHub Stars434
CategoryOperations
Updated10h ago
Forks77

Languages

Python

Security Score

85/100

Audited on Apr 6, 2026

No findings