# OpenAlerts

An alerting layer for agentic frameworks.

## Install / Use

```
/learn @steadwing/OpenalertsREADME
```
AI agents fail silently. LLM errors, stuck sessions, token blowups - nobody knows until a user complains.
OpenAlerts watches your agent in real-time and alerts you the moment something goes wrong. Runs fully locally - no external services, no cloud dependencies, everything stays on your machine.
## Dashboard

<p align="center"> <img width="1728" height="919" alt="OpenAlerts dashboard" src="https://github.com/user-attachments/assets/f385477b-817a-47c2-8591-e4ef82529ade" /> </p>

## Quickstart
<details open> <summary><b>Python</b> - for <a href="https://github.com/crewAIInc/crewAI">CrewAI</a>, <a href="https://github.com/FoundationAgents/OpenManus">OpenManus</a>, and <a href="https://github.com/HKUDS/nanobot">nanobot</a></summary>

### Install

```bash
pip install openalerts

# For CrewAI support
pip install openalerts crewai
```
<details open>
<summary><b>CrewAI</b></summary>

```python
import asyncio

import openalerts
from crewai import Agent, Task, Crew


async def main():
    # Dashboard starts at http://localhost:9464/openalerts
    await openalerts.init({"framework": "crewai"})

    # Use CrewAI as normal — automatically monitored
    researcher = Agent(
        role="Researcher",
        goal="Research topics thoroughly",
        backstory="You are an expert researcher.",
        llm="gpt-4o-mini",
    )
    task = Task(
        description="Research the benefits of AI monitoring",
        expected_output="A short summary",
        agent=researcher,
    )
    crew = Crew(agents=[researcher], tasks=[task])
    result = crew.kickoff()
    print(result)


asyncio.run(main())
```
The CrewAI adapter uses CrewAI's native event bus — no monkey-patching. Every crew run, agent execution, task step, tool call, and LLM call is tracked automatically with full session correlation (Crew = session, Agent = subagent, Task = step).
</details>

<details>
<summary><b>OpenManus</b></summary>

```python
import asyncio

import openalerts
from app.agent.manus import Manus


async def main():
    # Dashboard starts at http://localhost:9464/openalerts
    await openalerts.init({"framework": "openmanus"})

    # Use your agents as normal — they're automatically monitored
    agent = Manus()
    await agent.run("Research quantum computing")


asyncio.run(main())
```
</details>
<details>
<summary><b>nanobot</b></summary>

```python
import asyncio

import openalerts
from nanobot.agent.loop import AgentLoop
from nanobot.bus.queue import MessageBus
from nanobot.providers.litellm_provider import LiteLLMProvider


async def main():
    await openalerts.init({"framework": "nanobot"})

    provider = LiteLLMProvider(api_key="sk-...", default_model="gpt-4o-mini")
    agent = AgentLoop(
        bus=MessageBus(),
        provider=provider,
        workspace="./workspace",
    )
    response = await agent.process_direct("Research quantum computing")
    print(response)


asyncio.run(main())
```
The nanobot adapter also tracks subagent lifecycle — `subagent.spawn`, `subagent.end`, and `subagent.error` events are captured automatically when `SubagentManager` is used, with parent/child session correlation.

</details>
That's it. A real-time dashboard starts at http://localhost:9464/openalerts. OpenAlerts auto-instruments the configured framework so every event flows through the monitoring engine. Cleanup happens automatically on exit. All events are persisted to `~/.openalerts/` as JSONL.
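Because events land on disk as JSONL, you can post-process them with nothing but the standard library. A minimal sketch, assuming one JSON object per line (the exact filenames and event schema under `~/.openalerts/` are not specified here, so inspect the directory first):

```python
import json
from pathlib import Path


def read_events(path):
    """Parse a JSONL event file into a list of dicts, skipping blank lines."""
    events = []
    for line in Path(path).read_text().splitlines():
        if line.strip():
            events.append(json.loads(line))
    return events
```

From there it is a one-liner to, say, count events by type or grep for failures in a notebook.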
Optionally, add channels (Slack, Discord, webhooks) to get alerts delivered when things go wrong.
### Standalone Dashboard

By default, the dashboard runs in-process - when your agent exits, the dashboard dies too. For a persistent dashboard that survives agent restarts:

```bash
# Terminal 1 — start persistent dashboard (stays running)
openalerts serve

# Terminal 2 — run your agent (writes events, no dashboard of its own)
python my_agent.py
```
Disable the in-process dashboard when using standalone mode:

```python
await openalerts.init({
    "dashboard": False,
    "channels": [...]
})
```

```bash
openalerts serve [--port 9464] [--state-dir ~/.openalerts] [--log-level INFO]
```

Also works via `python -m openalerts serve`.
### Channels

```python
# Slack
{"type": "slack", "webhook_url": "https://hooks.slack.com/services/..."}

# Discord
{"type": "discord", "webhook_url": "https://discord.com/api/webhooks/..."}

# Generic webhook
{"type": "webhook", "webhook_url": "https://your-server.com/alerts", "headers": {"Authorization": "Bearer ..."}}
```
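If you point the generic webhook channel at your own endpoint, any HTTP server that accepts a JSON POST will do. A minimal stdlib sketch; the alert payload's field names are an assumption, so log the body to see the real schema:

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


class AlertHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read and decode the JSON alert body (shape is an assumption).
        length = int(self.headers.get("Content-Length", 0))
        alert = json.loads(self.rfile.read(length) or b"{}")
        print("alert received:", alert)
        # Acknowledge so the channel treats the delivery as successful.
        self.send_response(200)
        self.end_headers()


def serve(port: int = 8080):
    """Block forever, handling alert POSTs on 127.0.0.1:<port>."""
    HTTPServer(("127.0.0.1", port), AlertHandler).serve_forever()
```

In production you would also check the `Authorization` header configured on the channel before accepting the request.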
Or via environment variables (no code changes needed):

```bash
OPENALERTS_SLACK_WEBHOOK_URL="https://hooks.slack.com/services/..."
OPENALERTS_DISCORD_WEBHOOK_URL="https://discord.com/api/webhooks/..."
OPENALERTS_WEBHOOK_URL="https://your-server.com/alerts"
```
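The intent is one env var per channel type. As an illustration of that mapping only (this mirrors, rather than reproduces, OpenAlerts' internal resolution logic):

```python
import os

# Env var -> channel type, per the variables listed above.
_ENV_TO_TYPE = {
    "OPENALERTS_SLACK_WEBHOOK_URL": "slack",
    "OPENALERTS_DISCORD_WEBHOOK_URL": "discord",
    "OPENALERTS_WEBHOOK_URL": "webhook",
}


def channels_from_env(env=None):
    """Build channel configs from whichever of the env vars are set."""
    env = os.environ if env is None else env
    return [
        {"type": kind, "webhook_url": env[var]}
        for var, kind in _ENV_TO_TYPE.items()
        if env.get(var)
    ]
```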
### Configuration

```python
await openalerts.init({
    "channels": [...],
    "rules": {
        "llm-errors": {"threshold": 3},
        "high-error-rate": {"enabled": False},
        "tool-errors": {"cooldown_seconds": 1800},
    },
    "cooldown_seconds": 900,
    "max_alerts_per_hour": 5,
    "quiet": False,
    "dashboard": True,
    "dashboard_port": 9464,
    "state_dir": "~/.openalerts",
    "log_level": "INFO",
})
```
### API

```python
engine = await openalerts.init({...})  # async init
engine = openalerts.init_sync({...})   # sync init
await openalerts.send_test_alert()     # verify channels
engine = openalerts.get_engine()       # get engine instance
await openalerts.shutdown()            # optional — runs automatically on exit
```
</details>
<details>
<summary><b>Node</b> - for <a href="https://github.com/openclaw/openclaw">OpenClaw</a></summary>
### Install

```bash
npm install -g @steadwing/openalerts
```

Requires Node.js >= 22.5.0 (uses the built-in `node:sqlite` module — no native builds).
### Usage

```bash
# 1. Create default config (auto-detects your OpenClaw gateway token)
openalerts init

# 2. Edit config to add your alert channel
#    ~/.openalerts/config.json

# 3. Start monitoring
openalerts start
```
Dashboard at http://127.0.0.1:4242 — the gateway overlay dismisses automatically once connected. No code changes to OpenClaw needed — runs as a separate process alongside it.
### CLI

| Command | Description |
|---|---|
| `openalerts init` | Create default config at `~/.openalerts/config.json` |
| `openalerts start` | Start the monitoring daemon |
| `openalerts status` | Print live engine state (daemon must be running) |
| `openalerts test` | Fire a test alert through all configured channels |
| `openalerts mcp` | Start the MCP server for AI assistant integration (stdio) |
### Configuration

`~/.openalerts/config.json` (created by `openalerts init`):

```json
{
  "gatewayUrl": "ws://127.0.0.1:18789",
  "gatewayToken": "<auto-detected from ~/.openclaw/openclaw.json>",
  "stateDir": "~/.openalerts",
  "server": { "port": 4242, "host": "127.0.0.1" },
  "channels": [
    { "type": "telegram", "token": "BOT_TOKEN", "chatId": "CHAT_ID" },
    { "type": "webhook", "webhookUrl": "https://your-endpoint" },
    { "type": "console" }
  ],
  "quiet": false
}
```
</details>
## Dashboard

A real-time web dashboard starts automatically and shows everything happening inside your agents:

- **Activity** - step-by-step execution timeline with tool calls, LLM usage, and costs
- **Health** - rule status, alert history, system stats
- **Debug** - state snapshot for troubleshooting

Python: http://localhost:9464/openalerts | Node: http://127.0.0.1:4242
## Alert Rules

All rules run against every event in real-time. Thresholds and cooldowns are configurable.

| Rule | Watches for | Severity | Default threshold |
|---|---|---|---|
| `llm-errors` | LLM/agent failures in 1-min window | ERROR | 1 error |
| `tool-errors` | Tool execution failures in 1-min window | WARN | 1 error |
| `high-error-rate` | Failure rate over last 20 calls | ERROR | 50% |
| `agent-stuck` / `session-stuck` | Agent idle too long | WARN | 120000 ms |
| `token-limit` | Token limit exceeded | ERROR | - |
| `step-limit-warning` | Agent reaches 80% of max_steps | WARN | - |
| `subagent-errors` | Subagent failures in 1-min window (Python) | WARN | 1 error |
| `infra-errors` | Infrastructure errors (Node) | ERROR | 1 error |
| `gateway-down` | No heartbeat received (Node) | CRITICAL | 30000 ms |
| `queue-depth` | Queued items piling up (Node) | WARN | 10 items |
| `heartbeat-fail` | Consecutive heartbeat failures (Node) | ERROR | 3 failures |
Every rule also accepts `enabled: false` to disable it, along with per-rule `threshold` and `cooldown_seconds` overrides (see Configuration above).
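To make the `high-error-rate` semantics concrete, here is an illustrative sliding-window check using the defaults from the table (last 20 calls, 50% failure rate). This is a sketch of the rule's idea, not OpenAlerts' actual implementation:

```python
from collections import deque


class ErrorRateWindow:
    """Fires when at least `threshold` of the last `size` calls failed."""

    def __init__(self, size: int = 20, threshold: float = 0.5):
        self.calls = deque(maxlen=size)
        self.threshold = threshold

    def record(self, ok: bool) -> bool:
        """Record one call outcome; return True when the rule should fire."""
        self.calls.append(ok)
        if len(self.calls) < self.calls.maxlen:
            return False  # not enough history yet
        failures = sum(1 for c in self.calls if not c)
        return failures / len(self.calls) >= self.threshold
```

Raising `threshold` or widening `size` trades alert sensitivity for noise, which is the same trade-off the per-rule overrides expose.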
