Numalogic
Collection of operational time series ML models and tools
Background
Numalogic is a collection of ML models and algorithms for operational data analytics and AIOps. At Intuit, we use Numalogic at scale for continuous real-time data enrichment, including anomaly scoring. We assign an anomaly score (ML inference) to every time-series datum, event, or message we receive on our streaming platform (for example, Kafka). 95% of our data sets are time series, and we have a complex flowchart to execute ML inference on our high-throughput sources. We run multiple models on the same datum: for example, one model that is sensitive to positive sentiments, another tuned towards negative sentiments, and another optimized for neutral sentiments. We also have a couple of ML models trained for the same data source to provide more accurate scores based on the data density in our model store. An ensemble of models is required because some composite keys in the data tend to be less dense than others; e.g., a forgot-password interaction is less frequent than a status-check interaction. At runtime, for each datum that arrives, models are picked based on a conditional forwarding filter set on the data density. ML engineers need to worry only about their inference container; they do not have to worry about data movement and quality assurance.
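The density-based model selection described above can be sketched roughly as follows. This is only an illustration and not Numalogic's API: the `Datum` class, the `pick_models` function, the model names, and the `SPARSE_THRESHOLD` value are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical cut-off: composite keys seen fewer times than this count as sparse.
SPARSE_THRESHOLD = 1_000


@dataclass
class Datum:
    composite_key: str  # e.g. "forgot-password" or "status-check"
    value: float


def pick_models(datum: Datum, density: Dict[str, int]) -> List[str]:
    """Return the (hypothetical) model names that should score this datum.

    `density` maps each composite key to the number of observations seen so far.
    Keys that were never seen are treated as having zero observations.
    """
    observed = density.get(datum.composite_key, 0)
    if observed < SPARSE_THRESHOLD:
        # Infrequent interactions are forwarded to a model trained for sparse data.
        return ["sparse-model"]
    # Frequent interactions are forwarded to a model trained for dense data.
    return ["dense-model"]


density = {"forgot-password": 50, "status-check": 50_000}
print(pick_models(Datum("forgot-password", 1.0), density))
print(pick_models(Datum("status-check", 1.0), density))
```

In a real pipeline, this decision would presumably live in the platform's conditional forwarding filter rather than in the inference container itself.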
Numalogic realtime training
For an always-on ML platform, the key requirement is the ability to train or retrain models automatically based on the incoming messages. The composite key built at runtime for each message is used to look up a matching model; if the model turns out to be stale or missing, retraining is triggered automatically. The conditional forwarding feature of the platform improves the development velocity of ML developers when they have to decide whether to forward the result further or drop it after a trigger request.
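A minimal sketch of the stale-or-missing check described above, assuming an in-memory dictionary as the model store. The names `ModelEntry`, `build_key`, `needs_retraining`, and `route`, as well as the 24-hour staleness limit, are hypothetical and are not part of Numalogic.

```python
import time
from dataclasses import dataclass
from typing import Dict, Optional, Sequence, Tuple

# Hypothetical staleness limit: models older than 24 hours are retrained.
MAX_MODEL_AGE_SECONDS = 24 * 60 * 60

Key = Tuple[str, ...]


@dataclass
class ModelEntry:
    model: object      # the trained model artifact (opaque in this sketch)
    trained_at: float  # UNIX timestamp of the last training run


def build_key(message: Dict[str, str], fields: Sequence[str]) -> Key:
    """Build the composite key for a message from the selected fields."""
    return tuple(message[field] for field in fields)


def needs_retraining(store: Dict[Key, ModelEntry], key: Key,
                     now: Optional[float] = None) -> bool:
    """Return True when the model for `key` is missing or stale."""
    entry = store.get(key)
    if entry is None:
        return True
    current = time.time() if now is None else now
    return (current - entry.trained_at) > MAX_MODEL_AGE_SECONDS


def route(store: Dict[Key, ModelEntry], message: Dict[str, str],
          fields: Sequence[str], now: Optional[float] = None) -> str:
    """Decide whether a message should trigger training or go on to inference."""
    key = build_key(message, fields)
    return "train" if needs_retraining(store, key, now) else "infer"
```

For example, with `store = {("login", "status-check"): ModelEntry(model=None, trained_at=time.time())}`, a message whose composite key matches that entry is routed to `"infer"`, while a message with any other key is routed to `"train"`.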
Key Features
- Ease of use: simple and efficient tools for predictive data analytics
- Reusability: all the functionalities can be re-used in various contexts
- Model selection: easy to compare, validate, fine-tune and choose the model that works best with each data set
- Data processing: readily available feature extraction, scaling, transforming and normalization tools
- Extensibility: add your own functions or extend the existing capabilities
- Model storage: out-of-the-box support for MLflow, and support for other ML model lifecycle management tools
Use Cases
- Deployment failure detection
- System failure detection for node failures or crashes
- Fraud detection
- Network intrusion detection
- Forecasting on time series data
Getting Started
For set-up information and running your first pipeline using numalogic, please see our getting started guide.
Installation
Numalogic requires Python 3.8 or higher.
Prerequisites
Numalogic needs PyTorch and PyTorch Lightning to work. Because these packages are platform dependent, they are not included in the numalogic package itself, so please install them first.
Numalogic supports PyTorch versions 2.0.0 and above.
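Before installing, it may help to confirm that a suitable PyTorch is already present. The following is a small sketch using only the standard library; the helper names `version_tuple` and `check_torch` are hypothetical, and the check assumes PyTorch was installed under the distribution name `torch`.

```python
import re
from importlib import metadata
from typing import List, Tuple

MIN_TORCH = (2, 0, 0)  # minimum PyTorch version stated above


def version_tuple(version: str) -> Tuple[int, ...]:
    """Parse the leading 'major.minor.patch' numbers of a version string.

    Local suffixes such as '+cu118' and pre-release tags such as 'rc1' are ignored.
    """
    parts: List[int] = []
    for piece in version.split("+")[0].split(".")[:3]:
        match = re.match(r"\d+", piece)
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def check_torch() -> List[str]:
    """Return a list of problems found; an empty list means the check passed."""
    try:
        installed = metadata.version("torch")
    except metadata.PackageNotFoundError:
        return ["torch is not installed"]
    if version_tuple(installed) < MIN_TORCH:
        return [f"torch {installed} is older than 2.0.0"]
    return []


if __name__ == "__main__":
    for problem in check_torch():
        print(problem)
```

PyTorch Lightning is not checked here; a similar lookup could be added once the distribution name used in your environment is known.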
numalogic can be installed using pip.
pip install numalogic
If using mlflow for model registry, install using:
pip install numalogic[mlflow]
Build locally
- Install Poetry:
  curl -sSL https://install.python-poetry.org | python3 -
- To activate virtual env:
  poetry shell
- To install dependencies:
  poetry install --with dev,torch
  If extra dependencies are needed:
  poetry install --all-extras
- To run unit tests:
  make test
- To format code style using black and ruff:
  make lint
- Setup pre-commit hooks:
  pre-commit install
Contributing
We would love contributions to the numalogic project in one of the following (but not limited to) areas:
- Adding new time series anomaly detection models
- Making it easier to add user's custom models
- Support for additional model registry frameworks
For contribution guidelines, please refer here.
