Aibench
AIBench, a tool for comparing and evaluating AI serving solutions. forked from [tsbs](https://github.com/timescale/tsbs) and adapted to AI serving use case
Install / Use
/learn @RedisAI/AibenchREADME
aibench
This repo contains code for benchmarking deep learning solutions, including RedisAI. This code is based on a fork of work initially made public by TSBS at https://github.com/timescale/tsbs.
Current DL solutions supported:
- RedisAI: an AI serving engine for real-time applications built by Redis Labs and Tensorwerk, seamlessly plugged into Redis.
- Nvidia Triton Inference Server: An open source inference serving software that lets teams deploy trained AI models from any framework (TensorFlow, TensorRT, PyTorch, ONNX Runtime, or a custom framework), from local storage or Google Cloud Platform or AWS S3 on any GPU- or CPU-based infrastructure.
- TorchServe: built and maintained by Amazon Web Services (AWS) in collaboration with Facebook, TorchServe is available as part of the PyTorch open-source project.
- Tensorflow Serving: a high-performance serving system, wrapping TensorFlow and maintained by Google.
- Common REST API serving: a common DL production grade setup with Gunicorn (a Python WSGI HTTP server) communicating with Flask through a WSGI protocol, and using TensorFlow as the backend.
Current use cases
Currently, aibench supports two use cases:
-
creditcard-fraud [details here]: from Kaggle with the extension of reference data. This use-case aims to detect a fraudulent transaction based on anonymized credit card transactions and reference data.
-
vision-image-classification[details here]: an image-focused use-case that uses one network “backbone”: MobileNet V1, which can be considered as one of the standards by the AI community. To assess inference performance we’re recurring to COCO 2017 validation dataset (a large-scale object detection, segmentation, and captioning dataset).
Current DL solutions supported per use case:
| Use case/Inference Server | model | RedisAI | TensorFlow Serving | Torch Serve | Nvidia Triton | Rest API | |--------------------------------|----------|----------|--------------------|-------------|---------------|----------| | Vision Benchmark (CPU/GPU) (details) | mobilenet-v1 (224_224)| :heavy_check_mark: | Not supported | Not supported | :heavy_check_mark: | Not supported | | Fraud Benchmark (CPU) (details) | Non standard Kaggle Model with the extension of reference data | :heavy_check_mark: docs | :heavy_check_mark: docs | :heavy_check_mark: docs | Not supported | :heavy_check_mark: docs |
Installation
The easiest way to get and install the go benchmark programs is to use go get and then issuing make:
# Fetch aibench and its dependencies
go get github.com/RedisAI/aibench
cd $GOPATH/src/github.com/RedisAI/aibench
make
Blogs/White-papers that reference this tool
- [1] Announcing RedisAI 1.0: AI Serving Engine for Real-Time Applications, May 19, 2020
Related Skills
node-connect
352.0kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
111.1kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
352.0kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
352.0kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
