SkillAgentSearch skills...

Mmeval

A unified evaluation library for multiple machine learning libraries

Install / Use

/learn @open-mmlab/Mmeval
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

<div align="center"> <img src="https://github.com/open-mmlab/mmeval/raw/main/docs/zh_cn/_static/image/mmeval-logo.png" height="100"/> <div>&nbsp;</div> <div align="center"> <b><font size="5">OpenMMLab website</font></b> <sup> <a href="https://openmmlab.com"> <i><font size="4">HOT</font></i> </a> </sup> &nbsp;&nbsp;&nbsp;&nbsp; <b><font size="5">OpenMMLab platform</font></b> <sup> <a href="https://platform.openmmlab.com"> <i><font size="4">TRY IT OUT</font></i> </a> </sup> </div> <div>&nbsp;</div>

PyPI - Python Version PyPI license open issues issue resolution

🤔Reporting Issues

</div> <div align="center">

English | 简体中文

</div>

Introduction

MMEval is a machine learning evaluation library that supports efficient and accurate distributed evaluation on a variety of machine learning frameworks.

Major features:

  • Comprehensive metrics for various computer vision tasks (NLP will be covered soon!)
  • Efficient and accurate distributed evaluation, backed by multiple distributed communication backends
  • Support multiple machine learning frameworks via dynamic input dispatching mechanism
<div align="center"> <img src="docs/zh_cn/_static/image/mmeval-arch.png" width="600"/> </div> <details> <summary> Supported distributed communication backends </summary>

| MPI4Py | torch.distributed | Horovod | paddle.distributed | oneflow.comm | | :------------------------------------------------------------------------------------------------------------------------------------------------: | :-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | :---------------------------------------------------------------------------------------------------------------------------------------------------------: | :------------------------------------------------------------------------------------------------------------------------------------------------: | :---------------------------------------------------------------------------------------------------------------------------------------------------: | | MPI4PyDist | TorchCPUDist <br> TorchCUDADist | TFHorovodDist | PaddleDist | OneFlowDist |

</details> <details> <summary> Supported metrics and ML frameworks </summary>

NOTE: MMEval tested with PyTorch 1.6+, TensorFlow 2.4+, Paddle 2.2+ and OneFlow 0.8+.

| Metric | numpy.ndarray | torch.Tensor | tensorflow.Tensor | paddle.Tensor | oneflow.Tensor | | :--------------------------------------------------------------------------------------------------------------------------------------------------------: | :-----------: | :----------: | :---------------: | :-----------: | :------------: | | Accuracy | ✔ | ✔ | ✔ | ✔ | ✔ | | SingleLabelMetric | ✔ | ✔ | | | ✔ | | MultiLabelMetric | ✔ | ✔ | | | ✔ | | AveragePrecision | ✔ | ✔ | | | ✔ | | MeanIoU | ✔ | ✔ | ✔ | ✔ | ✔ | | VOCMeanAP | ✔ | | | | | | OIDMeanAP | ✔ | | | | | | COCODetection | ✔ | | | | | | ProposalRecall | ✔ | | | | | | F1Score | ✔ | ✔ | | | ✔ | | HmeanIoU | ✔ | | | | | | PCKAccuracy | ✔ | | | | | | MpiiPCKAccuracy | ✔ | | | | | | JhmdbPCKAccuracy | ✔ | | | | | | EndPointError | ✔ | ✔ | | | ✔ | | AVAMeanAP | ✔ | | | | | | StructuralSimilarity | ✔ | | | | | | SignalNoiseRatio | ✔ | | | | | | PeakSignalNoiseRatio | ✔ | | | | |

Related Skills

View on GitHub
GitHub Stars269
CategoryEducation
Updated3mo ago
Forks50

Languages

Python

Security Score

97/100

Audited on Dec 26, 2025

No findings