7 skills found
open-compass / OpencompassOpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
MigoXLab / DingoDingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
SmartFlowAI / Llama3 TutorialLlama3-Tutorial(XTuner、LMDeploy、OpenCompass)
open-compass / CompassJudgerThe All-in-one Judge Models introduced by Opencompass
AISBench / BenchmarkAISBench Benchmark is a model evaluation tool built on OpenCompass, compatible with OpenCompass’s configuration system, dataset structure, and model backend implementation, while extending support for service-based models.
domonic18 / AI Eval System这是一个基于OpenCompass的模型评测系统,该系统提供了前端页面UI以方便用户自助开展评测工作。
little1d / Hands On OpenCompassNo description available