10 skills found
uptrain-ai / Uptrain: UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, and embedding use cases), perform root cause analysis on failure cases, and give insights on how to resolve them.
celiavelmar / Open Covid19 Test: An open-source COVID-19 self-evaluation test that gives you the same results as coronamadrid.com but does not store your valuable data.
PremierLangage / Premierlangage: Server for auto-evaluating exercises
amazon-science / BeyondCorrelation: Implementation of the paper "Beyond Correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge"
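vivacious1024 / BIT AutoEvaluation: Automatic teaching-evaluation program for BIT (Beijing Institute of Technology), written in Kotlin, with a converted .exe installer included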
Wind-Gone / ECNU AutoEvaluation: ECNU automatic teaching-evaluation script
jchiquet / Quarto Hceres: Quarto template for HCERES self-evaluation reports
AdenCJM / AutoEvaluation: An autonomous optimisation engine that makes any set of LLM instructions measurably better without a human in the loop.
chenkangyang / AutoEvaluationJS: JS script for automatically scoring teachers on the Zhengfang (正方) system
vicgalle / Autocrit Likert Gpt: Automatic and zero-shot critique of outputs using the OpenAI API with JSON outputs