SkillAgentSearch skills...

Geval

Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"

Install / Use

/learn @nlpyang/Geval
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment" [https://arxiv.org/abs/2303.16634]

Experiments on SummEval dataset

Evaluate fluency on SummEval dataset

python .\gpt4_eval.py --prompt .\prompts\summeval\flu_detailed.txt --save_fp .\results\gpt4_flu_detailed.json --summeval_fp .\data\summeval.json --key XXXXX

Meta Evaluate the G-Eval results

python .\meta_eval_summeval.py --input_fp .\results\gpt4_flu_detailed.json --dimension fluency

Prompts and Evaluation Results

Prompts used to evaluate SummEval are in prompts/summeval

G-eval results on SummEval are in results

Related Skills

View on GitHub
GitHub Stars415
CategoryDevelopment
Updated37m ago
Forks45

Languages

Python

Security Score

95/100

Audited on Apr 6, 2026

No findings