SumCoT
[ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"
Install / Use
/learn @Alsace08/SumCoTREADME
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method (ACL'23 Long Paper)
<font color=green size=4><i>Let's elicit LLMs summarize step by step following the professional communication theory!</i></font>
In this work, you can use directly or get inspired by:
-
A fine-grained generic summary data annotation protocol (combining micro and macro demands)
-
An expert-aligned generic summary test set (rewrite CNN/DailyMail and BBC XSum)
-
An expandable CoT-based open-end generation path (not only SumCoT)
Element-aware Dataset
Annotation Statement
Our annotation protocol is mainly based on Lasswell Communication Model --- a famous communication theory proposed by Lasswell(1948). Additionally, we removed as much noise as possible from the original data set and performed data analysis (See paper for more details).
Case comparisons for our Element-aware summary and original dataset-specific summary:
<div align=center> <img src="./assets/Dataset.png" width="60%" align="center"></img> </div>Data Usage
-
See our constructed datasets in
/data. The twojsonfiles hold the data of CNN/DailyMail and BBC/XSum, respectively. -
Each
jsonfile contains 200 samples, each containing the source document, original summary, element-aware summary, GPT-3 summary (standard), and GPT-3 summary (under SumCoT). You can use these data for more fine-grained evaluation.
Dataset format:
{
"dataset_name": [
{
"id": 0,
"src": "xxx",
"original_summary": "xxx",
"element-aware_summary": "xxx",
"gpt3_summary": "xxx",
"gpt3_cot_summary": "xxx"
},
{
"id": 1,
"src": "xxx",
"original_summary": "xxx",
"element-aware_summary": "xxx",
"gpt3_summary": "xxx",
"gpt3_cot_summary": "xxx"
},
...
{
"id": 199,
"src": "xxx",
"original_summary": "xxx",
"element-aware_summary": "xxx",
"gpt3_summary": "xxx",
"gpt3_cot_summary": "xxx"
}
]
}
Summary Chain-of-Thought (SumCoT)
Pipeline
<div align=center> <img src="./assets/Method.png" width="80%"> </div>Code Usage
generation
If you want to generate summaries (w/ or w/o SumCoT) by GPT-3, run generation.py:
python generation.py \
--dataset cnndm
--start_id 0
--end_id 199
where cot_true indicates if you want to use the SumCoT technique for generation, start_id and end_id determine the range of source documents in the dataset that you want to request. If you only want to generate the summary of the $a$-th sample, set start_id=end_id=a.
To facilitate the subsequent evaluation, the summaries just generated will be stored into $dataset_name$_output.json.
Note: Don't forget to modify your openai_key in the Decoder().
evaluation
We also provide code in /evaluation/eva.py to reproduce the results:
python eva.py \
--dataset cnndm \
--start_id 0 \
--end_id 199 \
--bs_true false
where bs_true indicates if you want to use the bert score, start_id and end_id are the same as above.
You can also build your own dataset json file, adjust start_id and end_id to perform your own evaluation.
Reproducibility
Experimental results in this paper are obtained during October-December 2022 via the OpenAI API interface (basically before the ChatGPT release). If your reproduction results are slightly different from the results of the paper, you can report any of them (just indicate).
Citation
@inproceedings{wang-etal-2023-element,
title = "Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method",
author = "Wang, Yiming and
Zhang, Zhuosheng and
Wang, Rui",
booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = jul,
year = "2023",
address = "Toronto, Canada",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.acl-long.482",
doi = "10.18653/v1/2023.acl-long.482",
pages = "8640--8665",
}
Related Skills
node-connect
354.0kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
112.2kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
354.0kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
354.0kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
