Wikisum: Coherent summarization dataset for efficient human-evaluation

N Cohen, O Kalinsky, Y Ziser… - Proceedings of the 59th …, 2021 - aclanthology.org
Abstract
Recent work has made significant advances on summarization tasks, facilitated by summarization datasets. Several existing datasets take the form of coherent-paragraph summaries. However, these datasets were curated from academic documents written for experts, which makes the essential step of assessing summarization output through human evaluation very demanding. To overcome this limitation, we present a dataset based on article summaries appearing on the WikiHow website, composed of how-to articles and coherent-paragraph summaries written in plain language. We compare our dataset's attributes to those of existing ones, including readability and world knowledge, showing that our dataset makes human evaluation significantly easier and thus more effective. A human evaluation conducted on PubMed and the proposed dataset reinforces our findings.