Pre-trained language models for text generation: A survey

J Li, T Tang, WX Zhao, JY Nie, JR Wen - ACM Computing Surveys, 2024 - dl.acm.org
Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …

An empirical survey on long document summarization: Datasets, models, and metrics

HY Koh, J Ju, M Liu, S Pan - ACM computing surveys, 2022 - dl.acm.org
Long documents such as academic articles and business reports have been the standard
format to detail out important issues and complicated subjects that require extra attention. An …

Unlimiformer: Long-range transformers with unlimited length input

A Bertsch, U Alon, G Neubig… - Advances in Neural …, 2024 - proceedings.neurips.cc
Since the proposal of transformers, these models have been limited to bounded input
lengths, because of their need to attend to every token in the input. In this work, we propose …

Yarn: Efficient context window extension of large language models

B Peng, J Quesnelle, H Fan, E Shippole - arXiv preprint arXiv:2309.00071, 2023 - arxiv.org
Rotary Position Embeddings (RoPE) have been shown to effectively encode positional
information in transformer-based language models. However, these models fail to …

∞ Bench: Extending long context evaluation beyond 100k tokens

X Zhang, Y Chen, S Hu, Z Xu, J Chen… - Proceedings of the …, 2024 - aclanthology.org
Processing and reasoning over long contexts is crucial for many practical applications of
Large Language Models (LLMs), such as document comprehension and agent construction …

Dialoglm: Pre-trained model for long dialogue understanding and summarization

M Zhong, Y Liu, Y Xu, C Zhu, M Zeng - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Dialogue is an essential part of human communication and cooperation. Existing research
mainly focuses on short dialogue scenarios in a one-on-one fashion. However, multi-person …

Deep reinforcement and transfer learning for abstractive text summarization: A review

A Alomari, N Idris, AQM Sabri, I Alsmadi - Computer Speech & Language, 2022 - Elsevier
Abstract Automatic Text Summarization (ATS) is an important area in Natural Language
Processing (NLP) with the goal of shortening a long text into a more compact version by …

LongEval: Guidelines for human evaluation of faithfulness in long-form summarization

K Krishna, E Bransom, B Kuehl, M Iyyer… - arXiv preprint arXiv …, 2023 - arxiv.org
While human evaluation remains best practice for accurately judging the faithfulness of
automatically-generated summaries, few solutions exist to address the increased difficulty …

A survey on long text modeling with transformers

Z Dong, T Tang, L Li, WX Zhao - arXiv preprint arXiv:2302.14502, 2023 - arxiv.org
Modeling long texts has been an essential technique in the field of natural language
processing (NLP). With the ever-growing number of long documents, it is important to …

Summ^ n: A multi-stage summarization framework for long input dialogues and documents

Y Zhang, A Ni, Z Mao, CH Wu, C Zhu, B Deb… - arXiv preprint arXiv …, 2021 - arxiv.org
Text summarization helps readers capture salient information from documents, news,
interviews, and meetings. However, most state-of-the-art pretrained language models (LM) …