Efficient attentions for long document summarization

J Li, T Tang, WX Zhao, JY Nie, JR Wen - ACM Computing Surveys, 2024 - dl.acm.org

Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …

被引用次数：394 相关文章所有 7 个版本

[PDF] arxiv.org

An empirical survey on long document summarization: Datasets, models, and metrics

HY Koh, J Ju, M Liu, S Pan - ACM computing surveys, 2022 - dl.acm.org

Long documents such as academic articles and business reports have been the standard
format to detail out important issues and complicated subjects that require extra attention. An …

被引用次数：121 相关文章所有 7 个版本

[PDF] neurips.cc

Unlimiformer: Long-range transformers with unlimited length input

A Bertsch, U Alon, G Neubig… - Advances in Neural …, 2024 - proceedings.neurips.cc

Since the proposal of transformers, these models have been limited to bounded input
lengths, because of their need to attend to every token in the input. In this work, we propose …

被引用次数：119 相关文章所有 7 个版本

[PDF] arxiv.org

Yarn: Efficient context window extension of large language models

B Peng, J Quesnelle, H Fan, E Shippole - arXiv preprint arXiv:2309.00071, 2023 - arxiv.org

Rotary Position Embeddings (RoPE) have been shown to effectively encode positional
information in transformer-based language models. However, these models fail to …

被引用次数：248 相关文章所有 5 个版本

[PDF] aclanthology.org

∞ Bench: Extending long context evaluation beyond 100k tokens

X Zhang, Y Chen, S Hu, Z Xu, J Chen… - Proceedings of the …, 2024 - aclanthology.org

Processing and reasoning over long contexts is crucial for many practical applications of
Large Language Models (LLMs), such as document comprehension and agent construction …

被引用次数：29 相关文章

[PDF] aaai.org

Dialoglm: Pre-trained model for long dialogue understanding and summarization

M Zhong, Y Liu, Y Xu, C Zhu, M Zeng - Proceedings of the AAAI …, 2022 - ojs.aaai.org

Dialogue is an essential part of human communication and cooperation. Existing research
mainly focuses on short dialogue scenarios in a one-on-one fashion. However, multi-person …

被引用次数：134 相关文章所有 6 个版本

Deep reinforcement and transfer learning for abstractive text summarization: A review

A Alomari, N Idris, AQM Sabri, I Alsmadi - Computer Speech & Language, 2022 - Elsevier

Abstract Automatic Text Summarization (ATS) is an important area in Natural Language
Processing (NLP) with the goal of shortening a long text into a more compact version by …

被引用次数：139 相关文章所有 4 个版本

[PDF] arxiv.org

LongEval: Guidelines for human evaluation of faithfulness in long-form summarization

K Krishna, E Bransom, B Kuehl, M Iyyer… - arXiv preprint arXiv …, 2023 - arxiv.org

While human evaluation remains best practice for accurately judging the faithfulness of
automatically-generated summaries, few solutions exist to address the increased difficulty …

被引用次数：69 相关文章所有 8 个版本

[PDF] arxiv.org

A survey on long text modeling with transformers

Z Dong, T Tang, L Li, WX Zhao - arXiv preprint arXiv:2302.14502, 2023 - arxiv.org

Modeling long texts has been an essential technique in the field of natural language
processing (NLP). With the ever-growing number of long documents, it is important to …

被引用次数：54 相关文章所有 2 个版本

[PDF] arxiv.org

Summ^ n: A multi-stage summarization framework for long input dialogues and documents

Y Zhang, A Ni, Z Mao, CH Wu, C Zhu, B Deb… - arXiv preprint arXiv …, 2021 - arxiv.org

Text summarization helps readers capture salient information from documents, news,
interviews, and meetings. However, most state-of-the-art pretrained language models (LM) …

被引用次数：101 相关文章所有 7 个版本

高级搜索

QQ 群