EditEval: An instruction-based benchmark for text improvements

J Dwivedi-Yu, T Schick, Z Jiang, M Lomeli… - arXiv preprint arXiv …, 2022 - arxiv.org
Evaluation of text generation to date has primarily focused on content created sequentially,
rather than improvements on a piece of text. Writing, however, is naturally an iterative and …

INFOSYNC: Information Synchronization across Multilingual Semi-structured Tables

S Khincha, C Jain, V Gupta, T Kataria… - arXiv preprint arXiv …, 2023 - arxiv.org
Information Synchronization of semi-structured data across languages is challenging. For
instance, Wikipedia tables in one language should be synchronized across languages. To …

From nuisance to news sense: Augmenting the news with cross-document evidence and context

J Milbauer, Z Ding, Z Wu, T Wu - arXiv preprint arXiv:2310.04592, 2023 - arxiv.org
Reading and understanding the stories in the news is increasingly difficult. Reporting on
stories evolves rapidly, politicized news venues offer different perspectives (and sometimes …

Identifying Informational Sources in News Articles

A Spangher, N Peng, J May, E Ferrara - arXiv preprint arXiv:2305.14904, 2023 - arxiv.org
News articles are driven by the informational sources journalists use in reporting. Modeling
when, how and why sources get used together in stories can help us better understand the …

NewsSense: Reference-free Verification via Cross-document Comparison

J Milbauer, Z Ding, Z Wu, T Wu - Proceedings of the 2023 …, 2023 - aclanthology.org
We present NewsSense, a novel sensemaking tool and reading interface designed to collect
and integrate information from multiple news articles on a central topic. NewsSense …

EDIS: Entity-Driven Image Search over Multimodal Web Content

S Liu, W Feng, T Fu, W Chen, WY Wang - arXiv preprint arXiv:2305.13631, 2023 - arxiv.org
Making image retrieval methods practical for real-world search applications requires
significant progress in dataset scales, entity comprehension, and multimodal information …

A System to Support Readers in Automatically Acquiring Complete Summarized Information on an Event from Different Sources

P Dell'Oglio, A Bondielli, F Marcelloni - Algorithms, 2023 - mdpi.com
Today, most newspapers utilize social media to disseminate news. On the one hand, this
results in an overload of related articles for social media users. On the other hand, since …

Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Revision

Q Ruan, I Kuznetsov, I Gurevych - arXiv preprint arXiv:2406.00197, 2024 - arxiv.org
Collaborative review and revision of textual documents is the core of knowledge work and a
promising target for empirical analysis and NLP assistance. Yet, a holistic framework that …

Tracking the Newsworthiness of Public Documents

A Spangher, E Ferrara, B Welsh, N Peng… - arXiv preprint arXiv …, 2023 - arxiv.org
Journalists must find stories in huge amounts of textual data (e.g. leaks, bills, press releases)
as part of their jobs: determining when and why text becomes news can help us understand …

XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates

H Zhang, H Iso, S Gurajada, N Bhutani - arXiv preprint arXiv:2309.11063, 2023 - arxiv.org
Text editing is a crucial task that involves modifying text to better align with user intents.
However, existing text editing benchmark datasets have limitations in providing only coarse …