The dimensions of data labor: A road map for researchers, activists, and policymakers to empower data producers

H Li, N Vincent, S Chancellor, B Hecht - … of the 2023 ACM conference on …, 2023 - dl.acm.org
Many recent technological advances (eg ChatGPT and search engines) are possible only
because of massive amounts of user-generated data produced through user interactions …

Improving wikipedia verifiability with ai

F Petroni, S Broscheit, A Piktus, P Lewis… - Nature Machine …, 2023 - nature.com
Verifiability is a core content policy of Wikipedia: claims need to be backed by citations.
Maintaining and improving the quality of Wikipedia references is an important challenge and …

COVID-19 research in Wikipedia

G Colavizza - Quantitative science studies, 2020 - direct.mit.edu
Wikipedia is one of the main sources of free knowledge on the Web. During the first few
months of the pandemic, over 5,200 new Wikipedia pages on COVID-19 were created …

Users meet clarifying questions: Toward a better understanding of user interactions for search clarification

J Zou, M Aliannejadi, E Kanoulas, MS Pera… - ACM Transactions on …, 2023 - dl.acm.org
The use of clarifying questions (CQs) is a fairly new and useful technique to aid systems in
recognizing the intent, context, and preferences behind user queries. Yet, understanding the …

Wikipedia citations: A comprehensive data set of citations with identifiers extracted from English Wikipedia

H Singh, R West, G Colavizza - Quantitative Science Studies, 2021 - direct.mit.edu
Wikipedia's content is based on reliable and published sources. To this date, relatively little
is known about what sources Wikipedia relies on, in part because extracting citations and …

An analysis of content gaps versus user needs in the wikidata knowledge graph

D Abián, A Meroño-Peñuela, E Simperl - International Semantic Web …, 2022 - Springer
Content gaps in knowledge graphs impact downstream applications. Semantic Web
researchers have studied them mainly in relation to data quality or ontology evaluation, for …

Identifying and characterizing social media communities: a socio-semantic network approach to altmetrics

W Arroyo-Machado, D Torres-Salinas… - Scientometrics, 2021 - Springer
Altmetric indicators allow exploring and profiling individuals who discuss and share scientific
literature in social media. But it is still a challenge to identify and characterize communities …

A large-scale characterization of how readers browse Wikipedia

T Piccardi, M Gerlach, A Arora, R West - ACM Transactions on the Web, 2023 - dl.acm.org
Despite the importance and pervasiveness of Wikipedia as one of the largest platforms for
open knowledge, surprisingly little is known about how people navigate its content when …

On the Value of Wikipedia as a Gateway to the Web

T Piccardi, M Redi, G Colavizza, R West - Proceedings of the Web …, 2021 - dl.acm.org
By linking to external websites, Wikipedia can act as a gateway to the Web. To date,
however, little is known about the amount of traffic generated by Wikipedia's external links …

Modeling popularity and reliability of sources in multilingual Wikipedia

W Lewoniewski, K Węcel, W Abramowicz - Information, 2020 - mdpi.com
One of the most important factors impacting quality of content in Wikipedia is presence of
reliable sources. By following references, readers can verify facts or find more details about …