P Yu, H Fei, P Li - Proceedings of the Web Conference 2021, 2021 - dl.acm.org
Existing research on cross-lingual retrieval cannot take good advantage of large-scale pretrained language models such as multilingual BERT and XLM. We hypothesize that the …
Wikipedia is one of the most visited sites on the Web and a common source of information for many users. As an encyclopedia, Wikipedia was not conceived as a source of original …
Social biases on Wikipedia, a widely-read global platform, could greatly influence public opinion. While prior research has examined man/woman gender bias in biography articles …
Despite the importance and pervasiveness of Wikipedia as one of the largest platforms for open knowledge, surprisingly little is known about how people navigate its content when …
A major challenge for many analyses of Wikipedia dynamics—eg, imbalances in content quality, geographic differences in what content is popular, what types of articles attract more …
Learning semantic representations of documents is essential for various downstream applications, including text classification and information retrieval. Entities, as important …
Wikipedia is edited by volunteer editors around the world. Considering the large amount of existing content (eg over 5M articles in English Wikipedia), deciding what to edit next can be …
There has recently been much interest in extending vector-based word representations to multiple languages, such that words can be compared across languages. In this paper, we …
This Open-Access-book addresses the issue of translating mathematical expressions from LaTeX to the syntax of Computer Algebra Systems (CAS). Over the past decades, especially …