Specter: Document-level representation learning using citation-informed transformers

A Cohan, S Feldman, I Beltagy, D Downey… - arXiv preprint arXiv …, 2020 - arxiv.org
Representation learning is a critical ingredient for natural language processing systems.
Recent Transformer language models like BERT learn powerful textual representations, but …

Club: A contrastive log-ratio upper bound of mutual information

P Cheng, W Hao, S Dai, J Liu… - … on machine learning, 2020 - proceedings.mlr.press
Mutual information (MI) minimization has gained considerable interests in various machine
learning tasks. However, estimating and minimizing MI in high-dimensional spaces remains …

Beyond the overlapping users: Cross-domain recommendation via adaptive anchor link learning

Y Zhao, C Li, J Peng, X Fang, F Huang… - Proceedings of the 46th …, 2023 - dl.acm.org
Cross-Domain Recommendation (CDR) is capable of incorporating auxiliary information
from multiple domains to advance recommendation performance. Conventional CDR …

A trainable optimal transport embedding for feature aggregation and its relationship to attention

G Mialon, D Chen, A d'Aspremont, J Mairal - arXiv preprint arXiv …, 2020 - arxiv.org
We address the problem of learning on sets of features, motivated by the need of performing
pooling operations in long biological sequences of varying sizes, with long-range …

TOT: topology-aware optimal transport for multimodal hate detection

L Zhang, L Jin, X Sun, G Xu, Z Zhang, X Li… - Proceedings of the …, 2023 - ojs.aaai.org
Multimodal hate detection, which aims to identify the harmful content online such as memes,
is crucial for building a wholesome internet environment. Previous work has made …

Pass: Personalized advertiser-aware sponsored search

Z Tian, C Li, Z Zuo, Z Wen, L Sun, X Hu… - Proceedings of the 29th …, 2023 - dl.acm.org
The nucleus of online sponsored search systems lies in measuring the relevance between
the search intents of users and the advertising purposes of advertisers. Existing …

Textomics: A dataset for genomics data summary generation

MC Wang, Z Liu, S Wang - … of the 60th Annual Meeting of the …, 2022 - aclanthology.org
Summarizing biomedical discovery from genomics data using natural languages is an
essential step in biomedical research but is mostly done manually. Here, we introduce …

Event Causality Extraction via Implicit Cause-Effect Interactions

J Liu, Z Zhang, K Wei, Z Guo, X Sun… - Proceedings of the …, 2023 - aclanthology.org
Abstract Event Causality Extraction (ECE) aims to extract the cause-effect event pairs from
the given text, which requires the model to possess a strong reasoning ability to capture …

Generating temporally-ordered event sequences via event optimal transport

B Zhou, Y Chen, K Liu, J Zhao, J Xu… - Proceedings of the …, 2022 - aclanthology.org
Generating temporally-ordered event sequences in texts is important to natural language
processing. Two emerging tasks in this direction are temporal event ordering (rearranging …

Topic Modeling on Document Networks with Dirichlet Optimal Transport Barycenter

DC Zhang, HW Lauw - IEEE Transactions on Knowledge and …, 2023 - ieeexplore.ieee.org
Text documents are often interconnected in a network structure, eg, academic papers via
citations, Web pages via hyperlinks. On the one hand, though Graph Neural Networks …