[PDF][PDF] Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

N Reimers - arXiv preprint arXiv:1908.10084, 2019 - fq.pkwyx.com
BERT (Devlin et al., 2018) and RoBERTa (Liu et al., 2019) has set a new state-of-the-art
performance on sentence-pair regression tasks like semantic textual similarity (STS) …

An autonomous debating system

N Slonim, Y Bilu, C Alzate, R Bar-Haim, B Bogin… - Nature, 2021 - nature.com
Artificial intelligence (AI) is defined as the ability of machines to perform tasks that are
usually associated with intelligent beings. Argument and debate are fundamental …

Condenser: a pre-training architecture for dense retrieval

L Gao, J Callan - arXiv preprint arXiv:2104.08253, 2021 - arxiv.org
Pre-trained Transformer language models (LM) have become go-to text representation
encoders. Prior research fine-tunes deep LMs to encode text sequences such as sentences …

Scalable hierarchical agglomerative clustering

N Monath, KA Dubey, G Guruganesh… - Proceedings of the 27th …, 2021 - dl.acm.org
The applicability of agglomerative clustering, for inferring both hierarchical and flat
clustering, is limited by its scalability. Existing scalable hierarchical clustering methods …

Sentence similarity based on contexts

X Sun, Y Meng, X Ao, F Wu, T Zhang, J Li… - Transactions of the …, 2022 - direct.mit.edu
Existing methods to measure sentence similarity are faced with two challenges:(1) labeled
datasets are usually limited in size, making them insufficient to train supervised neural …

Structural text segmentation of legal documents

D Aumiller, S Almasian, S Lackner… - Proceedings of the …, 2021 - dl.acm.org
The growing complexity of legal cases has lead to an increasing interest in legal information
retrieval systems that can effectively satisfy user-specific information needs. However, such …

Hierarchical multi-label text classification with horizontal and vertical category correlations

L Xu, S Teng, R Zhao, J Guo, C Xiao… - Proceedings of the …, 2021 - aclanthology.org
Hierarchical multi-label text classification (HMTC) deals with the challenging task where an
instance can be assigned to multiple hierarchically structured categories at the same time …

Exploring topic modelling for generalising design requirements in complex design

C Chen, B Morkos - Journal of Engineering Design, 2023 - Taylor & Francis
As the redesign process progresses in product lifecycle management, effectively managing
engineering changes becomes increasingly challenging, often leading to catastrophic and …

[PDF][PDF] Is your language model ready for dense representation fine-tuning

L Gao, J Callan - arXiv preprint arXiv:2104.08253, 2021 - boston.lti.cs.cmu.edu
Pre-trained language models (LM) have become go-to text representation encoders. Prior
research used deep LMs to encode text sequences such as sentences and passages into …

Semantic-Aware Contrastive Sentence Representation Learning with Large Language Models

H Wang, L Cheng, Z Li, DW Soh, L Bing - arXiv preprint arXiv:2310.10962, 2023 - arxiv.org
Contrastive learning has been proven to be effective in learning better sentence
representations. However, to train a contrastive learning model, large numbers of labeled …