Interpreting pretrained contextualized representations via reductions to static embeddings

R Bommasani, K Davis, C Cardie - … of the 58th Annual Meeting of …, 2020 - aclanthology.org
Contextualized representations (eg ELMo, BERT) have become the default pretrained
representations for downstream NLP applications. In some settings, this transition has …

N2d:(not too) deep clustering via clustering the local manifold of an autoencoded embedding

R McConville, R Santos-Rodriguez… - 2020 25th …, 2021 - ieeexplore.ieee.org
Deep clustering has increasingly been demonstrating superiority over conventional shallow
clustering algorithms. Deep clustering algorithms usually combine representation learning …

Learning to remove: Towards isotropic pre-trained bert embedding

Y Liang, R Cao, J Zheng, J Ren, L Gao - … 14–17, 2021, Proceedings, Part V …, 2021 - Springer
Research in word representation shows that isotropic embeddings can significantly improve
performance on downstream tasks. However, we measure and analyze the geometry of pre …

Sentence representation with manifold learning for biomedical texts

D Zhao, J Wang, H Lin, Y Chu, Y Wang, Y Zhang… - Knowledge-Based …, 2021 - Elsevier
Sentence representation approaches based on deep learning have become a major part of
natural language processing, and pretrained sentences have wide applications in …

Ultra-fine entity typing with prior knowledge about labels: A simple clustering based strategy

N Li, Z Bouraoui, S Schockaert - arXiv preprint arXiv:2305.12802, 2023 - arxiv.org
Ultra-fine entity typing (UFET) is the task of inferring the semantic types, from a large set of
fine-grained candidates, that apply to a given entity mention. This task is especially …

Manifold-Based Verbalizer Space Re-embedding for Tuning-Free Prompt-Based Classification

H Wang, S Zhao, C Liu, N Xi, M Cai, B Qin… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Prompt-based classification adapts tasks to a cloze question format utilizing the [MASK]
token and the filled tokens are then mapped to labels through pre-defined verbalizers …

Deep clustering bearing fault diagnosis method based on local manifold learning of an autoencoded embedding

J An, P Ai, C Liu, S Xu, D Liu - IEEE Access, 2021 - ieeexplore.ieee.org
In many practical fault diagnosis applications, the acquisition of fault data labels requires
substantial manpower and resources, which are sometimes impossible to achieve. To …

Refining electronic medical records representation in manifold subspace

B Wang, Y Sun, Y Chu, D Zhao, Z Yang, J Wang - BMC bioinformatics, 2022 - Springer
Background Electronic medical records (EMR) contain detailed information about patient
health. Developing an effective representation model is of great significance for the …

Global-locality preserving projection for word embedding

B Wang, Y Sun, Y Chu, Z Yang, H Lin - International Journal of Machine …, 2022 - Springer
Pre-trained word embedding has a significant impact on constructing representations for
sentences, paragraphs and documents. However, existing word embedding methods are …

Matrix factorisation and the interpretation of geodesic distance

N Whiteley, A Gray… - Advances in Neural …, 2021 - proceedings.neurips.cc
Given a graph or similarity matrix, we consider the problem of recovering a notion of true
distance between the nodes, and so their true positions. We show that this can be …