A survey of machine learning for big code and naturalness

M Allamanis, ET Barr, P Devanbu… - ACM Computing Surveys …, 2018 - dl.acm.org
Research at the intersection of machine learning, programming languages, and software
engineering has recently taken important steps in proposing learnable probabilistic models …

Unsupervised learning of sentence embeddings using compositional n-gram features

M Pagliardini, P Gupta, M Jaggi - arXiv preprint arXiv:1703.02507, 2017 - arxiv.org
The recent tremendous success of unsupervised word embeddings in a multitude of
applications raises the obvious question if similar methods could be derived to improve …

[PDF][PDF] Combining Knowledge with Deep Convolutional Neural Networks for Short Text Classification.

J Wang, Z Wang, D Zhang, J Yan - IJCAI, 2017 - ijcai.org
Text classification is a fundamental task in NLP applications. Most existing work relied on
either explicit or implicit text representation to address this problem. While these techniques …

emoji2vec: Learning emoji representations from their description

B Eisner, T Rocktäschel, I Augenstein… - arXiv preprint arXiv …, 2016 - arxiv.org
Many current natural language processing applications for social media rely on
representation learning and utilize pre-trained word embeddings. There currently exist …

Learning distributed representations of sentences from unlabelled data

F Hill, K Cho, A Korhonen - arXiv preprint arXiv:1602.03483, 2016 - arxiv.org
Unsupervised methods for learning distributed representations of words are ubiquitous in
today's NLP research, but far less is known about the best ways to learn distributed phrase …

Applying pragmatics principles for interaction with visual analytics

E Hoque, V Setlur, M Tory… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Interactive visual data analysis is most productive when users can focus on answering the
questions they have about their data, rather than focusing on how to operate the interface to …

Semeval-2022 Task 1: CODWOE--Comparing Dictionaries and Word Embeddings

T Mickus, K Van Deemter, M Constant… - arXiv preprint arXiv …, 2022 - arxiv.org
Word embeddings have advanced the state of the art in NLP across numerous tasks.
Understanding the contents of dense neural representations is of utmost interest to the …

Beyond word embeddings: A survey

F Incitti, F Urli, L Snidaro - Information Fusion, 2023 - Elsevier
The goal of this paper is to provide an overview of the methods that allow text
representations with a focus on embeddings for text of different lengths, specifically on works …

[PDF][PDF] Definition modeling: literature review and dataset analysis

N Gardner, H Khan, CC Hung - Applied Computing and Intelligence, 2022 - aimspress.com
Definition modeling, the task of generating a definition for a given term, is a relatively new
area of research applied in evaluating word embeddings. Automatic generation of dictionary …

Multi-simlex: A large-scale evaluation of multilingual and crosslingual lexical semantic similarity

I Vulić, S Baker, EM Ponti, U Petti, I Leviant… - Computational …, 2020 - direct.mit.edu
Abstract We introduce Multi-SimLex, a large-scale lexical resource and evaluation
benchmark covering data sets for 12 typologically diverse languages, including major …