Offline dominance and zeugmatic similarity normings of variably ambiguous words assessed against a neural language model (BERT)

KA DeLong, S Trott, M Kutas - Behavior research methods, 2023 - Springer
Behavior research methods, 2023Springer
For any research program examining how ambiguous words are processed in broader
linguistic contexts, a first step is to establish factors relating to the frequency balance or
dominance of those words' multiple meanings, as well as the similarity of those meanings to
one other. Homonyms—words with divergent meanings—are one ambiguous word type
commonly utilized in psycholinguistic research. In contrast, although polysemes—words with
multiple related senses—are far more common in English, they have been less frequently …
Abstract
For any research program examining how ambiguous words are processed in broader linguistic contexts, a first step is to establish factors relating to the frequency balance or dominance of those words’ multiple meanings, as well as the similarity of those meanings to one other. Homonyms—words with divergent meanings—are one ambiguous word type commonly utilized in psycholinguistic research. In contrast, although polysemes—words with multiple related senses—are far more common in English, they have been less frequently used as tools for understanding one-to-many word-to-meaning mappings. The current paper details two norming studies of a relatively large number of ambiguous English words. In the first, offline dominance norming is detailed for 547 homonyms and polysemes via a free association task suitable for words across the ambiguity continuum, with a goal of identifying words with more equibiased meanings. The second norming assesses offline meaning similarity for a partial subset of 318 ambiguous words (including homonyms, unambiguous words, and polysemes divided into regular and irregular types) using a novel, continuous rating method reliant on the linguistic phenomenon of zeugma. In addition, we conduct computational analyses on the human similarity norming data using the BERT pretrained neural language model (Devlin et al., , BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv Preprint. arXiv:1810.04805) to evaluate factors that may explain variance beyond that accounted for by dictionary-criteria ambiguity categories. Finally, we make available the summarized item dominance values and similarity ratings in resultant appendices (see supplementary material), as well as individual item and participant norming data, which can be accessed online (https://osf.io/g7fmv/).
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果