Siamese neural networks: An overview

D Chicco - Artificial neural networks, 2021 - Springer
Similarity has always been a key aspect in computer science and statistics. Any time two
element vectors are compared, many different similarity approaches can be used …

Temporal modulations in speech and music

N Ding, AD Patel, L Chen, H Butler, C Luo… - … & Biobehavioral Reviews, 2017 - Elsevier
Speech and music have structured rhythms. Here we discuss a major acoustic correlate of
spoken and musical rhythms, the slow (0.25–32 Hz) temporal modulations in sound intensity …

The discriminative lexicon: A unified computational model for the lexicon and lexical processing in comprehension and production grounded not in (de)composition …

RH Baayen, YY Chuang, E Shafaei-Bajestan… - …, 2019 - Wiley Online Library
The discriminative lexicon is introduced as a mathematical and computational model of the
mental lexicon. This novel theory is inspired by word and paradigm morphology but …

A cross-language perspective on speech information rate

F Pellegrino, C Coupé, E Marsico - Language, 2011 - JSTOR
This article is a crosslinguistic investigation of the hypothesis that the average information
rate conveyed during speech communication results from a trade-off between average …

[CITATION][C] The Emergence of Distinctive Features

J Mielke - 2008 - books.google.com
This book makes a fundamental contribution to phonology, linguistic typology, and the
nature of the human language faculty. Distinctive features in phonology distinguish one …

Augmented datasheets for speech datasets and ethical decision-making

O Papakyriakopoulos, ASG Choi, W Thong… - Proceedings of the …, 2023 - dl.acm.org
Speech datasets are crucial for training Speech Language Technologies (SLT); however,
the lack of diversity of the underlying training data can lead to serious limitations in building …

Why reduce? Phonological neighborhood density and phonetic reduction in spontaneous speech

S Gahl, Y Yao, K Johnson - Journal of memory and language, 2012 - Elsevier
Frequent or contextually predictable words are often phonetically reduced, i.e. shortened and
produced with articulatory undershoot. Explanations for phonetic reduction of predictable …

Speech self-supervised representation benchmarking: Are we doing it right?

S Zaiem, Y Kemiche, T Parcollet, S Essid… - arXiv preprint arXiv …, 2023 - arxiv.org
Self-supervised learning (SSL) has recently allowed leveraging large datasets of unlabeled
speech signals to reach impressive performance on speech tasks using only small amounts …

[CITATION][C] Phonetic Analysis of Speech Corpora

J Harrington - 2010 - books.google.com
An accessible introduction to the phonetic analysis of speech corpora, this workbook-style
text provides an extensive set of exercises to help readers develop the necessary skills to …

Transcribing talk and interaction

C Jenks - Transcribing Talk and Interaction, 2011 - torrossa.com
This book has been written because I believe researchers and postgraduate students who
are beginning to explore the world of spoken discourse analysis have few publications to …