Secure your voice: An oral airflow-based continuous liveness detection for voice assistants

Y Wang, W Cai, T Gu, W Shao, Y Li, Y Yu - Proceedings of the ACM on …, 2019 - dl.acm.org
Voice control has attracted extensive attention recently as it is a prospective User Interface
(UI) to substitute for conventional touch control on smart devices. Voice assistants have …

Understanding self-attention of self-supervised audio transformers

S Yang, AT Liu, H Lee - arXiv preprint arXiv:2006.03265, 2020 - arxiv.org
Self-supervised Audio Transformers (SAT) enable great success in many downstream
speech applications like ASR, but how they work has not been widely explored yet. In this …

Casting to corpus: Segmenting and selecting spontaneous dialogue for TTS with a CNN-LSTM speaker-dependent breath detector

É Székely, GE Henter… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
This paper considers utilising breaths to create improved spontaneous-speech corpora for
conversational text-to-speech from found audio recordings such as dialogue podcasts …

A Simple HMM with Self-Supervised Representations for Phone Segmentation

GP Yang, H Tang - 2024 IEEE Spoken Language Technology …, 2024 - ieeexplore.ieee.org
Despite the recent advance in self-supervised representations, unsupervised phonetic
segmentation remains challenging. Most approaches focus on improving phonetic …

Unsupervised segmentation of speech signals using kernel-gram matrices

S Bhati, S Nayak, K Sri Rama Murty - … 2017, Mandi, India, December 16-19 …, 2018 - Springer
The objective of this paper is to develop an unsupervised method for segmentation of
speech signals into phoneme-like units. The proposed algorithm is based on the …

Unsupervised speech signal-to-symbol transformation for language identification

S Bhati, S Nayak, SRM Kodukula - Circuits, Systems, and Signal …, 2020 - Springer
This paper presents a new approach for unsupervised segmentation and labeling of
acoustically homogeneous segments from the speech signals. The virtual labels, thus …

Phoneme segmentation-based unsupervised pattern discovery and clustering of speech signals

KK Ravi, SR Krothapalli - Circuits, Systems, and Signal Processing, 2022 - Springer
This paper proposes a new method that detects the repeated keyword/phrase patterns from
speech utterances by performing pattern discovery at the phoneme level. Prior to this …

UNSUPERVISED SEGMENTAL MODELING OF SPEECH FOR LOW RESOURCE APPLICATIONS

S Bhati - 2023 - jscholarship.library.jhu.edu
Voice-enabled interfaces for human-machine interaction have made significant progress in
recent years. Most of the success can be attributed to deep neural networks trained on …

Estimation of musical features using EEG signals

Ç Demirel, UC Akkaya, M Yalçin… - 2018 26th Signal …, 2018 - ieeexplore.ieee.org
Nowadays with the recent development in Brain Computer Interfaces (BCI), research field
branches to arts and especially to music. In our study, a system is developed which analyses …

Dynamic scale adaptation algorithm of image etalon functions

M Gavrikov, R Sinetsky - 2022 - repo.bibliothek.uni-halle.de
An algorithm for large-scale adaptation of prototype functions representing image classes is
proposed. The algorithm identifies the parameters of nonlinear scale distortions contained in …