Speaker change detection using fundamental frequency with application to multi-talker segmentation

AOT Hogg, C Evers, PA Naylor - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
This paper shows that time varying pitch properties can be used advantageously within the
segmentation step of a multi-talker diarization system. First a study is conducted to verify that …

Method and system for conversation transcription with metadata

KL Bradley, E Coeytaux, YIN Ziming - US Patent 12,020,708, 2024 - Google Patents
Methods and systems for enabling an efficient review of meeting content via a metadata-
enriched, speaker-attributed transcript are disclosed. By incorporating speaker diarization …

Overlapping speaker segmentation using multiple hypothesis tracking of fundamental frequency

AOT Hogg, C Evers, AH Moore… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
This paper demonstrates how the harmonic structure of voiced speech can be exploited to
segment multiple overlapping speakers in a speaker diarization task. We explore how a …

Multiple hypothesis tracking for overlapping speaker segmentation

AOT Hogg, C Evers, PA Naylor - 2019 IEEE Workshop on …, 2019 - ieeexplore.ieee.org
Speaker segmentation is an essential part of any diarization system. Applications of
diarization include tasks such as speaker indexing, improving automatic speech recognition …

Method and system for conversation transcription with metadata

KL Bradley, E Coeytaux, YIN Ziming - US Patent 12,125,487, 2024 - Google Patents
Methods and systems for enabling an efficient review of meeting content via a metadata-
enriched, speaker-attributed and multiuser-editable transcript are disclosed. By …

Multichannel overlapping speaker segmentation using multiple hypothesis tracking of acoustic and spatial features

AOT Hogg, C Evers, PA Naylor - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
An essential part of any diarization system is the task of speaker segmentation which is
important for many applications including speaker indexing and automatic speech …

Videoconference interpreting goes multimodal: Some insights and a tentative proposal

X Zhang, GC Pastor, J Zhang - Interpreting Technologies–Current …, 2023 - jbe-platform.com
Recent times have witnessed an unprecedent surge of distant modalities of interpreting
(remote, videoconference, etc.). The tendency has been particularly noticeable since the …