Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments

S Baghel, S Ramoji, S Jain, PR Chowdhuri… - Speech …, 2024 - Elsevier
In multi-lingual societies, where multiple languages are spoken in a small geographic
vicinity, informal conversations often involve mix of languages. Existing speech technologies …

[HTML][HTML] ATOMMIC: An Advanced Toolbox for Multitask Medical Imaging Consistency to facilitate Artificial Intelligence applications from acquisition to analysis in …

D Karkalousos, I Išgum, HA Marquering… - Computer Methods and …, 2024 - Elsevier
Abstract Background and Objectives: Artificial intelligence (AI) is revolutionizing Magnetic
Resonance Imaging (MRI) along the acquisition and processing chain. Advanced AI …

An AI powered system to enhance self‐reflection practice in coaching

M Jelodari, MH Amirhosseini… - Cognitive Computation …, 2023 - Wiley Online Library
Self‐reflection practice in coaching can help with time management by promoting self‐
awareness. Through this process, a coach can identify habits, tendencies and behaviours …

Mi-Go: tool which uses YouTube as data source for evaluating general-purpose speech recognition machine learning models

T Wojnar, J Hryszko, A Roman - EURASIP Journal on Audio, Speech, and …, 2024 - Springer
Abstract This article introduces Mi-Go, a tool aimed at evaluating the performance and
adaptability of general-purpose speech recognition machine learning models across diverse …

Towards Yoruba-Speaking Google Maps Navigation

F Oyesanmi, P Olukanmi - 2024 - researchsquare.com
Advances in natural language processing (NLP) have made several technological
interventions and services available to people in different languages. One such service is …

A Multi-modal Approach to Mining Intent from Code-Mixed Hindi-English Calls in the Hyperlocal-Delivery Domain

J Mathew, P Sahu, B Singhal, A Joshi… - … Conference on Speech …, 2022 - Springer
In this work we outline an approach to mine insights from calls between delivery partners
(DP) and customers involved in hyperlocal food delivery in India. Incorrect …

[PDF][PDF] Improving Automatic Transcription Using Natural Language Processing

A Kiefer - 2024 - digitalcommons.calpoly.edu
In the United States, power is shared between federal and state governments. With the
exception of Nebraska, state legislatives are divided into upper and lower houses. In the …

Human-Robot Interaction Using Deep Learning Based Approach

A Pallavoor, A Jalan, SC Ballapur, S Kiran… - … and Machine Learning … - books.google.com
Interactions with robots involve various modes, including gestures, audio commands, and
texts, where the choice of communication interface significantly influences the robot's …

End-to-end Speech Translation with Self-supervised Speech Representations

GI Gallego Olsina - 2020 - upcommons.upc.edu
Nowadays, there is a growing interest in the field of Speech Translation (speech-to-text).
Traditionally, this task has been faced with the concatenation of Automatic Speech …