Multiclass audio segmentation based on recurrent neural networks for broadcast domain data P Gimeno, I Viñals, A Ortega, A Miguel, E Lleida EURASIP Journal on Audio, Speech, and Music Processing 2020, 1-19, 2020 | 47 | 2020 |
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge. I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida Interspeech, 2803-2807, 2018 | 32 | 2018 |
Generalizing AUC optimization to multiclass classification for audio segmentation with limited training data P Gimeno, V Mingote, A Ortega, A Miguel, E Lleida IEEE Signal Processing Letters 28, 1135-1139, 2021 | 14 | 2021 |
Phonetically-Aware Embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention Models for the 2018 NIST Speaker Recognition Evaluation. I Viñals, D Ribas, V Mingote, J Llombart, P Gimeno, A Miguel, ... Interspeech, 4310-4314, 2019 | 11 | 2019 |
Partial AUC Optimisation using Recurrent Neural Networks for Music Detection with Limited Training Data P Gimeno, V Mingote, A Ortega, A Miguel, E Lleida Interspeech, 3067-3071, 2020 | 10 | 2020 |
ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida Proc. Interspeech 2019, 988-992, 2019 | 10 | 2019 |
In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida Proc. IberSPEECH 2018, 220-223, 2018 | 10 | 2018 |
Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions P Gimeno, D Ribas, A Ortega, A Miguel, E Lleida Iberspeech 2021, 2021 | 8 | 2021 |
A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data P Gimeno, I Viñals, A Ortega, A Miguel, E Lleida Proc. IberSPEECH 2018, 87-91, 2018 | 7 | 2018 |
Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021 P Gimeno, A Ortega, A Miguel, E Lleida Proc. Interspeech 2021, 4359-4363, 2021 | 5 | 2021 |
ALLIES: A Speech Corpus for Segmentation, Speaker Diarization, Speech Recognition and Speaker Change Detection M Tahon, A Larcher, M Lebourdais, F Bougares, A Silnova, P Gimeno Proceedings of the 2024 Joint International Conference on Computational …, 2024 | 3 | 2024 |
Improved cross-lingual transfer learning for automatic speech translation S Khurana, N Dawalatabad, A Laurent, L Vicente, P Gimeno, V Mingote, ... arXiv preprint arXiv:2306.00789, 2023 | 3 | 2023 |
Multimodal diarization systems by training enrollment models as identity representations V Mingote, I Viñals, P Gimeno, A Miguel, A Ortega, E Lleida Applied Sciences 12 (3), 1141, 2022 | 3 | 2022 |
ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge V Mingote, I Vinals, P Gimeno, A Miguel, A Ortega, E Lleida Iberspeech 2021, 2021 | 2 | 2021 |
ViVoVAD: a Voice Activity Detection Tool based on Recurrent Neural Networks PG Jordán, IV Bailo, AO Giménez, AM Artiaga, EL Solano Jornada de Jóvenes Investigadores del I3A 7, 2019 | 2 | 2019 |
3MAS: a multitask, multilabel, multidataset semi-supervised audio segmentation model M Lebourdais, P Gimeno, T Mariotte, M Tahon, A Ortega, A Larcher Speaker and Language Recognition Workshop-Odyssey, 2024 | 1 | 2024 |
Unsupervised adaptation of deep speech activity detection models to unseen domains P Gimeno, D Ribas, A Ortega, A Miguel, E Lleida Applied Sciences 12 (4), 1832, 2022 | 1 | 2022 |
Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida Proc. IberSPEECH 2021, 94-98, 2021 | 1 | 2021 |
Cross-Lingual Transfer Learning for Low-Resource Speech Translation S Khurana, N Dawalatabad, A Laurent, L Vicente, P Gimeno, V Mingote, ... IEEE International Conference on Acoustics, Speech and Signal Processing …, 2024 | | 2024 |
Direct Text to Speech Translation System Using Acoustic Units V Mingote, P Gimeno, L Vicente, S Khurana, A Laurent, J Duret IEEE Signal Processing Letters, 2023 | | 2023 |