Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification V Mingote, A Miguel, A Ortega, E Lleida Computer Speech & Language 63, 101078, 2020 | 39 | 2020 |
Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems. V Mingote, A Miguel, D Ribas, AO Giménez, E Lleida INTERSPEECH, 2903-2907, 2019 | 26 | 2019 |
Knowledge distillation and random erasing data augmentation for text-dependent speaker verification V Mingote, A Miguel, D Ribas, A Ortega, E Lleida ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 17 | 2020 |
Generalizing AUC optimization to multiclass classification for audio segmentation with limited training data P Gimeno, V Mingote, A Ortega, A Miguel, E Lleida IEEE Signal Processing Letters 28, 1135-1139, 2021 | 15 | 2021 |
Language Recognition Using Triplet Neural Networks. V Mingote, D Castan, M McLaren, MK Nandwana, AO Giménez, E Lleida, ... INTERSPEECH, 4025-4029, 2019 | 15 | 2019 |
Memory layers with multi-head attention mechanisms for text-dependent speaker verification V Mingote, A Miguel, A Ortega, E Lleida ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 12 | 2021 |
Supervector extraction for encoding speaker and phrase information with neural networks for text-dependent speaker verification V Mingote, A Miguel, A Ortega, E Lleida Applied Sciences 9 (16), 3295, 2019 | 12 | 2019 |
Phonetically-Aware Embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention Models for the 2018 NIST Speaker Recognition Evaluation. I Viñals, D Ribas, V Mingote, J Llombart, P Gimeno, A Miguel, ... Interspeech, 4310-4314, 2019 | 11 | 2019 |
Partial AUC Optimisation Using Recurrent Neural Networks for Music Detection with Limited Training Data. P Gimeno, V Mingote, AO Giménez, A Miguel, E Lleida INTERSPEECH, 3067-3071, 2020 | 10 | 2020 |
Class token and knowledge distillation for multi-head self-attention speaker verification systems V Mingote, A Miguel, A Ortega, E Lleida Digital Signal Processing 133, 103859, 2023 | 9 | 2023 |
Log-likelihood-ratio cost function as objective loss for speaker verification systems V Mingote, A Miguel, A Ortega, E Lleida Interspeech 2021, 2361-2365, 2021 | 7 | 2021 |
Differentiable supervector extraction for encoding speaker and phrase information in text dependent speaker verification V Mingote, A Miguel, A Ortega, E Lleida arXiv preprint arXiv:1812.09484, 2018 | 6 | 2018 |
Training Speaker Enrollment Models by Network Optimization. V Mingote, A Miguel, AO Giménez, E Lleida INTERSPEECH, 3810-3814, 2020 | 5 | 2020 |
Improved cross-lingual transfer learning for automatic speech translation S Khurana, N Dawalatabad, A Laurent, L Vicente, P Gimeno, V Mingote, ... arXiv preprint arXiv:2306.00789, 2023 | 3 | 2023 |
Multimodal diarization systems by training enrollment models as identity representations V Mingote, I Viñals, P Gimeno, A Miguel, A Ortega, E Lleida Applied Sciences 12 (3), 1141, 2022 | 3 | 2022 |
ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge. V Mingote, I Vinals, P Gimeno, A Miguel, AO Giménez, E Lleida IberSPEECH, 2021 | 2 | 2021 |
Direct Text to Speech Translation System Using Acoustic Units V Mingote, P Gimeno, L Vicente, S Khurana, A Laurent, J Duret IEEE Signal Processing Letters, 2023 | 1 | 2023 |
aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems V Mingote, A Miguel, D Ribas, A Ortega, E Lleida IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 772-784, 2022 | 1 | 2022 |
Cross-Lingual Transfer Learning for Low-Resource Speech Translation S Khurana, N Dawalatabad, A Laurent, L Vicente, P Gimeno, V Mingote, ... IEEE International Conference on Acoustics, Speech and Signal Processing …, 2024 | | 2024 |
Multi-lingual Speech to Speech Translation for Under-Resourced Languages A Larcher, Y Estève, M Rouvier, N Tomashenko, J Duret, G Laperriere, ... Le Mans Université, 2022 | | 2022 |