Speaker recognition from raw waveform with sincnet M Ravanelli, Y Bengio 2018 IEEE spoken language technology workshop (SLT), 1021-1028, 2018 | 875 | 2018 |
SpeechBrain: A general-purpose speech toolkit M Ravanelli, T Parcollet, P Plantinga, A Rouhe, S Cornell, L Lugosch, ... arXiv preprint arXiv:2106.04624, 2021 | 583 | 2021 |
Attention is all you need in speech separation C Subakan, M Ravanelli, S Cornell, M Bronzi, J Zhong ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 461 | 2021 |
Light gated recurrent units for speech recognition M Ravanelli, P Brakel, M Omologo, Y Bengio IEEE Transactions on Emerging Topics in Computational Intelligence 2 (2), 92-102, 2018 | 396 | 2018 |
Speech model pre-training for end-to-end spoken language understanding L Lugosch, M Ravanelli, P Ignoto, VS Tomar, Y Bengio arXiv preprint arXiv:1904.03670, 2019 | 347 | 2019 |
Multi-task self-supervised learning for robust speech recognition M Ravanelli, J Zhong, S Pascual, P Swietojanski, J Monteiro, J Trmal, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 303 | 2020 |
The pytorch-kaldi speech recognition toolkit M Ravanelli, T Parcollet, Y Bengio ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 270 | 2019 |
Learning problem-agnostic speech representations from multiple self-supervised tasks S Pascual, M Ravanelli, J Serra, A Bonafonte, Y Bengio arXiv preprint arXiv:1904.03416, 2019 | 263 | 2019 |
Metricgan+: An improved version of metricgan for speech enhancement SW Fu, C Yu, TA Hsieh, P Plantinga, M Ravanelli, X Lu, Y Tsao arXiv preprint arXiv:2104.03538, 2021 | 193 | 2021 |
Quaternion recurrent neural networks T Parcollet, M Ravanelli, M Morchid, G Linarès, C Trabelsi, R De Mori, ... arXiv preprint arXiv:1806.04418, 2018 | 155 | 2018 |
Interpretable convolutional filters with sincnet M Ravanelli, Y Bengio arXiv preprint arXiv:1811.09725, 2018 | 135 | 2018 |
Samuele Cornell M Ravanelli, T Parcollet, P Plantinga, A Rouhe Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan …, 2021 | 102 | 2021 |
Learning Speaker Representations with Mutual Information. M Ravanelli, Y Bengio Interspeech, 1153-1157, 2019 | 97 | 2019 |
The DIRHA simulated corpus. L Cristoforetti, M Ravanelli, M Omologo, A Sosi, A Abad, M Hagmüller, ... LREC, 2629-2634, 2014 | 96 | 2014 |
ECAPA-TDNN embeddings for speaker diarization N Dawalatabad, M Ravanelli, F Grondin, J Thienpondt, B Desplanques, ... arXiv preprint arXiv:2104.01466, 2021 | 93 | 2021 |
Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, et al. 2021. Speechbrain: A general-purpose speech toolkit M Ravanelli, T Parcollet, P Plantinga, A Rouhe arXiv preprint arXiv:2106.04624, 1-34, 2021 | 72 | 2021 |
Improving speech recognition by revising gated recurrent units M Ravanelli, P Brakel, M Omologo, Y Bengio arXiv preprint arXiv:1710.00641, 2017 | 67 | 2017 |
The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments M Ravanelli, L Cristoforetti, R Gretter, M Pellin, A Sosi, M Omologo 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 64 | 2015 |
A network of deep neural networks for distant speech recognition M Ravanelli, P Brakel, M Omologo, Y Bengio 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 53 | 2017 |
Batch-normalized joint training for DNN-based distant speech recognition M Ravanelli, P Brakel, M Omologo, Y Bengio 2016 IEEE Spoken Language Technology Workshop (SLT), 28-34, 2016 | 44 | 2016 |