关注
Roshan Sharma
Roshan Sharma
Research Scientist, Google
在 google.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
End-to-end speech summarization using restricted self-attention
R Sharma, S Palaskar, AW Black, F Metze
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
30*2022
SLUE phase-2: A benchmark suite of diverse spoken language understanding tasks
S Shon, S Arora, CJ Lin, A Pasad, F Wu, R Sharma, WL Wu, HY Lee, ...
arXiv preprint arXiv:2212.10525, 2022
242022
Reproducing whisper-style training using an open-source toolkit and publicly available data
Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
192023
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study
X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
142024
A summary of the first workshop on language technology for language documentation and revitalization
G Neubig, S Rijhwani, A Palmer, J MacKenzie, H Cruz, X Li, M Lee, ...
arXiv preprint arXiv:2004.13203, 2020
142020
Speech recognition in Kannada using HTK and julius: a comparative study
RS Sharma, SH Paladugu, KJ Priya, D Gupta
2019 international conference on communication and signal processing (iccsp …, 2019
142019
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech
C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
92024
Loft: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model
MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ...
arXiv preprint arXiv:2310.04445, 2023
92023
Speech summarization of long spoken document: Improving memory efficiency of speech/text encoders
T Kano, A Ogawa, M Delcroix, R Sharma, K Matsuura, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
82023
Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network
S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ...
arXiv preprint arXiv:2310.02973, 2023
52023
BASS: Block-wise Adaptation for Speech Summarization
R Sharma, S Arora, K Zheng, S Watanabe, R Singh, B Raj
Proc. INTERSPEECH 2023, 1454--1458, 2023
42023
Xnor-former: Learning accurate approximations in long speech transformers
R Sharma, B Raj
arXiv preprint arXiv:2210.16643, 2022
42022
Espnet-summ: Introducing a novel large dataset, toolkit, and a cross-corpora evaluation of speech summarization systems
R Sharma, W Chen, T Kano, R Sharma, S Arora, S Watanabe, A Ogawa, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
32023
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction
R Sharma, T Vuong, M Lindsey, H Dhamyal, R Singh, B Raj
Proceedings of the 39th International Conference on Machine Learning 2022 …, 2022
32022
AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models
J Jung, R Sharma, W Chen, B Raj, S Watanabe
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
22024
Unifying the discrete and continuous emotion labels for speech emotion recognition
R Sharma, H Dhamyal, B Raj, R Singh
arXiv preprint arXiv:2210.16642, 2022
22022
On the Evaluation of Speech Foundation Models for Spoken Language Understanding
S Arora, A Pasad, CM Chien, J Han, R Sharma, J Jung, H Dhamyal, ...
arXiv preprint arXiv:2406.10083, 2024
12024
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech
D Alharthi, R Sharma, H Dhamyal, S Maiti, B Raj, R Singh
arXiv preprint arXiv:2310.00706, 2023
12023
Augmenting text for spoken language understanding with Large Language Models
R Sharma, S Kim, D Lazar, T Le, A Shrivastava, K Ahn, P Kansal, L Sari, ...
arXiv preprint arXiv:2309.09390, 2023
12023
Egocentric audio-visual noise suppression
R Sharma, W He, J Lin, E Lakomkin, Y Liu, K Kalgaonkar
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
12023
系统目前无法执行此操作,请稍后再试。
文章 1–20