Roshan Sharma: Scholar Profile

Cited by

	All	Since 2019
Citations	169	169
h-index	8	8
i10-index	6	6

Citations per year: 2020: 4, 2021: 4, 2022: 18, 2023: 43, 2024: 100

Public access

1 article available
0 articles not available

Based on funding mandates

Co-authors

Shinji Watanabe, Carnegie Mellon University (verified email at cmu.edu)
Siddhant Arora, Graduate Student, Carnegie Mellon University (verified email at andrew.cmu.edu)
Bhiksha Raj, Carnegie Mellon University (verified email at cs.cmu.edu)
Jee-weon Jung, Carnegie Mellon University (verified email at ieee.org)
Hira Dhamyal, Carnegie Mellon University (verified email at andrew.cmu.edu)
Yifan Peng, Carnegie Mellon University (verified email at andrew.cmu.edu)
Jiatong Shi (史嘉彤), Carnegie Mellon University (verified email at andrew.cmu.edu)
William Chen, Carnegie Mellon University (verified email at cmu.edu)
Hung-yi Lee, National Taiwan University (verified email at ntu.edu.tw)
Soumi Maiti, Carnegie Mellon University (verified email at andrew.cmu.edu)
Karen Livescu, TTI-Chicago (verified email at ttic.edu)
Xinjian Li, Google (verified email at google.com)
Xuankai Chang, Carnegie Mellon University, Student (verified email at andrew.cmu.edu)
Florian Metze, Carnegie Mellon University; Meta AI (verified email at andrew.cmu.edu)
Shruti Palaskar, Apple (verified email at apple.com)
Alan W Black, Professor, Language Technologies Institute, Carnegie Mellon University (verified email at cs.cmu.edu)
Ankita Pasad, Toyota Technological Institute at Chicago (verified email at ttic.edu)
Suwon Shon, ASAPP (verified email at csail.mit.edu)
Dareen Alharthi, Researcher, Carnegie Mellon University (verified email at andrew.cmu.edu)
Felix Wu, Character AI (verified email at character.ai)

Roshan Sharma

Research Scientist, Google

Verified email at google.com

Speech Recognition, Speech Processing, Machine Learning


Title	Cited by	Year
End-to-end speech summarization using restricted self-attention. R Sharma, S Palaskar, AW Black, F Metze. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022	30*	2022
SLUE phase-2: A benchmark suite of diverse spoken language understanding tasks. S Shon, S Arora, CJ Lin, A Pasad, F Wu, R Sharma, WL Wu, HY Lee, ... arXiv preprint arXiv:2212.10525, 2022	24	2022
Reproducing whisper-style training using an open-source toolkit and publicly available data. Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	19	2023
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study. X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ... ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024	14	2024
A summary of the first workshop on language technology for language documentation and revitalization. G Neubig, S Rijhwani, A Palmer, J MacKenzie, H Cruz, X Li, M Lee, ... arXiv preprint arXiv:2004.13203, 2020	14	2020
Speech recognition in Kannada using HTK and julius: a comparative study. RS Sharma, SH Paladugu, KJ Priya, D Gupta. 2019 international conference on communication and signal processing (iccsp …, 2019	14	2019
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech. C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ... ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024	9	2024
Loft: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model. MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ... arXiv preprint arXiv:2310.04445, 2023	9	2023
Speech summarization of long spoken document: Improving memory efficiency of speech/text encoders. T Kano, A Ogawa, M Delcroix, R Sharma, K Matsuura, S Watanabe. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023	8	2023
Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network. S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ... arXiv preprint arXiv:2310.02973, 2023	5	2023
BASS: Block-wise Adaptation for Speech Summarization. R Sharma, S Arora, K Zheng, S Watanabe, R Singh, B Raj. Proc. INTERSPEECH 2023, 1454--1458, 2023	4	2023
Xnor-former: Learning accurate approximations in long speech transformers. R Sharma, B Raj. arXiv preprint arXiv:2210.16643, 2022	4	2022
Espnet-summ: Introducing a novel large dataset, toolkit, and a cross-corpora evaluation of speech summarization systems. R Sharma, W Chen, T Kano, R Sharma, S Arora, S Watanabe, A Ogawa, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	3	2023
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction. R Sharma, T Vuong, M Lindsey, H Dhamyal, R Singh, B Raj. Proceedings of the 39th International Conference on Machine Learning 2022 …, 2022	3	2022
AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models. J Jung, R Sharma, W Chen, B Raj, S Watanabe. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024	2	2024
Unifying the discrete and continuous emotion labels for speech emotion recognition. R Sharma, H Dhamyal, B Raj, R Singh. arXiv preprint arXiv:2210.16642, 2022	2	2022
On the Evaluation of Speech Foundation Models for Spoken Language Understanding. S Arora, A Pasad, CM Chien, J Han, R Sharma, J Jung, H Dhamyal, ... arXiv preprint arXiv:2406.10083, 2024	1	2024
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech. D Alharthi, R Sharma, H Dhamyal, S Maiti, B Raj, R Singh. arXiv preprint arXiv:2310.00706, 2023	1	2023
Augmenting text for spoken language understanding with Large Language Models. R Sharma, S Kim, D Lazar, T Le, A Shrivastava, K Ahn, P Kansal, L Sari, ... arXiv preprint arXiv:2309.09390, 2023	1	2023
Egocentric audio-visual noise suppression. R Sharma, W He, J Lin, E Lakomkin, Y Liu, K Kalgaonkar. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023	1	2023
