Efficient Personalized Speech Enhancement through Self-Supervised Learning A Sivaraman, M Kim IEEE Journal of Selected Topics in Signal Processing 16 (6), 1342-1356, 2022 | 33* | 2022 |
Adapting Speech Separation Systems to Real-World Meetings using Mixture Invariant Training A Sivaraman, S Wisdom, H Erdogan, JR Hershey ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 25 | 2022 |
Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification A Sivaraman, S Kim, M Kim Proc. Interspeech 2021, 2676-2680, 2021 | 24 | 2021 |
Detecting Extraneous Content in Podcasts S Reddy, Y Yu, A Pappu, A Sivaraman, R Rezapour, R Jones Proc. European Chapter of the Association for Computational Linguistics …, 2021 | 19 | 2021 |
Audio signal encoding method and apparatus and audio signal decoding method and apparatus using psychoacoustic-based weighted error function J Sung, M Kim, A Sivaraman, K Zhen US Patent 11,416,742, 2022 | 15 | 2022 |
Sparse Mixture of Local Experts for Efficient Speech Enhancement A Sivaraman, M Kim Proc. Interspeech 2020, 4526-4530, 2020 | 12 | 2020 |
Zero-shot personalized speech enhancement through speaker-informed model selection A Sivaraman, M Kim 2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021 | 10 | 2021 |
On psychoacoustically weighted cost functions towards resource-efficient deep neural networks for speech denoising K Zhen, A Sivaraman, J Sung, M Kim arXiv preprint arXiv:1801.09774, 2018 | 10 | 2018 |
Deep Autotuner: A Data-Driven Approach To Natural-Sounding Pitch Correction For Singing Voice In Karaoke Performances S Wager, G Tzanetakis, C Wang, L Guo, A Sivaraman, M Kim IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP), Submitted …, 0 | 3* | |
The Potential of Neural Speech Synthesis-Based Data Augmentation for Personalized Speech Enhancement A Kuznetsova, A Sivaraman, M Kim ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
A Data-Driven Approach to Smooth Pitch Correction for Singing Voice in Pop Music S Wager, L Guo, A Sivaraman, M Kim arXiv preprint arXiv:1805.02603, 2018 | 1 | 2018 |
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection S Palaskar, O Rudovic, S Dharur, F Pesce, G Krishna, A Sivaraman, ... arXiv preprint arXiv:2406.09617, 2024 | | 2024 |
Resource-Efficient Model Adaptation Methods for Personalized Speech Enhancement Systems A Sivaraman Indiana University, 2024 | | 2024 |
Systems and methods for skip-based content detection AK Pappu, RE Jones, S Nahmad, K Savage, A Sivaraman US Patent 11,234,031, 2022 | | 2022 |
Quantization Error Tolerance in Hashed Audio Spectra A Sivaraman | | 2015 |