Multichannel CNN-BLSTM architecture for speech emotion recognition system by fusion of magnitude and phase spectral features using DCCA for consumer applications GA Prabhakar, B Basel, A Dutta, CVR Rao IEEE Transactions on Consumer Electronics 69 (2), 226-235, 2023 | 38 | 2023 |
Improving the performance of the speaker emotion recognition based on low dimension prosody features vector AP Gudmalwar, CV Rama Rao, A Dutta International Journal of Speech Technology 22, 521-531, 2019 | 14 | 2019 |
Performance analysis of ASR system in hybrid DNN-HMM framework using a PWL euclidean activation function A Dutta, G Ashishkumar, CVR Rao Frontiers of Computer Science 15 (4), 154705, 2021 | 8 | 2021 |
Designing of gabor filters for spectro-temporal feature extraction to improve the performance of asr system A Dutta, G Ashishkumar, CVR Rao International Journal of Speech Technology 22 (4), 1085-1097, 2019 | 7 | 2019 |
Phase Based Spectro-Temporal Features for Building a Robust ASR System. A Dutta, AP Gudmalwar, CVR Rao INTERSPEECH, 1668-1672, 2020 | 5 | 2020 |
Improving the performance of asr system by building acoustic models using spectro-temporal and phase-based features A Dutta, G Ashishkumar, CVR Rao Circuits, Systems, and Signal Processing, 1-24, 2022 | 4 | 2022 |
The Magnitude and Phase based Speech Representation Learning using Autoencoder for Classifying Speech Emotions using Deep Canonical Correlation Analysis. AP Gudmalwar, B Basel, A Dutta, CVR Rao INTERSPEECH, 1163-1167, 2022 | 3 | 2022 |
Auditory inspired acoustic model for hybrid asr system using gammatone based gabor filters A Dutta, G Ashishkumar, CVR Rao 2019 IEEE International Symposium on Signal Processing and Information …, 2019 | 3 | 2019 |
Design and analysis of inexact 3: 2 compressor-based radix-4 multiplier towards image multiplication SK Beura, GA Prabhakar, SM Mahanta, BP Devi, P Saha 2021 12th International Conference on Computing Communication and Networking …, 2021 | 2 | 2021 |
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing N Sahipjohn, A Gudmalwar, N Shah, P Wasnik, RR Shah arXiv preprint arXiv:2406.08802, 2024 | 1 | 2024 |
Single Channel Speech Enhancement System using Convolutional Neural Network based Autoencoder for Noisy Environments R Buragohain, G Ashishkumar, CVR Rao 2022 IEEE 19th India Council International Conference (INDICON), 1-6, 2022 | 1 | 2022 |
Single channel speech enhancement using masking based on sinusoidal modeling R Buragohain, RA Reddy, Y Venkatesh, GA Prabhakar, CVR Rao International Conference on Recent Trends in Image Processing and Pattern …, 2021 | 1 | 2021 |
VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech A Gudmalwar, N Shah, S Akarsh, P Wasnik, RR Shah arXiv preprint arXiv:2406.08076, 2024 | | 2024 |
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning SR Mhaskar, NJ Shah, M Zaki, AP Gudmalwar, P Wasnik, RR Shah arXiv preprint arXiv:2403.15469, 2024 | | 2024 |
Single Channel Speech Enhancement Using Masking Based on Sinusoidal Modeling A Gudmalwar, CVR Rao SN Computer Science 4 (1), 71, 2022 | | 2022 |
Extraction of Temporal Features on Fibonacci Space for Audio Based Vehicle Classification A Sinha, SH Kumar, GA Prabhakar, CVR Rao International Conference on Recent Trends in Image Processing and Pattern …, 2021 | | 2021 |
On the Impact of Gabor Phase for Spectro-Temporal Feature Extraction in Building an ASR System A Dutta, G Prabhakar, CVR Rao 2020 11th IEEE Annual Information Technology, Electronics and Mobile …, 2020 | | 2020 |
Estimation of Fundamental Frequency of Noisy Speech Signals using Correlogram based on Subband Filtering A Gudmalwar, A Dutta, VR Rao 2019 IEEE 6th International Conference on Engineering Technologies and …, 2019 | | 2019 |