Ashishkumar Gudmalwar
National Institute of Technology
Verified email at nitm.ac.in
Title
Cited by
Year
Multichannel CNN-BLSTM architecture for speech emotion recognition system by fusion of magnitude and phase spectral features using DCCA for consumer applications
GA Prabhakar, B Basel, A Dutta, CVR Rao
IEEE Transactions on Consumer Electronics 69 (2), 226-235, 2023
Cited by 38, 2023
Improving the performance of the speaker emotion recognition based on low dimension prosody features vector
AP Gudmalwar, CV Rama Rao, A Dutta
International Journal of Speech Technology 22, 521-531, 2019
Cited by 14, 2019
Performance analysis of ASR system in hybrid DNN-HMM framework using a PWL Euclidean activation function
A Dutta, G Ashishkumar, CVR Rao
Frontiers of Computer Science 15 (4), 154705, 2021
Cited by 8, 2021
Designing of Gabor filters for spectro-temporal feature extraction to improve the performance of ASR system
A Dutta, G Ashishkumar, CVR Rao
International Journal of Speech Technology 22 (4), 1085-1097, 2019
Cited by 7, 2019
Phase Based Spectro-Temporal Features for Building a Robust ASR System.
A Dutta, AP Gudmalwar, CVR Rao
INTERSPEECH, 1668-1672, 2020
Cited by 5, 2020
Improving the performance of ASR system by building acoustic models using spectro-temporal and phase-based features
A Dutta, G Ashishkumar, CVR Rao
Circuits, Systems, and Signal Processing, 1-24, 2022
Cited by 4, 2022
The Magnitude and Phase based Speech Representation Learning using Autoencoder for Classifying Speech Emotions using Deep Canonical Correlation Analysis.
AP Gudmalwar, B Basel, A Dutta, CVR Rao
INTERSPEECH, 1163-1167, 2022
Cited by 3, 2022
Auditory inspired acoustic model for hybrid ASR system using gammatone based Gabor filters
A Dutta, G Ashishkumar, CVR Rao
2019 IEEE International Symposium on Signal Processing and Information …, 2019
Cited by 3, 2019
Design and analysis of inexact 3:2 compressor-based radix-4 multiplier towards image multiplication
SK Beura, GA Prabhakar, SM Mahanta, BP Devi, P Saha
2021 12th International Conference on Computing Communication and Networking …, 2021
Cited by 2, 2021
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing
N Sahipjohn, A Gudmalwar, N Shah, P Wasnik, RR Shah
arXiv preprint arXiv:2406.08802, 2024
Cited by 1, 2024
Single Channel Speech Enhancement System using Convolutional Neural Network based Autoencoder for Noisy Environments
R Buragohain, G Ashishkumar, CVR Rao
2022 IEEE 19th India Council International Conference (INDICON), 1-6, 2022
Cited by 1, 2022
Single channel speech enhancement using masking based on sinusoidal modeling
R Buragohain, RA Reddy, Y Venkatesh, GA Prabhakar, CVR Rao
International Conference on Recent Trends in Image Processing and Pattern …, 2021
Cited by 1, 2021
VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech
A Gudmalwar, N Shah, S Akarsh, P Wasnik, RR Shah
arXiv preprint arXiv:2406.08076, 2024
2024
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning
SR Mhaskar, NJ Shah, M Zaki, AP Gudmalwar, P Wasnik, RR Shah
arXiv preprint arXiv:2403.15469, 2024
2024
Single Channel Speech Enhancement Using Masking Based on Sinusoidal Modeling
A Gudmalwar, CVR Rao
SN Computer Science 4 (1), 71, 2022
2022
Extraction of Temporal Features on Fibonacci Space for Audio Based Vehicle Classification
A Sinha, SH Kumar, GA Prabhakar, CVR Rao
International Conference on Recent Trends in Image Processing and Pattern …, 2021
2021
On the Impact of Gabor Phase for Spectro-Temporal Feature Extraction in Building an ASR System
A Dutta, G Prabhakar, CVR Rao
2020 11th IEEE Annual Information Technology, Electronics and Mobile …, 2020
2020
Estimation of Fundamental Frequency of Noisy Speech Signals using Correlogram based on Subband Filtering
A Gudmalwar, A Dutta, VR Rao
2019 IEEE 6th International Conference on Engineering Technologies and …, 2019
2019
Articles 1–18