作者
JN Singh, Sahil Sirohi, Shachi Mall
发表日期
2023/12/15
研讨会论文
2023 5th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N)
页码范围
995-998
出版商
IEEE
简介
Artificial intelligence (AI) allows machine systems to act like human intelligence, permitting them to learn from data and make decisions based on analysis. the technology known as voice recognition, which translates spoken words into text or other formats that are desired. First, voice signals are recorded using audio input devices. Then, pre-processing is done to remove unnecessary information and background noise. Next, feature extraction converts the pre-processed signals into a set of features for recognition. Acoustic modeling involves creating statistical models mapping these features to phonetic units, often utilizing Hidden Markov Models. Language modeling constructs statistical models of spoken language to aid in recognizing words and phrases in context. Decoding matches the acoustic and language models to identify spoken words or phrases. Finally, recognized words or phrases are outputted in the …
学术搜索中的文章
JN Singh, S Sirohi, S Mall - 2023 5th International Conference on Advances in …, 2023