Method and system for creating frugal speech corpus using internet resources and conventional speech corpus

S Kopparapu, IA Sheikh - US Patent 8,756,064, 2014 - Google Patents
A speech corpus creation method and system are disclosed. The method comprises
identifying a publicly accessible first source of first speech data and its corresponding
first text transcription; extracting second speech data in an accessible encoding format
from the first speech data; extracting second text transcription data with at least one
encoding format from the first text transcription data; matching and aligning the transcription
to the extracted second speech data at a sentence, word, or phoneme level, or a combination …