Authors
Vincent Vandeghinste, Jolien Mathysen, Elke Peters, Patrick Wambacq
Publication date
2023/10/1
Journal
CLARIN Annual Conference Proceedings
Pages
1
Description
We present the Spoken Academic Belgian Dutch (SABeD) corpus. It was compiled from selected first-year bachelor lectures at higher education institutions in Flanders, because students indicate that the language used in such lectures is one of the hurdles to comprehension and academic success. We first applied automatic speech recognition to these lectures, then performed manual utterance segmentation and manual correction of the automatic transcription. The resulting text is processed with the FROG language analyser and will be made searchable through a CLARIN website as soon as all manual editing is done.