[PDF][PDF] Bidirectional Recurrent Neural Network with Attention Mechanism for Punctuation Restoration.

O Tilk, T Alumäe - Interspeech, 2016 - researchgate.net
Automatic speech recognition systems generally produce unpunctuated text which is difficult
to read for humans and degrades the performance of many downstream machine …

Decembert: Learning from noisy instructional videos via dense captions and entropy minimization

Z Tang, J Lei, M Bansal - Proceedings of the 2021 Conference of …, 2021 - aclanthology.org
Leveraging large-scale unlabeled web videos such as instructional videos for pre-training
followed by task-specific finetuning has become the de facto approach for many video-and …

Punctuation prediction for unsegmented transcript based on word vector

X Che, C Wang, H Yang, C Meinel - Proceedings of the Tenth …, 2016 - aclanthology.org
In this paper we propose an approach to predict punctuation marks for unsegmented speech
transcript. The approach is purely lexical, with pre-trained Word Vectors as the only input. A …

Advanced rich transcription system for Estonian speech

T Alumäe, O Tilk - Human language technologies–the Baltic …, 2018 - ebooks.iospress.nl
This paper describes the current TTÜ speech transcription system for Estonian speech. The
system is designed to handle semi-spontaneous speech, such as broadcast conversations …

Capitalization and punctuation restoration: a survey

V Păiş, D Tufiş - Artificial Intelligence Review, 2022 - Springer
Ensuring proper punctuation and letter casing is a key pre-processing step towards applying
complex natural language processing algorithms. This is especially significant for textual …

Self-attention based model for punctuation prediction using word and speech embeddings

J Yi, J Tao - ICASSP 2019-2019 IEEE International Conference …, 2019 - ieeexplore.ieee.org
This paper proposes to use self-attention based model to predict punctuation marks for word
sequences. The model is trained using word and speech embedding features which are …

Punctuation prediction model for conversational speech

P Żelasko, P Szymański, J Mizgajski… - arXiv preprint arXiv …, 2018 - arxiv.org
An ASR system usually does not predict any punctuation or capitalization. Lack of
punctuation causes problems in result presentation and confuses both the human reader …

Efficient automatic punctuation restoration using bidirectional transformers with robust inference

M Courtland, A Faulkner… - Proceedings of the 17th …, 2020 - aclanthology.org
Though people rarely speak in complete sentences, punctuation confers many benefits to
the readers of transcribed speech. Unfortunately, most ASR systems do not produce …

A nonlinear regression application via machine learning techniques for geomagnetic data reconstruction processing

H Liu, Z Liu, S Liu, Y Liu, J Bin, F Shi… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
The integrity of geomagnetic data is a critical factor in understanding the evolutionary
process of Earth's magnetic field, as it provides useful information for near-surface …

Adversarial transfer learning for punctuation restoration

J Yi, J Tao, Y Bai, Z Tian, C Fan - arXiv preprint arXiv:2004.00248, 2020 - arxiv.org
Previous studies demonstrate that word embeddings and part-of-speech (POS) tags are
helpful for punctuation restoration tasks. However, two drawbacks still exist. One is that word …