Neural natural language generation: A survey on multilinguality, multimodality, controllability and learning

E Erdem, M Kuyu, S Yagcioglu, A Frank… - Journal of Artificial …, 2022 - jair.org
Developing artificial learning systems that can understand and generate natural language
has been one of the long-standing goals of artificial intelligence. Recent decades have …

: Increasing GPU Utilization during Generative Inference for Higher Throughput

Y Jin, CF Wu, D Brooks, GY Wei - Advances in Neural …, 2023 - proceedings.neurips.cc
Generating texts with a large language model (LLM) consumes massive amounts of
memory. Apart from the already-large model parameters, the key/value (KV) cache that …

Neural hidden Markov model for machine translation

W Wang, D Zhu, T Alkhouli, Z Gan… - Proceedings of the 56th …, 2018 - aclanthology.org
Attention-based neural machine translation (NMT) models selectively focus on specific
source positions to produce a translation, which brings significant improvements over pure …

Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR

J Jiang, Y Gao, Z Tuske - arXiv preprint arXiv:2311.14835, 2023 - arxiv.org
In this paper, we aim to create weak alignment supervision to aid the end-to-end modeling.
Towards this end, we use the existing hybrid ASR system to produce triphone alignments of …

Fast bilingual grapheme-to-phoneme conversion

HY Kim, JH Kim, JM Kim - Proceedings of the 2022 Conference of …, 2022 - aclanthology.org
Autoregressive transformer (ART)-based grapheme-to-phoneme (G2P) models have been
proposed for bi/multilingual text-to-speech systems. Although they have achieved great …

Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search Degradation

I Provilkov, A Malinin - arXiv preprint arXiv:2109.06253, 2021 - arxiv.org
Neural Machine Translation (NMT) is known to suffer from a beam-search problem: after a
certain point, increasing beam size causes an overall drop in translation quality. This effect …

Tigrigna-English Bidirectional Machine Translation using Deep Learning

F Hailu - 2024 - repository.smuc.edu.et
A language can be described by its rules or its symbols. Making computers understand
sentences or words written in human languages is the goal of natural language processing …

The Implicit Length Bias of Label Smoothing on Beam Search Decoding

B Liang, P Wang, Y Cao - arXiv preprint arXiv:2205.00659, 2022 - arxiv.org
Label smoothing is ubiquitously applied in Neural Machine Translation (NMT) training.
While label smoothing offers a desired regularization effect during model training, in this …

Length-constrained neural machine translation using length prediction and perturbation into length-aware positional encoding

Y Oka, K Sudoh, S Nakamura - Journal of Natural Language …, 2021 - jstage.jst.go.jp
Neural machine translation often suffers from an under-translation problem owing to its
limited modeling of the output sequence lengths. In this study, we propose a novel approach …

Sebat Bet Gurage (Chaha)-Amharic Machine Translation using Deep Learning

D Yirga - 2024 - repository.smuc.edu.et
Natural Language Processing (NLP) is defined as a method for computers to intelligently
analyze, understand, and derive meaning from human language. Machine translation is a …