Neural machine translation of logographic languages using sub-character level information

L Zhang, M Komachi - arXiv preprint arXiv:1809.02694, 2018 - arxiv.org
Recent neural machine translation (NMT) systems have been greatly improved by encoder-
decoder models with attention mechanisms and sub-word units. However, important …

Using sub-character level information for neural machine translation of logographic languages

L Zhang, M Komachi - Transactions on Asian and Low-Resource …, 2021 - dl.acm.org
Logographic and alphabetic languages (eg, Chinese vs. English) have different writing
systems linguistically. Languages belonging to the same writing system usually exhibit more …

Improving Chinese word representation using four corners features

H Jin, Z Zhang, P Yuan - IEEE Transactions on Big Data, 2021 - ieeexplore.ieee.org
Intuitively, word representation for logographic languages like Chinese can be enhanced by
its internal characteristics. Several research endeavors tried to learn Chinese word …

Resource development to prevent riots at mass events

LI Voronova, RV Tolmachev… - … in the Field of on Board …, 2018 - ieeexplore.ieee.org
Currently, the main communication means and management of the participants behavior in
mass events, including unauthorized ones, are portable mobile devices, through which, in …

Six-Writings multimodal processing with pictophonetic coding to enhance Chinese language models

L Weigang, MC Marinho, DL Li… - Frontiers of Information …, 2024 - Springer
While large language models (LLMs) have made significant strides in natural language
processing (NLP), they continue to face challenges in adequately addressing the intricacies …

Word vector processing for foreign languages

S Cao, X Li - US Patent 10,430,518, 2019 - Google Patents
A word vector processing method is provided. Word segmentation is performed on a corpus
to obtain words, and n-gram strokes corresponding to the words are determined. Each n …

Word vector processing for foreign languages

S Cao, X Li - US Patent 10,878,199, 2020 - Google Patents
(57) ABSTRACT A word vector processing method is provided. Word seg mentation is
performed on a corpus to obtain words, and n-gram strokes corresponding to the words are …