Vietnamese word segmentation with CRFs and SVMs: An investigation

CT Nguyen, TK Nguyen, XH Phan… - Proceedings of the …, 2006 - aclanthology.org
Word segmentation for Vietnamese, like for most Asian languages, is an important task
which has a significant impact on higher language processing levels. However, it has …

Word segmentation for the Myanmar language

TT Thet, JC Na, WK Ko - Journal of information science, 2008 - journals.sagepub.com
This study reports the development of a Myanmar word segmentation method using Unicode
standard encoding. Word segmentation is an essential step prior to natural language …

A hybrid approach to Vietnamese word segmentation using part of speech tags

DD Pham, GB Tran, SB Pham - 2009 International Conference …, 2009 - ieeexplore.ieee.org
Word segmentation is one of the most important tasks in NLP. This task, within Vietnamese
language and its own features, faces some challenges, especially in words boundary …

A proposal of ontology-based health care information extraction system: VnHIES

TQ Dung, W Kameyama - … on research, innovation and vision for …, 2007 - ieeexplore.ieee.org
This paper presents an ontology-based health care information extraction system, VnHIES.
In the system, we develop and use two effective algorithms called "semantic elements …

Vers un modèle d'indexation sémantique adapté aux dossiers médicaux de patients

D Dinh, L Tamine - … francophone en Recherche d'Information et …, 2010 - hal.science
This paper presents a semantic indexing model adapted to electronic patient records.
This model will serve as support for information retrieval processes …

Vietnamese text classification algorithm using long short term memory and Word2Vec

HN Phat, NTM Anh - Информатика и автоматизация, 2020 - proceedings.spiiras.nw.ru
In the context of the ongoing fourth industrial revolution and fast computer science
development the amount of textual information becomes huge. So, prior to applying the …

Developing a Persian chunker using a hybrid approach

S Kiani, T Akhavan, M Shamsfard - … on Computer Science and …, 2009 - ieeexplore.ieee.org
Text segmentation is the process of recognizing boundaries of text constituents, such as
sentences, phrases and words. This paper focuses on phrase segmentation also known as …

An unsupervised learning and statistical approach for Vietnamese word recognition and segmentation

H Le Trung, V Le Anh, K Le Trung - Asian Conference on Intelligent …, 2010 - Springer
There are two main topics in this paper: (i) Vietnamese words are recognized and sentences
are segmented into words by using probabilistic models; (ii) the optimum probabilistic model …

Complex Word Identification in Vietnamese: Towards Vietnamese Text Simplification

P Nguyen, D Kauchak - Proceedings of the Workshop on …, 2022 - aclanthology.org
Text Simplification has been an extensively researched problem in English, but has not
been investigated in Vietnamese. We focus on the Vietnamese-specific Complex Word …

Thai words segmentation using an unsupervised learning technique

J Sunkpho, M Hofmann - … Technology 2020: Proceedings of the 16th …, 2020 - Springer
Word Segmentation or Tokenization is the process of determining the best likely
sequence of words from a sequence of text. For Thai language, word segmentation is not a …