[PDF][PDF] Contrastive approach towards text source classification based on top-bag-of-word similarity

CR Huang, LH Lee - Proceedings of the 22nd pacific asia …, 2008 - aclanthology.org
This paper proposes a method to automatically classify texts from different varieties of the
same language. We show that similarity measure is a robust tool for studying comparable …

Revisiting the automatic prediction of lexical errors in Mandarin

M Allassonnière-Tang, IP Wan - Linguistics Vanguard, 2024 - degruyter.com
Speech errors provide cues for explaining the process of word retrieval. For example,
speech errors are less likely to occur with high-frequency words since these words already …

Overview of the IALP 2016 shared task on dimensional sentiment analysis for Chinese words

LC Yu, LH Lee, KF Wong - 2016 International Conference on …, 2016 - ieeexplore.ieee.org
This paper presents the IALP 2016 shared task on Dimensional Sentiment Analysis for
Chinese Words (DSAW) which seeks to identify a real-value sentiment score of Chinese …

Practical and Robust Chinese Word Segmentation and PoS Tagging

CR Huang - … : Data Collection, Linguistic Analysis, Annotation and …, 2023 - Springer
The ability to automatically segment and PoS tag any Chinese text at any time with high
accuracy and recall is a prerequisite for the online processing of Chinese texts. While this …

Chinese preposition selection for grammatical error diagnosis

HH Huang, YC Shao, HH Chen - Proceedings of COLING 2016 …, 2016 - aclanthology.org
Misuse of Chinese prepositions is one of common word usage errors in grammatical error
diagnosis. In this paper, we adopt the Chinese Gigaword corpus and HSK corpus as L1 and …

The effect of word frequency and position-in-utterance in Mandarin speech errors: a connectionist model of speech production

IP Wan, M Allassonnière-Tang - … : 21st Workshop, CLSW 2020, Hong Kong …, 2021 - Springer
The connectionist model of speech processing infers that word frequency and position-in-
utterance play a major role in the occurrence of speech errors. First, words that are not …

Predicting speech errors in Mandarin based on word frequency

M Tang, IP Wan - From Minimal Contrast to Meaning Construct: Corpus …, 2020 - Springer
This paper investigates the effect of word frequency on the occurrence of speech errors in
Mandarin. A corpus of 390 speech errors along with their surrounding linguistic context was …

Toward a professional platform for Chinese character conversion

T Hao, C Zhu - ACM Transactions on Asian Language Information …, 2013 - dl.acm.org
Increasing communication among Chinese-speaking regions using respectively traditional
and simplified Chinese character systems has highlighted the subtle-yet-extensive …

Simplified-traditional Chinese character conversion based on multi-data resources: Towards a fused conversion algorithm

T Hao, C Zhu - The 2nd International Conference on Next …, 2011 - ieeexplore.ieee.org
In recent years, communication between Chinese communities in different parts of the world
has been on a constant increase. However, between the traditional Chinese character used …

The effect of part-of-speech on Mandarin speech recognition

C Gong, X Li, X Wu - 2013 Asia-Pacific Signal and Information …, 2013 - ieeexplore.ieee.org
This paper concentrates on the effect of part-of-speech on Mandarin speech recognition by
incorporating it into language model and pronunciation dictionary. This work is motivated by …