Cantonese natural language processing in the transformers era: a survey and current challenges

R Xiang, E Chersoni, Y Li, J Li, CR Huang… - Language Resources …, 2024 - Springer
Despite being spoken by a large population of speakers worldwide, Cantonese is under-
resourced in terms of the data scale and diversity compared to other major languages. This
limitation has excluded it from the current “pre-training and fine-tuning” paradigm that is
dominated by Transformer architectures. In this paper, we provide a comprehensive review
on the existing resources and methodologies for Cantonese Natural Language Processing,
covering the recent progress in language understanding, text generation and development …

Cantonese Natural Language Processing in the Transformers Era

R Xiang, M Liao, J Li - … of the 10th SIGHAN Workshop on Chinese …, 2024 - aclanthology.org
Despite being spoken by a large population of speakers worldwide, Cantonese is under-
resourced in terms of the data scale and diversity compared to other major languages. This
limitation has excluded it from the current “pre-training and fine-tuning” paradigm that is
dominated by Transformer architectures. In this paper, we provide a comprehensive review
on the existing resources and methodologies for Cantonese Natural Language Processing,
covering the recent progress in language understanding, text generation and development …
以上显示的是最相近的搜索结果。 查看全部搜索结果