PyCantonese: Cantonese linguistics and NLP in python- 学术资源搜索

文章

学术资源搜索

PyCantonese: Cantonese linguistics and NLP in python

J Lee, L Chen, C Lam, CM Lau… - Proceedings of the …, 2022 - aclanthology.org

Proceedings of the thirteenth language resources and evaluation …, 2022•aclanthology.org

This paper introduces PyCantonese, an open-source Python library for Cantonese
linguistics and natural language processing. After the library design, implementation, corpus
data format, and key datasets included are introduced, the paper provides an overview of
the currently implemented functionality: stop words, handling Jyutping romanization, word
segmentation, part-of-speech tagging, and parsing Cantonese text.

Abstract

This paper introduces PyCantonese, an open-source Python library for Cantonese linguistics and natural language processing. After the library design, implementation, corpus data format, and key datasets included are introduced, the paper provides an overview of the currently implemented functionality: stop words, handling Jyutping romanization, word segmentation, part-of-speech tagging, and parsing Cantonese text.

aclanthology.org

展开收起

被引用次数：18 相关文章所有 6 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

PyCantonese: Cantonese linguistics and NLP in python

引用