linguistics and natural language processing. After the library design, implementation, corpus
data format, and key datasets included are introduced, the paper provides an overview of
the currently implemented functionality: stop words, handling Jyutping romanization, word
segmentation, part-of-speech tagging, and parsing Cantonese text.