Pkuseg: A toolkit for multi-domain chinese word segmentation

R Luo, J Xu, Y Zhang, Z Zhang, X Ren… - arXiv preprint arXiv …, 2019 - arxiv.org
Chinese word segmentation (CWS) is a fundamental step of Chinese natural language
processing. In this paper, we build a new toolkit, named PKUSEG, for multi-domain word …

Advance research in agricultural text-to-speech: the word segmentation of analytic language and the deep learning-based end-to-end system

X Li, D Ma, B Yin - Computers and Electronics in Agriculture, 2021 - Elsevier
Abstract Agricultural Text-to-Speech (TTS) has attracted increasingly more attention. The
application of agricultural TTS and its problems are analyzed in this paper, and the …

Improving Chinese word segmentation with wordhood memory networks

Y Tian, Y Song, F Xia, T Zhang… - Proceedings of the 58th …, 2020 - aclanthology.org
Contextual features always play an important role in Chinese word segmentation (CWS).
Wordhood information, being one of the contextual features, is proved to be useful in many …

Neural word segmentation with rich pretraining

J Yang, Y Zhang, F Dong - arXiv preprint arXiv:1704.08960, 2017 - arxiv.org
Neural word segmentation research has benefited from large-scale raw texts by leveraging
them for pretraining character and word embeddings. On the other hand, statistical …

[PDF][PDF] Transition-based neural word segmentation

M Zhang, Y Zhang, G Fu - … of the 54th Annual Meeting of the …, 2016 - aclanthology.org
Character-based and word-based methods are two main types of statistical models for
Chinese word segmentation, the former exploiting sequence labeling models over …

Fast and accurate neural word segmentation for Chinese

D Cai, H Zhao, Z Zhang, Y Xin, Y Wu… - arXiv preprint arXiv …, 2017 - arxiv.org
Neural models with minimal feature engineering have achieved competitive performance
against traditional methods for the task of Chinese word segmentation. However, both …

Subword encoding in lattice LSTM for Chinese word segmentation

J Yang, Y Zhang, S Liang - arXiv preprint arXiv:1810.12594, 2018 - arxiv.org
We investigate a lattice LSTM network for Chinese word segmentation (CWS) to utilize
words or subwords. It integrates the character sequence features with all subsequences …

Toward fast and accurate neural chinese word segmentation with multi-criteria learning

W Huang, X Cheng, K Chen, T Wang… - arXiv preprint arXiv …, 2019 - arxiv.org
The ambiguous annotation criteria lead to divergence of Chinese Word Segmentation
(CWS) datasets in various granularities. Multi-criteria Chinese word segmentation aims to …

Chinese word segmentation: Another decade review (2007-2017)

H Zhao, D Cai, C Huang, C Kit - arXiv preprint arXiv:1901.06079, 2019 - arxiv.org
This paper reviews the development of Chinese word segmentation (CWS) in the most
recent decade, 2007-2017. Special attention was paid to the deep learning technologies …

Improving semantic relevance for sequence-to-sequence learning of chinese social media text summarization

S Ma, X Sun, J Xu, H Wang, W Li, Q Su - arXiv preprint arXiv:1706.02459, 2017 - arxiv.org
Current Chinese social media text summarization models are based on an encoder-decoder
framework. Although its generated summaries are similar to source texts literally, they have …