Tokenization is widely regarded as a solved problem due to the high accuracy that rule-based tokenizers achieve. But rule-based tokenizers are hard to maintain and their rules …
FP Such, D Peri, F Brockler, H Paul… - 2018 16th International …, 2018 - ieeexplore.ieee.org
Handwritten text recognition is challenging because of the virtually infinite ways a human can write the same message. Our fully convolutional handwriting model takes in a …
The task of automatic text summarization consists of generating a summary of the original text that allows the user to obtain the main pieces of information available in that text, but …
Automated discourse analysis tools based on Natural Language Processing (NLP) aiming at the diagnosis of language-impairing dementias generally extract several textual metrics of …
This paper is motivated by the automation of neuropsychological tests involving discourse analysis in the retellings of narratives by patients with potential cognitive impairment. In this …
DF Wong, LS Chao, X Zeng - The Scientific World Journal, 2014 - Wiley Online Library
A sentence boundary detection (SBD) system is normally quite sensitive to the genre of the data it is trained on. Genre here often refers to shifts of text topics and …
R López, TAS Pardo - … Linguistics and Intelligent Text Processing: 16th …, 2015 - Springer
Sentence Boundary Detection (SBD) is a very important prerequisite for proper sentence analysis in different Natural Language Processing tasks. During the last years …
Sentence segmentation, or tokenization, is a primary text-processing step in natural language processing [1]. To begin processing each token of words, we need to detect whether those …
H Shaw - US Patent 9,183,323, 2015 - Google Patents
Field of Classification Search: 707/766. See application file for complete search … Primary Examiner: … (74) Attorney, Agent, or Firm: Fish & Richardson PC