[PDF][PDF] Text tiling: Segmenting text into multi-paragraph subtopic passages

MA Hearst - Computational linguistics, 1997 - aclanthology.org
TextTiling is a technique for subdividing texts into multi-paragraph units that represent
passages, or subtopics. The discourse cues for identifying major subtopic shifts are patterns …

A critique and improvement of an evaluation metric for text segmentation

L Pevzner, MA Hearst - Computational Linguistics, 2002 - direct.mit.edu
The Pk evaluation metric, initially proposed by Beeferman, Berger, and Lafferty (1997), is
becoming the standard measure for assessing text segmentation algorithms. However, a …

[图书][B] Automatic text summarization

JM Torres-Moreno - 2014 - books.google.com
Textual information in the form of digital documents quickly accumulates to create huge
amounts of data. The majority of these documents are unstructured: it is unrestricted text and …

[图书][B] Topic segmentation: Algorithms and applications

JC Reynar - 1998 - search.proquest.com
Most documents are about more than one subject, but the majority of natural language
processing algorithms and information retrieval techniques implicitly assume that every …

[图书][B] Discourse processing

M Stede - 2012 - books.google.com
Discourse Processing here is framed as marking up a text with structural descriptions on
several levels, which can serve to support many language-processing or text-mining tasks …

Text segmentation based on document understanding for information retrieval

V Prince, A Labadié - International Conference on Application of Natural …, 2007 - Springer
Abstract Information retrieval needs to match relevant texts with a given query. Selecting
appropriate parts is useful when documents are long, and only portions are interesting to the …

A new hybrid summarizer based on vector space model, statistical physics and linguistics

I Da Cunha, S Fernández, P Velázquez Morales… - MICAI 2007: Advances …, 2007 - Springer
In this article we present a hybrid approach for automatic summarization of Spanish medical
texts. There are a lot of systems for automatic summarization using statistics or linguistics …

[PDF][PDF] Cut as a Querying Unit for WWW, Netnews, and E-mail

K Tajima, Y Mizuuchi, M Kitagawa… - Proceedings of the ninth …, 1998 - dl.acm.org
In this paper? WC'propose a query framework for hypcrtext data in general, and for WWW
pages, Netnews articles, and e-mails in particular. In existing query tools for hypertext data …

[PDF][PDF] Thematic Segmentation of Texts: Two Methods for Two Kind of Texts

O Ferret, B Grau, N Masson - 36th Annual Meeting of the …, 1998 - aclanthology.org
To segment texts in thematic units, we present here how a basic principle relying on word
distribution can be applied on different kind of texts. We start from an existing method well …

Text segmentation into paragraphs based on local text cohesion

IA Bolshakov, A Gelbukh - Text, Speech and Dialogue: 4th International …, 2001 - Springer
The problem of automatic text segmentation is subcategorized into two different problems:
thematic segmentation into rather large topically self-contained sections and splitting into …