performance on the identification of topical words in texts, which is based on a probabilistic
model of text categorization. We use texts which are not explicitly structured. A text structure
is identified by measuring the similarity between segments comprising the text and its title. It
is shown that a text structure thus identified gives a good clue to finding out parts of the text
most relevant to its content. The significance of exploiting information on the structure for …