Grouping words with equivalent substrings by automatic clustering based on suffix relationships

E Gaussier, G Grefenstette, JP Chanod - US Patent 6,308,149, 2001 - Google Patents
(57) ABSTRACT A set of Words of a natural language is grouped by auto matically obtaining
suf? x relation data that indicate a rela tion value for each of a set of relationships betWeen …

System and method for computing a measure of similarity between documents

A Franciosa, CR Dance - US Patent 7,493,322, 2009 - Google Patents
Documents A Partial Document, Was The AfExactlatch Detected Bit ond document using a
first computed percentage indicating what percentage of keyword ratings in the first list also …

Real time detection of topical changes and topic identification via likelihood based methods

D Kanevsky, E Yashchin - US Patent 6,104,989, 2000 - Google Patents
2. Background Description The problem of automatically dividing a text Stream into topically
homogeneous blocks in real time arises in many fields that include a topic detection task …

Information management and retrieval

R Weeks - US Patent 6,338,057, 2002 - Google Patents
Amethod and apparatus is provided for extracting key terms from a data set, the method
includes identifying a? rst set of one or more Word groups of one or more Word that occur …

Systems and methods for identifying collocation errors in text

Y Futagi, PD Deane, MS Chodorow - US Patent 8,473,278, 2013 - Google Patents
Abstract Systems and methods for detecting collocation errors in a text sample using a
reference database from a corpus are provided. Collocation candidates are identified within …

Research and Monitoring Tool to Determine the Likelihood of the Public Finding Information Using a Keyword Search

J Byrne, R Schmidt, J Wei, G Helbling - US Patent App. 11/859,452, 2008 - Google Patents
US20080077577A1 - Research and Monitoring Tool to Determine the Likelihood of the Public
Finding Information Using a Keyword Search - Google Patents US20080077577A1 - Research …

Processing input text to generate the selectivity value of a word or word group in a library of texts in a field is related to the frequency of occurrence of that word or word …

PJ Dehlinger, S Chin - US Patent 7,181,451, 2007 - Google Patents
Disclosed is an automated system, machine-readable storage medium embodying computer-
executable code, and method for generating descriptive words and optionally, multi-word …

System and method for the triage and classification of documents

B Kolo, E Buhain, C Koslow, S Hueseman… - US Patent …, 2011 - Google Patents
A technique is provided for the classification of a document based on a lexicon structured
into categories. Terms in the document may be matched with terms in the lexicon along with …

Method and system for document presentation and analysis

PS Walsh - US Patent 8,739,032, 2014 - Google Patents
(57) ABSTRACT A documentanalysis system receives multiple concepts along with multiple
reference documents and generates sensory indicators that assista researcherinassessing …

Interactive connotative dictionary system

WO Chase - US Patent 6,529,864, 2003 - Google Patents
FILTRATION USER AND INTERFACE REREVAL PROCESSES rality of terms. Connotative
meaning, along with the inten sity of Such meaning are identified using a Statistical model of …