collections. We evaluate four different measures of subject similarity, derived from the Web
link structure, and determine how accurate they are in predicting document categories.
Using a Bayesian network model, we combine these measures with the results obtained by
traditional content-based classifiers. Experiments on a Web directory show that best results
are achieved when links from pages outside the directory are considered. Link information …