Automatic web pages categorization with ReliefF and hidden naive Bayes

X Jin, R Li, X Shen, R Bie - Proceedings of the 2007 ACM symposium on …, 2007 - dl.acm.org
A great challenge of web mining arises from the increasingly large web pages and the high
dimensionality associated with natural language. Since classifying web pages of an …

Cleaning web pages for effective web content mining

J Li, CI Ezeife - International Conference on Database and Expert …, 2006 - Springer
Classifying and mining noise-free web pages will improve on accuracy of search results as
well as search speed, and may benefit web-page organization applications (e.g., keyword …

A study on automatic web pages categorization

S Bo, S Qiurui, C Zhong… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
Since the Internet has become a huge repository of information, many studies address the
issue of web pages categorization. For web page classification, we want to find a subset of …

Web classification using support vector machine

A Sun, EP Lim, WK Ng - Proceedings of the 4th international workshop …, 2002 - dl.acm.org
In web classification, web pages from one or more web sites are assigned to pre-defined
categories according to their content. Since web pages are more than just plain text …

Web classification approach using reduced vector representation model based on HTML tags

VRMBON HTML - Journal of Theoretical and Applied Information …, 2013 - academia.edu
Automatic web page classification plays an essential role in information retrieval, web
mining and web semantics applications. Web pages have special characteristics (such as …

UPCA: an efficient URL-pattern based algorithm for accurate web page classification

Y Yang, L Zhang, G Liu, E Chen - 2015 12th International …, 2015 - ieeexplore.ieee.org
With the explosive growth of Web pages appearing in the Internet and mobile Internet, it is
quite challenging for Web search engines to provide users with desirable results from large …

A novel framework for web page classification using two-stage neural network

Y Li, Y Cao, Q Zhu, Z Zhu - Advanced Data Mining and Applications: First …, 2005 - Springer
Web page classification is one of the essential techniques for Web mining. This paper
presents a framework for Web page classification. It is hybrid architecture of neural network …

Text learning and hierarchical feature selection in webpage classification

X Peng, Z Ming, H Wang - Advanced Data Mining and Applications: 4th …, 2008 - Springer
One of the solutions of retrieving information from the Internet is by classifying web pages
automatically. In almost all classification methods that have been published, feature …

Web page classification using modified naïve Bayesian approach

GS Tomar, S Verma, A Jha - TENCON 2006-2006 IEEE Region …, 2006 - ieeexplore.ieee.org
This paper introduces the concept of a classification tool for Web pages called WebClassify,
which uses modified naive Bayesian algorithm with multinomial model to classify pages into …

A comparison study: web pages categorization with Bayesian classifiers

Z Fu, C Chen, Y Gong, R Bie - 2008 10th IEEE International …, 2008 - ieeexplore.ieee.org
In the recent few years, web mining has become a hotspot of data mining with the
development of Internet. Web pages classification is one of the essential techniques for web …