An Improved VIPS-based Algorithm of Extracting Web Content

L Li, AM Zhou, Y Fang, L Liu, Q Wu - Applied Mechanics and …, 2014 - Trans Tech Publ
The paper studies the VIPS algorithm, and improves VIPS which has the deficiency with
complex rules and low performance, according that the Web page has the feature of DIV …

Research on data mining technology in web based on the cloud computing

F Zhang, L Liu - Advanced Materials Research, 2012 - Trans Tech Publ
To improve the data mining efficiency, analyzed existing algorithm for data mining. However,
it has some uncertain knoledge are a major concern in data mining, it is great difficulty for …

An approach of web page information extraction

YH Li, LX Wang, JX Wang, J Yue… - Applied Mechanics and …, 2013 - Trans Tech Publ
The Web has become the largest information source, but the noise content is an inevitable
part in any web pages. The noise content reduces the nicety of search engine and increases …

Web content extraction using clustering with web structure

X Huang, Y Gao, L Huang, Z Zhang, Y Li… - Advances in Neural …, 2017 - Springer
Web content extraction is an essential part of data preprocessing in web information system.
An algorithm for web content extraction based on clustering with web structure is proposed …

An approach for text extraction from web news page

H Mingsheng, J Zhijuan… - 2012 IEEE Symposium on …, 2012 - ieeexplore.ieee.org
With the rapid development of Internet and Web technology, Web page has become a main
carrier of information publishing. In connection with the problems of current complex …

[HTML][HTML] 2 Web Mining Technology

L Bian - journals.riverpublishers.com
Although Web Mining and data mining are related, they belong to different concepts. Data
mining was formally proposed at the 11th International Conference on artificial intelligence …

Research of web classification mining based on classify support vector machine

M Gao, J Tian, S Zhou - 2009 ISECS international colloquium …, 2009 - ieeexplore.ieee.org
With the development and widely used of Internet and information technology, the Web has
become one of the most important means to obtain information for people. According to the …

[PDF][PDF] New Path Filling Method on Data Preprocessing in Web Mining.

C Zhang, L Zhuang - Comput. Inf. Sci., 2008 - Citeseer
The article discusses the importance of data preprocessing in web mining and gives the
topology structure for the website in the view of actual condition, analyzes the limitation of …

Research on Web information extraction based on spider algorithm and DOM thinking

X Han, XD Li, Q Zheng - 2010 International Conference on …, 2010 - ieeexplore.ieee.org
The structure characteristics of the website is complicated, Web information structure is not
fixed and not neat, so it is inefficient that the Web information is captured largely, the …

[引用][C] Survey of Web information extraction technologies

Z Chen, DM Zhang - Jisuanji Yingyong Yanjiu, 2010 - Sichuan Research Center of …