A systematic review of current trends in web content mining

MO Samuel, AI Tolulope… - Journal of Physics …, 2019 - iopscience.iop.org
Abstract Knowledge in web documents, Relevance ranking of webpages and so on are
some of the under-researched areas in web content mining (WCM). Apart from the general …

Fid: A faster image distribution system for docker platform

W Kangjin, Y Yong, L Ying, L Hanmei… - 2017 IEEE 2nd …, 2017 - ieeexplore.ieee.org
Docker has been widely adopted in enterprise-level container environment. As an important
part of Docker-based container ecosystem, Docker Registry provides the service of storing …

Speedreader: Reader mode made fast and private

M Ghasemisharif, P Snyder, A Aucinas… - The World Wide Web …, 2019 - dl.acm.org
Most popular web browsers include “reader modes” that improve the user experience by
removing un-useful page elements. Reader modes reformat the page to hide elements that …

All in One Place: Ensuring Usable Access to Online Shopping Items for Blind Users

Y Prakash, AK Nayak, M Sunkara… - Proceedings of the …, 2024 - dl.acm.org
Perusing web data items such as shopping products is a core online user activity. To prevent
information overload, the content associated with data items is typically dispersed across …

Language independent web news extraction system based on text detection framework

YC Wu - Information Sciences, 2016 - Elsevier
Web news provides a direct and efficient way to construct large text corpora. The creation of
text data requires an understanding of HTML code and the preparation of customized …

AutoDesc: Facilitating Convenient Perusal of Web Data Items for Blind Users

Y Prakash, M Sunkara, HN Lee, S Jayarathna… - Proceedings of the 28th …, 2023 - dl.acm.org
Web data items such as shopping products, classifieds, and job listings are indispensable
components of most e-commerce websites. The information on the data items are typically …

Specification and discovery of web patterns: a graph grammar approach

A Roudaki, J Kong, K Zhang - Information Sciences, 2016 - Elsevier
Finding useful information from the Web becomes increasingly difficult as the volume of Web
data rapidly grows. To facilitate effective Web browsing, Web designers usually display the …

An efficient language-independent method to extract content from news webpages

E Cardoso, I Jabour, E Laber, R Rodrigues… - Proceedings of the 11th …, 2011 - dl.acm.org
We tackle the task of news webpage segmentation, specifically identifying the news title,
publication date and story body. While there are very good results in the literature, most of …

An efficient content extraction method for webpage based on tag-line-block analysis

Z Chen, J Zhou, R Sun - Soft Computing, 2023 - Springer
Abstract World Wide Web is a vast information resource that can be used in a broad range of
applications. Web content is an efficient way to derive valuable information from webpages …

An FAR-SW based approach for webpage information extraction

Z Bu, C Zhang, Z Xia, J Wang - Information Systems Frontiers, 2014 - Springer
Automatically identifying and extracting the target information of a webpage, especially main
text, is a critical task in many web content analysis applications, such as information retrieval …