Web page classification: Features and algorithms

X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …

Methods and apparatus for distributed community finding

AHYM Muntz, BA Rezaei - US Patent 8,825,654, 2014 - Google Patents
Methods and apparatus for a new approach to the problem of finding communities in
complex networks relating to a social definition of communities and percolation are …

A computational study of mental health awareness campaigns on social media

K Saha, J Torous, SK Ernala, C Rizuto… - Translational …, 2019 - academic.oup.com
As public discourse continues to progress online, it is important for mental health advocates,
public health officials, and other curious parties and stakeholders, ranging from researchers …

Combining fasttext and glove word embedding for offensive and hate speech text detection

N Badri, F Kboubi, AH Chaibi - Procedia Computer Science, 2022 - Elsevier
Over the past decade, increased use of social media has led to an increase in hate content.
To address this issue new solutions must be implemented to filter out this kind of …

[BOOK][B] Real life applications of soft computing

A Shukla, R Tiwari, R Kala - 2010 - books.google.com
Rapid advancements in the application of soft computing tools and techniques have proven
valuable in the development of highly scalable systems. Although many resources on the …

Concept-level user intent profile extraction and applications

BA Rezaei, V Roychowdhury, S Ghate… - US Patent …, 2019 - Google Patents
Methods and systems for extracting intents and intent profiles of users, as inferred from the
different activities they execute and data they share on social media sites, and then (i) …

Document–document similarity approaches and science mapping: Experimental comparison of five approaches

P Ahlgren, C Colliander - Journal of informetrics, 2009 - Elsevier
This paper treats document–document similarity approaches in the context of science
mapping. Five approaches, involving nine methods, are compared experimentally. We …

Link‐based similarity measures for the classification of Web documents

P Calado, M Cristo, MA Gonçalves… - Journal of the …, 2006 - Wiley Online Library
Traditional text‐based document classifiers tend to perform poorly on the Web. Text in Web
documents is usually noisy and often does not contain enough information to determine their …

Categorisation of web pages for protection against inappropriate content in the internet

I Kotenko, A Chechulin… - International Journal of …, 2017 - inderscienceonline.com
The paper outlines a framework for automated categorisation of web pages to protect
against inappropriate content. The paper contains the framework overview, analysis of state …

Knowing a web page by the company it keeps

X Qi, BD Davison - Proceedings of the 15th ACM international …, 2006 - dl.acm.org
Web page classification is important to many tasks in information retrieval and web mining.
However, applying traditional textual classifiers on web data often produces unsatisfying …