D Biber, J Egbert - Journal of English Linguistics, 2016 - journals.sagepub.com
Most previous linguistic investigations of the web have focused on special linguistic features associated with Internet language (eg, the use of emoticons, abbreviations, contractions …
One major challenge for Web-As-Corpus research is that a typical Web search provides little information about the register of the documents that are searched. Previous research has …
This paper introduces a project to develop a reliable, cost‐effective method for classifying Internet texts into register categories, and apply that approach to the analysis of a large …
S Sharoff, Z Wu, K Markert - LREC, 2010 - Citeseer
We present experiments in automatic genre classification on web corpora, comparing a wide variety of features on several different genreannotated datasets (HGC, I-EN, KI-04, KRYS-I …
E Henriksson, A Myntti, A Eskelinen… - arXiv preprint arXiv …, 2024 - arxiv.org
This article explores deep learning models for the automatic identification of registers-text varieties such as news reports and discussion forums-in web-based datasets across 16 …
D Biber, J Egbert - … of Research Design and Statistics in …, 2015 - journal.equinoxpub.com
Most previous attempts at automatic genre identification have been based on corpus samples that are relatively small and artificially restricted. In this study we set out to …
DA Pritsos, E Stamatatos - … Retrieval: 35th European Conference on IR …, 2013 - Springer
Abstract Automated Genre Identification (AGI) of web pages is a problem of increasing importance since web genre (eg blog, news, e-shops, etc.) information can enhance modern …
M Santini - Genres on the web: Computational models and …, 2011 - Springer
The main aim of the experiments described in this chapter is to investigate ways of assessing the robustness and stability of an Automatic Genre Identification (AGI) model for …
The ESRC-funded Centre for Corpus Approaches to Social Science at Lancaster University (CASS) and the English Language Teaching Group at Cambridge University Press (CUP) …