[图书][B] Register variation online

D Biber, J Egbert - 2018 - books.google.com
While other books focus on special internet registers, like tweets or texting, no previous
study describes the full range of everyday registers found on the searchable web. These are …

Register variation on the searchable web: A multi-dimensional analysis

D Biber, J Egbert - Journal of English Linguistics, 2016 - journals.sagepub.com
Most previous linguistic investigations of the web have focused on special linguistic features
associated with Internet language (eg, the use of emoticons, abbreviations, contractions …

Exploring the composition of the searchable web: A corpus-based taxonomy of web registers

D Biber, J Egbert, M Davies - Corpora, 2015 - euppublishing.com
One major challenge for Web-As-Corpus research is that a typical Web search provides little
information about the register of the documents that are searched. Previous research has …

Developing a bottom‐up, user‐based method of web register classification

J Egbert, D Biber, M Davies - Journal of the Association for …, 2015 - Wiley Online Library
This paper introduces a project to develop a reliable, cost‐effective method for classifying
Internet texts into register categories, and apply that approach to the analysis of a large …

[PDF][PDF] The Web Library of Babel: evaluating genre collections.

S Sharoff, Z Wu, K Markert - LREC, 2010 - Citeseer
We present experiments in automatic genre classification on web corpora, comparing a wide
variety of features on several different genreannotated datasets (HGC, I-EN, KI-04, KRYS-I …

Untangling the unrestricted web: Automatic identification of multilingual registers

E Henriksson, A Myntti, A Eskelinen… - arXiv preprint arXiv …, 2024 - arxiv.org
This article explores deep learning models for the automatic identification of registers-text
varieties such as news reports and discussion forums-in web-based datasets across 16 …

Using grammatical features for automatic register identification in an unrestricted corpus of documents from the open web

D Biber, J Egbert - … of Research Design and Statistics in …, 2015 - journal.equinoxpub.com
Most previous attempts at automatic genre identification have been based on corpus
samples that are relatively small and artificially restricted. In this study we set out to …

Open-set classification for automated genre identification

DA Pritsos, E Stamatatos - … Retrieval: 35th European Conference on IR …, 2013 - Springer
Abstract Automated Genre Identification (AGI) of web pages is a problem of increasing
importance since web genre (eg blog, news, e-shops, etc.) information can enhance modern …

Cross-testing a genre classification model for the web

M Santini - Genres on the web: Computational models and …, 2011 - Springer
The main aim of the experiments described in this chapter is to investigate ways of
assessing the robustness and stability of an Automatic Genre Identification (AGI) model for …

[图书][B] The written British national corpus 2014: Design, compilation and analysis

A Hawtin - 2019 - search.proquest.com
The ESRC-funded Centre for Corpus Approaches to Social Science at Lancaster University
(CASS) and the English Language Teaching Group at Cambridge University Press (CUP) …