Unbounded length contexts for PPM JG Cleary, WJ Teahan The Computer Journal 40 (2_and_3), 67-75, 1997 | 515 | 1997 |
A compression-based algorithm for Chinese word segmentation WJ Teahan, Y Wen, R McNab, IH Witten Computational Linguistics 26 (3), 375-393, 2000 | 212 | 2000 |
Using compression-based language models for text categorization WJ Teahan, DJ Harper Language modeling for information retrieval, 141-165, 2003 | 179 | 2003 |
A repetition based measure for verification of text collections and for text categorization DV Khmelev, WJ Teahan Proceedings of the 26th annual international ACM SIGIR conference on …, 2003 | 129 | 2003 |
Text classification and segmentation using minimum cross-entropy WJ Teahan Content-Based Multimedia Information Access-Volume 2, 943-961, 2000 | 126 | 2000 |
Modelling english text WJ Teahan University of Waikato, 1998 | 113 | 1998 |
The entropy of English using PPM-based models WJ Teahan, JG Cleary Proceedings of Data Compression Conference-DCC'96, 53-62, 1996 | 105 | 1996 |
Text mining: A new frontier for lossless compression IH Witten, Z Bray, M Mahoui, B Teahan Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096), 198-207, 1999 | 98 | 1999 |
Universal text preprocessing for data compression J Abel, W Teahan IEEE Transactions on Computers 54 (5), 497-507, 2005 | 72 | 2005 |
Models of English text WJ Teahan, JG Cleary Proceedings DCC'97. Data Compression Conference, 12-21, 1997 | 62 | 1997 |
Probability estimation for PPM WJ Teahan the NZ Comp. Sci. Research Students' Conf., 1995, 1995 | 58 | 1995 |
Artificial Intelligence–Agents and Environments WJ Teahan Bookboon, 2010 | 46 | 2010 |
Enhancing the stability of organic photovoltaics through machine learning TW David, H Anizelli, TJ Jacobsson, C Gray, W Teahan, J Kettle Nano Energy 78, 105342, 2020 | 45 | 2020 |
Correcting English text using PPM models WJ Teahan, S Inglis, JG Cleary, G Holmes Proceedings DCC'98 Data Compression Conference (Cat. No. 98TB100225), 289-298, 1998 | 42 | 1998 |
Peer-to-Peer Protocols for Resource Discovery in the Grid. NA Al-Dmour, WJ Teahan Parallel and Distributed Computing and Networks, 319-324, 2005 | 39 | 2005 |
Storyboarding for visual analytics R Walker, L Ap Cenydd, S Pop, HC Miles, CJ Hughes, WJ Teahan, ... Information Visualization 14 (1), 27-50, 2015 | 38 | 2015 |
Using language models for generic entity extraction IH Witten, Z Bray, M Mahoui, WJ Teahan Proceedings of the ICML Workshop on Text Mining, 14, 1999 | 36 | 1999 |
Experiments on the zero frequency problem JG Cleary, WJ Teahan Proc. Data Compression Conference 480, 1995 | 35 | 1995 |
Artificial Intelligence–Agent Behaviour WJ Teahan bookboon, 2010 | 33 | 2010 |
Experimental evaluation of Arabic OCR systems M Alghamdi, W Teahan PSU Research Review 1 (3), 229-241, 2017 | 29 | 2017 |