 |
 | Using language models for generic entity extraction Witten, I. H., Bray, Z., Mahoui, M., Teahan, W. J. (1999) Proc ICML’99 Workshop on Machine Learning in Text Data Analysis, edited by D. Mladenic and M. Grobelnik, Bled, Slovenia, pp 25-35. |
 | A compression-based algorithm for Chinese word segmentation Teahan, W. J., Wen, Y., McNab, R. J., Witten, I. H. (2000) Computational Linguistics 26(3) 375-393, September. |
 | Text mining: a new frontier for lossless compression Witten, I. H., Bray, Z., Mahoui, M., Teahan, W. J. (1999) Proc Data Compression Conference, 198-207, IEEE Press, Los Alamitos, CA. |
|