REHM, Georg. Language-Independent Text Parsing of Arbitrary HTML-Documents. Towards A Foundation For Web Genre Identification. Journal for Language Technology and Computational Linguistics, [S. l.], v. 20, n. 2, p. 53–74, 2005. DOI: 10.21248/jlcl.20.2005.75. Disponível em: https://jlcl.org/article/view/75. Acesso em: 19 jun. 2024.