Jurish, Bryan, and Kay-Michael Würzner. “Word and Sentence Tokenization with Hidden Markov Models.” Journal for Language Technology and Computational Linguistics, vol. 28, no. 2, July 2013, pp. 61-83, doi:10.21248/jlcl.28.2013.176.