****************************
* Added the :code:`naive` tokenizer option for tokenization in :code:`preprocessing.py`, so that no external
  libraries need to be downloaded. The naive tokenizer does two things: it replaces each word boundary with a
  single whitespace character, and it collapses every run of whitespace into a single whitespace character.
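
The two steps described above can be sketched with the standard :code:`re` module. This is only an illustration, not the code in :code:`preprocessing.py`: the function names :code:`naive_normalize` and :code:`naive_tokenize` are hypothetical, "word boundary" is assumed to mean the regex :code:`\b` anchor, and the final split on whitespace is an added assumption about how tokens are then obtained.

```python
import re


def naive_normalize(text: str) -> str:
    """Apply the two naive steps to a string (hypothetical helper)."""
    # Step 1: insert a single space at every word boundary
    # (assumption: "word boundary" means the regex \b anchor).
    spaced = re.sub(r"\b", " ", text)
    # Step 2: collapse every run of whitespace into a single space.
    return re.sub(r"\s+", " ", spaced)


def naive_tokenize(text: str) -> list:
    """Split the normalized string on whitespace (assumed final step)."""
    return naive_normalize(text).split()


print(naive_tokenize("Hello, world!"))  # ['Hello', ',', 'world', '!']
```

Because only :code:`re` is used, the sketch needs no downloads. Punctuation ends up as separate tokens because a boundary falls between each word character and the adjacent non-word character.
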
****************************