* BUGFIX: the issue with n-grams (https://github.com/Living-with-machines/DeezyMatch/issues/109)
* Define word token separators in the input file (https://github.com/Living-with-machines/DeezyMatch/issues/78)
* Prefix/suffix parameter moved from preprocessing to the mode, as it applies to subword tokenization
* Generating vectors no longer requires three-column inputs. A single column is enough (three columns are still accepted for backward compatibility)
* Add specific datasets for each DeezyMatch functionality and edit the README file
* `normalizeString` and `string_split` functions are reviewed
* Improve documentation
* Several tests are added