Textgenrnn

Latest version: v2.0.0

Safety actively analyzes 682441 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

1.2.2

* Make `is_csv` work for real downstream.
* Description tweaks

1.2.1

* Added `validation` to disable validation training for speed.
* Added `is_csv`: Use with `train_from_file` if the source file is a one-column CSV (e.g. an export from BigQuery or Google Sheets) for proper quote/newline escaping.
* README tweaks

1.2

* Renamed `prop_keep` to `train_size`, and will use the remaining data for validation.
* Added `dropout`, which randomly excludes input tokens each epoch.

1.1

- Switched to a `fit_generator` implementation of generating sequences for training, instead of loading all sequences into memory. This will allow training large text files (10MB+) without requiring ridiculous amounts of RAM.
- Better `word_level` support:
- The model will only keep `max_words` words and discard the rest.
- The model will not train to predict words not in the vocabulary
- All punctuation (including smart quotes) are their own token.
- When generating, newlines/tabs have surrounding whitespace stripped. (this is not the case for other punctuation as there are too many rules around that)
- Training on single text no longer uses meta tokens to indicate the start/end of the text and does not use them when generating, which results in slightly better output.

1.0

First release after the major refactor.

Page 2 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.