Text-scrubber

Latest version: v0.4.2

Safety actively analyzes 626513 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 2

0.4.2

-----

*(2023-04-14)*

- Changed ownership of and moved repository from `Slimmer-AI` to `sybrenjansen`

0.4.1

-----

*(2022-05-27)*

- Documentation issue prevented GitHub from uploading ``text-scrubber`` to PyPI

0.4.0

-----

*(2022-05-27)*

- Common country replacements have been updated to allow for fuzzy matching
- Country codes added such that they can be matched through :meth:`text_scrubber.geo.normalize_country`
- The geo normalize functions now return the canonical name together with the matched name
- The geo normalize functions now return a :class:`text_scrubber.geo.normalize.Location` object
- The geo find in string functions now return a :class:`text_scrubber.geo.find_in_string.ExtractedLocation` object

0.3.2

-----

*(2022-05-19)*

- Updated MANIFEST.in file to include the Cython files

0.3.1

-----
*(2022-05-19)*

- Optimized Levenshtein and trigram similarity functions, and all normalization functions. Speed-ups of x20-40 are to be
expected

0.3.0

-----

*(2022-04-13)*

- Renamed `normalize_state` to :meth:`text_scrubber.geo.normalize_region`, as it now handles all kinds of regions
- Expanded countries, regions, and cities with geonames database, increasing the completeness of the geo database
- :meth:`text_scrubber.geo.normalize_country`, :meth:`text_scrubber.geo.normalize_region`, and
:meth:`text_scrubber.geo.normalize_city` now return the match scores as well
- :meth:`text_scrubber.geo.normalize_region` and :meth:`text_scrubber.geo.normalize_city` also return the corresponding
normalized country
- Added :meth:`text_scrubber.geo.find_country_in_string`, :meth:`text_scrubber.geo.find_city_in_string`, and
:meth:`text_scrubber.geo.find_region_in_string` functions that find a location in a string
- Updated cleaning pipeline of :meth:`text_scrubber.geo.clean_country`, :meth:`text_scrubber.geo.clean_city`, and
:meth:`text_scrubber.geo.clean_region`
- Added ``case_sensitive`` boolean flag to :meth:`text_scrubber.text_scrubber.TextScrubber.remove_stop_words`
- Improved speed of trigram matching by mapping trigrams to integer indices

Page 1 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.