Scribe-data

Latest version: v3.3.0

Safety actively analyzes 682416 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 3

3.0.0

✨ Features

- Scribe-Data now has the ability to generate SQLite databases from formatted language data.
- `data_to_sqlite.py` is used to read available JSON files and input their information into the databases.
- These databases are now sent to Scribe apps via defined paths.
- `send_dbs_to_scribe.py` finds all available language databases and copies them.
- Separating this step from the data update is in preparation for data import in the future where this will be an individual step.
- Scribe-Data now also creates autocomplete lexicons for each language within `data_to_sqlite.py`.
- JSON data is no longer able to be uploaded to Scribe app directories directly, with the SQLite directories now being exported instead.
- Emojis of singular nouns are now also linked to their plural counterparts if the plural isn't present in the emoji keyword outputs.
- The emoji process also now updates a column to the `data_table.txt` file for sharing on readmes with `update_data.py` maintaining it in the data update process.

♻️ Code Refactoring

- The Jupyter notebooks for autosuggestions and emojis as well as `update_data.py` were moved to the `extract_transform` directory given that they're not used to load data anymore.
- Their code was refactored to reflect their new locations.
- Massive amounts of refactoring happened to achieve the shift in the data export method:
- `format_WORD_TYPE.py` files export to a `formatted_data` directory within `extract_transform`.
- Copies of all data JSONs that were originally in Scribe apps are now in the `formatted_data` directories.
- Functions in `update_utils.py` were switched given that data is no longer uploaded into a `Data` directory within the language keyboard directories within Scribe apps.
- Lots of functions and variables were renamed to make them more understandable.
- Code to derive appropriate export locations within `format_WORD_TYPE.py` files was removed in favor of a language `formatted_data` directory.
- regex was added as a dependency.
- pylint comments were removed.
- Verb SPARQL query scripts for Spanish and Italian were simplified to remove unneeded repeat conditions ([7](https://github.com/scribe-org/Scribe-Data/issues/7)).

🐞 Bug Fixes

- The statements in translation files have been fixed as they were improperly defined after a file was moved.

2.2.2

✨ Features

- An option to remove the `is_base` and `rank` sub keys was added.

♻️ Code Refactoring

- The export filenames for emoji keywords were renamed to reflect their usage in autosuggestions and soon autocompletions as well.

2.2.1

✨ Features

- The number of suggested emojis for words can now be limited.
- The total number of emojis that suggestions can be made for can now be limited.

2.2.0

✨ Features

- Scribe-Data now allows the user to create JSONs of word-emoji key-value pairs ([24](https://github.com/scribe-org/Scribe-Data/issues/24)).

2.1.0

✨ Features

- Scribe-Data can now split Wikidata queries into multiple stages to break up those that were too large to run ([21](https://github.com/scribe-org/Scribe-Data/issues/21)).

2.0.0

✨ Features

- Scribe-Data now has the ability to download Wikipedia dumps of any language ([15](https://github.com/scribe-org/Scribe-Data/issues/15)).
- Functions have been added to parse and clean the above dumps ([15](https://github.com/scribe-org/Scribe-Data/issues/15)).
- Autosuggestions are generated from the cleaned texts by deriving most common words and those words that most commonly follow them ([15](https://github.com/scribe-org/Scribe-Data/issues/15)).
- A query for profane words has been added and integrated into the autosuggest flow to make sure that inappropriate words are not included ([16](https://github.com/scribe-org/Scribe-Data/issues/16)).
- The adjectives column has been removed from Scribe data tables until support is offered.

♻️ Code Refactoring

- The error messages for incorrect args in update_data.py have been updated.

Page 2 of 3

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.