Cazy-webscraper

Latest version: v2.3.0.3

Safety actively analyzes 693883 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 4 of 5

2.0.8

Update entry point import path and name to `cw_extract_db_seqs`

2.0.6

Summary
- Fixed incomplete retrieval of proteins from the local CAZyme database that match the specified criteria
- Improved clarity of data included in the Logs table
- Changed Pdbs 1-1 Genbanks relationship to many-to-many: Pdbs 1-* Genbanks_Pdbs *-1 Genbanks
- Made retrieval of proteins from the local CAZyme database that match the specified criteria significantly faster
- Updated documentation
- Finished API
- Fixed failed JSON serialisation of data retrieved by the API
- Add option to add prefix to filenames generated by the API
- Use the `saintBioutils` package to handle logging and some file_io operations

What's Changed
* Trouble shoot extract seqs by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/75
* Fix blank pdb accs by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/79
* Fix log table contents by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/81
* Fix parsing config and selecting candidates of interest by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/83
* Tidy and update docs by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/84


**Full Changelog**: https://github.com/HobnobMancer/cazy_webscraper/compare/v2.0.5...v2.0.6

2.0.5

What's Changed

Pull Requests
* Trouble shoot uniprot by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/73
* Trouble shoot getting data from NCBI by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/74

Details
* **UniProt:** cazy_webscraper can now be used successfully for retrieving data from UniProt and adding the data to the local CAZyme database. This includes retrieving:
* UniProt accessions
* Protein names
* Protein sequences
* EC number annotations
* PDB accessions

* **GenBank:** cazy_webscraper can now be used to automate the retreival of protein sequences from GenBank for proteins in a local CAZyme database mathcing the users specified critieria. These protein sequences are stored in the local CAZyme database, and can be extracted to a FASTA file using cazy_webscraper

* **Caching:**
* More data is cached
* Cached data can be used to continue data retrievals from UniProt and GenBank, when a previous retrieval and/or addition of the data to the database fails
* Improved default name of cache dirs and subdirs

* **Unit tests:** Started rewrite of unit tests to match the new program architecture

* **Documentation:** Updating the documentation to include the new flags/options, and adding new tutorials for rautomating the retrieval if data from UniProt, GenBank and PDB



**Full Changelog**: https://github.com/HobnobMancer/cazy_webscraper/compare/v2.0.3...v2.0.5

2.0.3

Beta release of version 2.

Bug fixes
- Fixes making only the parent dirs of an output database path
- Fixes not finding the `cazy_webscraper` module

What's Changed
* Update unit tests by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/68
* Update unit tests by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/70
* update v number by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/71
* Fix output dir making by HobnobMancer in https://github.com/HobnobMancer/cazy_webscraper/pull/72


**Full Changelog**: https://github.com/HobnobMancer/cazy_webscraper/compare/v2.0.0...v2.0.3

2.0.0

Beta release of version 2.

1.0.2

Page 4 of 5

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.