Paperetl

Latest version: v2.2.1

Safety actively analyzes 682416 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

1.4.0

This release adds the following enhancements and bug fixes:

- Handle PDF parsing exceptions (22)
- Increase test coverage (23)
- Modify merge method to handle no update merges (24)
- Fix bug with JSON export (25)
- Fix bug with study model training (26)

1.3.0

This release adds the following enhancements and bug fixes:

- Add file name as source for file process (12)
- Use XML id for file figure processing (13)
- Filter duplicate ids (14)
- Build test suite (15)

1.2.0

This release adds the following enhancements:

- Support recursive directory processing (7)
- Improve publication date parsing (8)
- Added incremental database updates (10)
- Remove citations (11)

1.1.1

Minor README update to note package can be installed from PyPI

1.1.0

Release addresses the following:

- PDF Extraction Improvements (1) - extract additional fields from TEI XML files generated via GROBID
- Fix Windows Install issues (2)

1.0.0

Initial release of paperetl, migrating ETL logic from existing [cord19q](https://github.com/neuml/cord19q) project.

Page 2 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.