Paperetl

Latest version: v2.3.0



1.5.0

This release adds the following enhancements and bug fixes:

- Add Dockerfile for building paperetl environment (9)
- Add component to build entry-dates.csv (18)
- Add pre-trained study design models to GitHub (19)
- Update README to correct and improve documentation (20)
- Ensure length of sections is less than max nlp length (27)

1.4.0

This release adds the following enhancements and bug fixes:

- Handle PDF parsing exceptions (22)
- Increase test coverage (23)
- Modify merge method to handle no update merges (24)
- Fix bug with JSON export (25)
- Fix bug with study model training (26)

1.3.0

This release adds the following enhancements and bug fixes:

- Add file name as source for file process (12)
- Use XML id for file figure processing (13)
- Filter duplicate ids (14)
- Build test suite (15)
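
As a rough illustration of what filtering duplicate ids can look like, the sketch below keeps the first record seen for each id and drops later repeats. It is not paperetl's actual code: the function name `filter_duplicates`, the record layout, and the `id` key are all assumptions made for this example.

```python
def filter_duplicates(records, key="id"):
    """Hypothetical helper: keep only the first record for each id."""
    seen = set()
    unique = []
    for record in records:
        uid = record[key]
        if uid in seen:
            # A record with this id was already kept; skip the repeat
            continue
        seen.add(uid)
        unique.append(record)
    return unique
```

Keeping the first occurrence preserves input order; a real pipeline might instead prefer the most recent record for each id.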

1.2.0

This release adds the following enhancements:

- Support recursive directory processing (7)
- Improve publication date parsing (8)
- Add incremental database updates (10)
- Remove citations (11)
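
For readers unfamiliar with the idea, recursive directory processing means walking an input directory and all of its subdirectories rather than only the top level. The sketch below shows one way to do that with the standard library. It is an illustration only, not paperetl's implementation: the function name `find_files` and the default extensions are assumptions.

```python
from pathlib import Path


def find_files(root, extensions=(".pdf", ".xml")):
    """Hypothetical helper: yield matching files under root, including subdirectories."""
    # rglob("*") descends into every subdirectory of root
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() in extensions:
            yield path
```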

1.1.1

Minor README update to note that the package can be installed from PyPI.

1.1.0

This release addresses the following:

- PDF extraction improvements (1): extract additional fields from TEI XML files generated via GROBID
- Fix Windows Install issues (2)

