Pdf2doi

Latest version: v1.7

Safety actively analyzes 714875 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

1.1

- Improved the internal behavior of some functions. The input argument of the function pdf2doi.pdf2doi_singlefile can now be either a string (with a relative o absolute path to the file to process) or a file object, open elsewhere in the code. The first input argument of all the "finder" functions now must be a file object (open elsewhere) and not a string with the file path.
- Cleaned up dependencies, removed version constraints that are not anymore necessary

1.0.1

- Improved the look-up of DOIs and Arxiv IDs in a text.
In previous versions, for each possible regexp identifying a DOI or Arxiv ID, the search would only look for the first occurrence of a potential DOI or Arxiv ID in the text. Now instead it searches for all possible results.

1.0

-Re-organized all code
-Moved all BibTeX-related stuff to a new package [pdf2bib](https://github.com/MicheleCotrufo/pdf2bib)
-Fixed minor bugs in previous version

0.6

Main bugs fixed:
- When parsing the author field of a bibtex entry, a problem occurred if an author name contained the substring "and".
- Version 0.5 was not compatible with Mac systems because the library winreg was imported without first checking the operating system (issue https://github.com/MicheleCotrufo/pdf2doi/issues/5).
- Files with extension ".PDF" (capital case) were not recognized as valid pdf files.
- An error occurred when sanitizing text strings that contained more than one latex code.

0.5

- Fixed several bugs regarding the generation of bibtex entries;
- Improved parsing of latex code in bibtex;
- Added right-click context menu functionalities (for Windows);

0.4

Initial stable release.

Can also be installed via
bash
pip install pdf2doi

Page 2 of 2

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.