Pdfdeal

Latest version: v1.0.2

0.2.0

> [!caution]
> This version introduces major interface changes (impact range: all).
>
> - Function return values have changed. See the [update details](https://menghuan1918.github.io/pdfdeal-docs/changes/0.2.0.html) for how to migrate.

[English](https://menghuan1918.github.io/pdfdeal-docs/changes/) | [🌐简体中文](https://menghuan1918.github.io/pdfdeal-docs/zh/changes/#v0-2-0)

✨ Feature Changes

- Added a command-line program, `doc2x`, for batch processing PDF or image files with Doc2X. See the [CLI guide](https://menghuan1918.github.io/pdfdeal-docs/guide/CLI/) for usage.
- Added graphrag support to the CLI commands. See the [CLI guide](https://menghuan1918.github.io/pdfdeal-docs/guide/CLI/) for usage.
- Updated the Doc2X document translation functions. See the [usage guide](https://menghuan1918.github.io/pdfdeal-docs/guide/Doc2X/5.html).
- Enhanced exception handling.
- Function return values have changed and now include more detailed content.
- Decoupled the stages of the processing pipeline.

🔧 Bug Fixes

- [Doc2X] Fixed a possible infinite loop when using the personal API with input that contains multiple corrupted files.
- [FileTool] Fixed the `get_files` function not accepting the `pdf` output format.

🚀 Others

- Moved the documentation to a separate repository, [pdfdeal-docs](https://github.com/Menghuan1918/pdfdeal-docs)
- Updated unit tests

0.1.6

✅ No interface changes

✨ New Features

- Added a new function, `get_files`, that quickly collects all the files in a folder and keeps the file structure consistent before and after processing. See the [example](https://github.com/Menghuan1918/pdfdeal/blob/main/docs/doc2x.md#example-convert-a-pdf-in-a-folder-to-a-docx-file-and-keep-the-original-file-structure).
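
As a rough illustration of what "keeping the file structure consistent" can mean, the sketch below pairs each input file with an output path that mirrors its sub-folder. The helper name `mirrored_paths` and its parameters are hypothetical; this is not the signature of pdfdeal's `get_files`, which is documented in the linked example.

```python
from pathlib import Path


def mirrored_paths(src_root, dst_root, in_suffix, out_suffix):
    """Pair every matching file under src_root with a mirrored output path.

    Hypothetical helper for illustration only, not pdfdeal's `get_files`.
    """
    src_root, dst_root = Path(src_root), Path(dst_root)
    pairs = []
    for src in sorted(src_root.rglob(f"*{in_suffix}")):
        if not src.is_file():
            continue
        # Keep the path relative to the source root so sub-folders survive.
        rel = src.relative_to(src_root)
        pairs.append((src, (dst_root / rel).with_suffix(out_suffix)))
    return pairs
```

For example, `mirrored_paths("in", "out", ".pdf", ".docx")` would map `in/sub/b.pdf` to `out/sub/b.docx`.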

🐛 Bug Fixes

- Fixed the missing error message when uploading files over 100MB (the API limit) to the Doc2X API.
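
One way to surface such a limit early is to check the file size before uploading. The following is a minimal sketch, not pdfdeal's actual implementation: the function name `check_upload_size` is hypothetical, and reading "100MB" as 100 × 1024 × 1024 bytes is an assumption.

```python
import os

# 100MB limit taken from the changelog entry; the exact byte count is an assumption.
MAX_UPLOAD_BYTES = 100 * 1024 * 1024


def check_upload_size(path, limit=MAX_UPLOAD_BYTES):
    """Raise a clear error before uploading a file larger than the limit."""
    size = os.path.getsize(path)
    if size > limit:
        raise ValueError(
            f"{path} is {size} bytes, which exceeds the upload limit of {limit} bytes"
        )
    return size
```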

0.1.5

✅ No interface changes

🐛 Bug Fixes

- Fixed a failure to move files that occurred in extreme cases when using a custom output folder: `os.rename error - system cannot move files to other disks`
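
For context, `os.rename` cannot move a file across disks or filesystems, whereas the standard library's `shutil.move` falls back to copying and then deleting the source. The sketch below shows that general pattern; whether pdfdeal fixed the bug this way is not stated in the changelog, and `move_file` is a hypothetical helper name.

```python
import os
import shutil


def move_file(src, dst):
    """Move src to dst, even when dst is on a different disk or filesystem."""
    # Make sure the destination folder exists before moving.
    os.makedirs(os.path.dirname(os.path.abspath(dst)), exist_ok=True)
    # shutil.move uses os.rename when possible and otherwise copies the
    # file to dst and removes the original.
    return shutil.move(src, dst)
```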

🚀 Other

- Modularized the PDF file/OCR recognition engine

0.1.4

✅ No interface changes

🚀 Other

- Updated docstrings to follow the Google style guide

0.1.3

✅ No interface changes

✨ New Features

- Added the ability to replace all remote images in Markdown files with local ones.
- Refactored the `pdfdeal` function, which now supports batch input of files.
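
To illustrate the idea behind replacing remote images, the sketch below rewrites `![alt](http...)` links to local paths and hands each URL to a caller-supplied download callback. Everything here is an assumption made for illustration (the `localize_images` name, the `download` callback, the `images` folder); it is not pdfdeal's API, and it does not handle file-name collisions.

```python
import re
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Matches Markdown images whose target is an http(s) URL.
IMAGE_LINK = re.compile(r"!\[([^\]]*)\]\((https?://[^)\s]+)\)")


def localize_images(markdown, download, image_dir="images"):
    """Rewrite remote image links to local paths.

    `download(url, local_path)` is a hypothetical caller-supplied callback
    that saves the image; this sketch only rewrites the links.
    """
    def replace(match):
        alt, url = match.group(1), match.group(2)
        # Use the last URL path segment as the local file name.
        name = PurePosixPath(urlparse(url).path).name or "image"
        local_path = f"{image_dir}/{name}"
        download(url, local_path)
        return f"![{alt}]({local_path})"

    return IMAGE_LINK.sub(replace, markdown)
```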

🐛 Bug Fixes

- Reformatted the output of the native OCR file processing functions.
- Fixed `pdfdeal` failing to output md files under some circumstances.
- Removed the `Doc2x` used in version 0.0.x.

🚀 Other

- Documentation will be refactored for the next release

0.1.2

✅ No interface changes for seamless upgrades

✨ New Features

- Refactored RPM limiter to enhance batch file processing stability.
- Added unit tests for handling large numbers of files; all unit tests now run automatically in GitHub Actions.
- Backward compatible with Python 3.8.
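
For readers unfamiliar with RPM (requests per minute) limiting, here is a minimal sliding-window sketch of the general technique. It is not pdfdeal's limiter: the `RpmLimiter` class and its parameters are hypothetical, and the clock and sleep functions are injectable only to make the sketch easy to test.

```python
import time
from collections import deque


class RpmLimiter:
    """Allow at most `rpm` calls in any rolling 60-second window."""

    def __init__(self, rpm, clock=time.monotonic, sleep=time.sleep):
        if rpm < 1:
            raise ValueError("rpm must be at least 1")
        self.rpm = rpm
        self.clock = clock
        self.sleep = sleep
        self.calls = deque()  # timestamps of recent calls, oldest first

    def wait(self):
        """Block until another call is allowed, then record it."""
        now = self.clock()
        # Forget calls that have left the 60-second window.
        while self.calls and now - self.calls[0] >= 60:
            self.calls.popleft()
        if len(self.calls) >= self.rpm:
            # Sleep until the oldest recorded call leaves the window.
            self.sleep(60 - (now - self.calls[0]))
            now = self.clock()
            self.calls.popleft()
        self.calls.append(now)
```

Calling `limiter.wait()` before each request keeps a batch job under the configured rate.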

🐛 Bug Fixes

- Improved the stability of batch file processing
- Removed unnecessary parameters
