Paddleocr

Latest version: v2.9.1

Page 2 of 3

2.7.3

What's Changed
fixed 11808

2.7.2

What's Changed
* add finnish language files by savikko in https://github.com/PaddlePaddle/PaddleOCR/pull/10850
* fix `cls_x` and `bbox_x` is possibly unbound by SigureMo in https://github.com/PaddlePaddle/PaddleOCR/pull/10973
* update ppocrv4_framework by tink2123 in https://github.com/PaddlePaddle/PaddleOCR/pull/11047
* Update ONNX conversion readme_ch.md by greyovo in https://github.com/PaddlePaddle/PaddleOCR/pull/11030
* [TIPC]update tipc scripts and rm fluid api by USTCKAY in https://github.com/PaddlePaddle/PaddleOCR/pull/11098
* fix a bug for rec_postprocess.py by Ataraxy33 in https://github.com/PaddlePaddle/PaddleOCR/pull/11408
* Modify readme 27 by zhangyubo0722 in https://github.com/PaddlePaddle/PaddleOCR/pull/11424
* fix: layout recovery image:xxx.png,err msg: list index out of range by santlchogva in https://github.com/PaddlePaddle/PaddleOCR/pull/11405
* rm QR code in the document by tink2123 in https://github.com/PaddlePaddle/PaddleOCR/pull/11511
* rm QR code by tink2123 in https://github.com/PaddlePaddle/PaddleOCR/pull/11533
* Update custom.md by jzhang533 in https://github.com/PaddlePaddle/PaddleOCR/pull/11636
* fix AttributeError by GreatV in https://github.com/PaddlePaddle/PaddleOCR/pull/11556
* update pre-commit config by jzhang533 in https://github.com/PaddlePaddle/PaddleOCR/pull/11682
* lazy import PyMuPDF by jzhang533 in https://github.com/PaddlePaddle/PaddleOCR/pull/11685
* setup a workflow for publishing package to pypi, and bump version to … by jzhang533 in https://github.com/PaddlePaddle/PaddleOCR/pull/11800
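
The "lazy import PyMuPDF" entry above refers to deferring an optional dependency until the code path that needs it actually runs. The following is a minimal sketch of that general pattern, not the code from the pull request: `lazy_import` and `count_pdf_pages` are hypothetical names introduced here for illustration, and the only PyMuPDF calls assumed are `fitz.open` and `Document.page_count`.

```python
import importlib
from types import ModuleType


def lazy_import(module_name: str) -> ModuleType:
    """Import a module only at the point where it is first needed."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        # Re-raise with a message that names the missing optional dependency.
        raise ImportError(
            f"'{module_name}' is required for this feature; install it first."
        ) from exc


def count_pdf_pages(pdf_path: str) -> int:
    """Hypothetical helper: return the number of pages in a PDF."""
    # PyMuPDF is imported under the name `fitz`. Deferring the import means
    # image-only workflows never load it and do not fail if it is missing.
    fitz = lazy_import("fitz")
    with fitz.open(pdf_path) as doc:
        return doc.page_count
```

With this layout, importing the surrounding module succeeds even when PyMuPDF is not installed; the `ImportError` only surfaces when a PDF is actually processed.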

New Contributors
* savikko made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/10850
* greyovo made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/11030
* santlchogva made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/11405
* jzhang533 made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/11636

**Full Changelog**: https://github.com/PaddlePaddle/PaddleOCR/compare/v2.7.0...v2.7.2

2.7.1

New Projects
* Add the Parseq recognition model. (10836)
* Add the option for text recognition to return per-character coordinates. (10515)

New Features
* Add a savefile option to save OCR output results. (10628)
* Add more data preprocessing options to ppocr.py. (10217)
* A single damaged image no longer disrupts inference over the rest of the dataset. (10216)
* Compatible with fitz version. (10181)
* Compatible with the Pillow 10.0 upgrade. (10405)
* Add a Finnish dictionary file. (10850)
* ONNX Runtime inference supports GPU. (10668)
* TIPC supports XPU and NPU. (10658, 10460)
* Add inference on MLU devices. (10249)
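
The entry about a single damaged image not affecting dataset inference describes a common batch-processing pattern: catch the failure for one input, record it, and continue. The sketch below illustrates that pattern under stated assumptions; it is not PaddleOCR's implementation. `infer_all` and the `infer_one` callback are hypothetical names, and the callback stands in for whatever per-image inference function is in use.

```python
import logging
from typing import Callable, Dict, Iterable, List, Tuple, TypeVar

T = TypeVar("T")
logger = logging.getLogger(__name__)


def infer_all(
    paths: Iterable[str], infer_one: Callable[[str], T]
) -> Tuple[Dict[str, T], List[str]]:
    """Run `infer_one` on every path, skipping inputs that raise."""
    results: Dict[str, T] = {}
    failed: List[str] = []
    for path in paths:
        try:
            results[path] = infer_one(path)
        except Exception as exc:
            # A corrupt or unreadable image is logged and skipped so the
            # remaining inputs are still processed.
            logger.warning("Skipping %s: %s", path, exc)
            failed.append(path)
    return results, failed
```

Returning the list of failed paths alongside the results keeps skipped inputs visible to the caller instead of silently dropping them.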


Bug Fixes
* Fixed the "cannot find the library" error when packaging into an exe on Windows. (10502)
* Fixed a bug where recognizing multiple PDF files was affected by the maximum page count from the first recognition. (10290)
* Fixed the PPOCRLabel startup failure caused by a mismatch in channel number. (10847)
* Fixed a memory leak in cpp inference. (10441)
* Fixed the DSR error caused by data augmentation. (10662)
* Fixed the problem with passing the training seed. (10614)
* Fixed the table_master tipc error. (10514)
* Fixed the error raised when ppocr.py uses wandb. (10251)
* Fixed a memory leak in predict_rec.py. (10688)
* Fixed an index error on structure_boxes in the PaddleStructure::rebuild_table function that prevented dis and iou from being calculated correctly. (10810)
* Compatible with the retirement of fluid in paddle 2.5. (10391)
* Fixed the performance problem of Tensor.numpy under stride. (10773)
* Adapted the size of ABINet during export to the size of ABINetRecResizeImg. (10892)
* Fixed an ABINet training error. (10744)
* Fixed a KeyError in ch_PP-OCRv4_rec_distill.yml. (10761)

Documentation Fixes
* Fixed issues in the algorithm_kie_vi_layoutxlm_en.md, kie_datasets_en.md, README.md, algorithm_overview.md, and how_to_do_kie_en.md documents. (10717)
* Fixed documentation issues in setup.py. (10749)
* Added the missing pyyaml library to requirements.txt. (10653)


New Contributors
* RuohengMa made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/10614
* WilliamQf-AI made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/10628
* xlg-go made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/10744
* Gmgge made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/10777
* victor30608 made their first contribution in https://github.com/PaddlePaddle/PaddleOCR/pull/11025

**Full Changelog**: https://github.com/PaddlePaddle/PaddleOCR/compare/v2.7.0...v2.7.1

2.7.0

**Release Note**

- Release [PP-OCRv4](./doc/doc_ch/PP-OCRv4_introduction.md), with mobile and server versions.
  - PP-OCRv4-mobile: at comparable speed, performance in Chinese scenes improves by 4.5% over PP-OCRv3, performance in English scenes improves by 10%, and the average recognition accuracy of the 80-language multilingual models increases by more than 8%.
  - PP-OCRv4-server: the most accurate OCR model released to date; detection accuracy increases by 4.9% in Chinese and English scenes, and recognition accuracy increases by 2%.
  - See the [quickstart](./doc/doc_en/quickstart_en.md) to get started with a one-line command. The whole process of model training, inference, and high-performance deployment can also be completed with little code using the [General OCR Industry Solution](https://aistudio.baidu.com/aistudio/modelsdetail?modelId=286) in PaddleX.
- Release [PP-ChatOCR](https://aistudio.baidu.com/aistudio/modelsdetail?modelId=332), a new scheme for extracting key information in general scenes using the PP-OCR model and the ERNIE LLM.

2.6.0

**Release Note**
- Release [PP-Structurev2](./ppstructure/), with functions and performance fully upgraded, adapted to Chinese scenes, and new support for [Layout Recovery](./ppstructure/recovery) and **a one-line command to convert PDF to Word**;
- [Layout Analysis](./ppstructure/layout) optimization: model storage is reduced by 95%, speed is increased by 11 times, and the average CPU time cost is only 41 ms;
- [Table Recognition](./ppstructure/table) optimization: 3 optimization strategies are designed, and the model accuracy is improved by 6% under comparable time consumption;
- [Key Information Extraction](./ppstructure/kie) optimization: a visual-independent model structure is designed, the accuracy of semantic entity recognition is increased by 2.8%, and the accuracy of relation extraction is increased by 9.1%.

2.5.0

**Release Note**

- Release [PP-OCRv3](./doc/doc_en/ppocr_introduction_en.md#pp-ocrv3): at comparable speed, performance in Chinese scenes improves by a further 5% over PP-OCRv2, performance in English scenes improves by 11%, and the average recognition accuracy of the 80-language multilingual models improves by more than 5%.
- Release [PPOCRLabelv2](./PPOCRLabel): adds annotation functions for the table recognition task, the key information extraction task, and irregular text images.
- Release the interactive e-book [*"Dive into OCR"*](./doc/doc_en/ocr_book_en.md), covering the cutting-edge theory and code practice of full-stack OCR technology.
