Pix2text

Latest version: v1.1.2.3

Safety actively analyzes 722491 Python packages for vulnerabilities to keep your Python projects secure.

Page 1 of 3

1.1.1

Update 2024.06.17：**V1.1.0.7** Released

Major changes:

* adapted with cnstd>=1.2.4, thanks to [g1y5x3](https://github.com/g1y5x3) .

主要变更：

* 适配 cnstd>=1.2.4 ，感谢 [g1y5x3](https://github.com/g1y5x3) 。

Update 2024.06.04：**V1.1.0.6** Released

Major changes:

* Fix: The Text OCR incorrectly carried over the configuration from previous calls when it was called multiple times.

主要变更：

* 修复 bug：Text OCR 多次调用时错误沿用了之前的配置信息。

Update 2024.05.27：**V1.1.0.5** Released

Major changes:

* Fixed bugs such as that in `._parse_remaining`.

主要变更：

* 修复 `._parse_remaining` 等 bug。

Update 2024.05.20：**V1.1.0.4** Released

Major changes:

* set `table_as_image` as `True` if `self.table_ocr` is not available.
* fix typo: https://github.com/breezedeus/Pix2Text/pull/108 . Thanks to [billvsme](https://github.com/billvsme).

主要变更：

* 如果 `self.table_ocr` 不可用，将 `table_as_image` 设置为 `True`。
* 修复拼写错误：https://github.com/breezedeus/Pix2Text/pull/108 。感谢 [billvsme](https://github.com/billvsme)。

Update 2024.05.19：**V1.1.0.3** Released

Major changes:

* A new paid model, `mfr-plus`, has been added, which offers better recognition for multi-line formulas.
* When recognizing only English, CnOCR does not output Chinese.
* Bugs have been fixed.

主要变更：

* 加入新的付费模型：`mfr-plus`，对多行公式的识别效果更好。
* 在只识别英文时，CnOCR 不输出中文。
* 修复 bugs。

Update 2024.05.10：**V1.1.0.2** Released

Major changes:

* Fixed the error caused by empty lines in `merge_line_texts`.

主要变更：

* 修复 `merge_line_texts` 中空行导致的错误。

Update 2024.04.30：**V1.1.0.1** Released

Major changes:

* Fix the exception occurring when saving files on Windows.

主要变更：

* 修复 Windows 下存储文件时出现的异常。

Update 2024.04.28：**V1.1** Released

Major changes:

* Added layout analysis and table recognition models, supporting the conversion of images with complex layouts into Markdown format. See examples: [Pix2Text Online Documentation / Examples](https://pix2text.readthedocs.io/zh-cn/latest/examples_en/).
* Added support for converting entire PDF files to Markdown format. See examples: [Pix2Text Online Documentation / Examples](https://pix2text.readthedocs.io/zh-cn/latest/examples_en/).
* Enhanced the interface with more features, including adjustments to existing interface parameters.
* Launched the [Pix2Text Online Documentation](https://pix2text.readthedocs.io).

主要变更：

* 加入了版面分析和表格识别模型，支持把复杂排版的图片转换为 Markdown 格式，示例见：[Pix2Text 在线文档/Examples](https://pix2text.readthedocs.io/zh-cn/latest/examples/)。
* 支持把整个 PDF 文件转换为 Markdown 格式，示例见：[Pix2Text 在线文档/Examples](https://pix2text.readthedocs.io/zh-cn/latest/examples/)。
* 加入了更丰富的接口，已有接口的参数也有所调整。
* 上线了 [Pix2Text 在线文档](https://pix2text.readthedocs.io)。

Update 2024.03.30：**V1.0.2.3** Released

Major changes:

* Fixed the issue caused by `merge_line_texts`, see details at: https://github.com/breezedeus/Pix2Text/issues/84.
* Optimized the post-processing logic to handle some abnormal sequences.

主要变更：

* 修复 `merge_line_texts` 带来的错误，具体见：https://github.com/breezedeus/Pix2Text/issues/84 。
* 优化了后处理逻辑，处理部分不正常的序列。

Update 2024.03.18：**V1.0.2.2** Released

Major changes:

* The previously used `output_logits` argument is incompatible with transformers < 4.38.0, replaced by the `output_scores` argument. https://github.com/breezedeus/Pix2Text/issues/81
* Fixed a bug in `serve.py` that was not compatible with the new pix2text version.

主要变更：

* 之前使用的 `output_logits` 参数不兼容 transformers < 4.38.0，换为 `output_scores` 参数。 https://github.com/breezedeus/Pix2Text/issues/81
* 修复 `serve.py` 中未兼容新版接口的 bug。

Update 2024.03.15：**V1.0.2.1** Released

Major Changes:

* Fixed mishandling of LaTeX expressions during post-processing, such as replacing `\rightarrow` with `arrow`.
* Added `rec_config` parameter to `.recognize_text()` and `.recognize_formula()` methods for passing additional parameters for recognition.

主要变更：

* 修复对 LaTeX 表达式进行后处理时引入的误操作，如 `\rightarrow` 被替换为 `arrow`。
* 对 `.recognize_text()` 和 `.recognize_formula()` 加入了 `rec_config` 参数，以便传入用于识别的额外参数。

Update 2024.03.14：**V1.0.2** Released

Major Changes:

* Optimized the recognition process, improving the recognition of boundary punctuation that may have been missed before.
* Enhanced the LaTeX recognition results by restoring the formula tags to the formulas.
* Adjusted the output format of the recognition results, adding the `return_text` parameter to control whether to return only text or more detailed information. When returning more detailed information, confidence score `score` and position information `position` will also be provided. Thanks to [hiroi-sora](https://github.com/hiroi-sora) for the suggestion: https://github.com/breezedeus/Pix2Text/issues/67.

主要变更：

* 优化了识别的逻辑，以前可能漏识的边界标点现在可以比较好的识别。
* 对 Latex 识别结果进行了优化，把公式的 tag 还原到公式中。
* 调整了识别结果的输出格式，增加了参数 `return_text` 来控制结果是只返回文本还是更丰富的信息。当返回更丰富信息时，会返回置信度 `score` 以及位置信息 `position`。感谢 [hiroi-sora](https://github.com/hiroi-sora) 的建议：https://github.com/breezedeus/Pix2Text/issues/67 。

1.0.1

主要变更：

* 修复在 CUDA 环境下使用 `LatexOCR` 时出现的错误，具体见：https://github.com/breezedeus/Pix2Text/issues/65#issuecomment-1973037910 ，感谢 [MSZ-006NOC](https://github.com/MSZ-006NOC)。

1.0

主要变更：

* 数学公式识别（MFR）模型使用新架构，在新的数据集上训练，获得了 SOTA 的精度。具体说明请见：[Pix2Text V1.0 新版发布：最好的开源公式识别模型 | Breezedeus.com](https://www.breezedeus.com/article/p2t-v1.0)。

0.3

主要变更：

* 支持识别 **`80+` 种语言**，详细语言列表见 [支持的语言列表](./README_cn.md支持的语言列表)；

* 模型自动下载增加国内站点；

* 优化对检测 boxes 的合并逻辑。

0.2.3.3

主要变更：

* fix: bugfixed from [hiroi-sora](https://github.com/hiroi-sora) , thanks much.

0.2.3.2

主要变更：
* fix: 去掉 `consts.py` 无用的 `CATEGORY_MAPPINGS`。

Page 1 of 3

Releases

Has known vulnerabilities

Pix2text

Page 1 of 3

1.1.1

1.0.1

1.0

0.3

0.2.3.3

0.2.3.2

Page 1 of 3

Links

Releases