Pix2text

Latest version: v1.1.2.3

Safety actively analyzes 722491 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 3

1.1.1

Update 2024.06.17:**V1.1.0.7** Released

Major changes:

* adapted with cnstd>=1.2.4, thanks to [g1y5x3](https://github.com/g1y5x3) .

主要变更:

* 适配 cnstd>=1.2.4 ,感谢 [g1y5x3](https://github.com/g1y5x3) 。

Update 2024.06.04:**V1.1.0.6** Released

Major changes:

* Fix: The Text OCR incorrectly carried over the configuration from previous calls when it was called multiple times.

主要变更:

* 修复 bug:Text OCR 多次调用时错误沿用了之前的配置信息。

Update 2024.05.27:**V1.1.0.5** Released

Major changes:

* Fixed bugs such as that in `._parse_remaining`.

主要变更:

* 修复 `._parse_remaining` 等 bug。

Update 2024.05.20:**V1.1.0.4** Released

Major changes:

* set `table_as_image` as `True` if `self.table_ocr` is not available.
* fix typo: https://github.com/breezedeus/Pix2Text/pull/108 . Thanks to [billvsme](https://github.com/billvsme).

主要变更:

* 如果 `self.table_ocr` 不可用,将 `table_as_image` 设置为 `True`。
* 修复拼写错误:https://github.com/breezedeus/Pix2Text/pull/108 。感谢 [billvsme](https://github.com/billvsme)。

Update 2024.05.19:**V1.1.0.3** Released

Major changes:

* A new paid model, `mfr-plus`, has been added, which offers better recognition for multi-line formulas.
* When recognizing only English, CnOCR does not output Chinese.
* Bugs have been fixed.

主要变更:

* 加入新的付费模型:`mfr-plus`,对多行公式的识别效果更好。
* 在只识别英文时,CnOCR 不输出中文。
* 修复 bugs。


Update 2024.05.10:**V1.1.0.2** Released

Major changes:

* Fixed the error caused by empty lines in `merge_line_texts`.

主要变更:

* 修复 `merge_line_texts` 中空行导致的错误。


Update 2024.04.30:**V1.1.0.1** Released

Major changes:

* Fix the exception occurring when saving files on Windows.

主要变更:

* 修复 Windows 下存储文件时出现的异常。


Update 2024.04.28:**V1.1** Released

Major changes:

* Added layout analysis and table recognition models, supporting the conversion of images with complex layouts into Markdown format. See examples: [Pix2Text Online Documentation / Examples](https://pix2text.readthedocs.io/zh-cn/latest/examples_en/).
* Added support for converting entire PDF files to Markdown format. See examples: [Pix2Text Online Documentation / Examples](https://pix2text.readthedocs.io/zh-cn/latest/examples_en/).
* Enhanced the interface with more features, including adjustments to existing interface parameters.
* Launched the [Pix2Text Online Documentation](https://pix2text.readthedocs.io).

主要变更:

* 加入了版面分析和表格识别模型,支持把复杂排版的图片转换为 Markdown 格式,示例见:[Pix2Text 在线文档/Examples](https://pix2text.readthedocs.io/zh-cn/latest/examples/)。
* 支持把整个 PDF 文件转换为 Markdown 格式,示例见:[Pix2Text 在线文档/Examples](https://pix2text.readthedocs.io/zh-cn/latest/examples/)。
* 加入了更丰富的接口,已有接口的参数也有所调整。
* 上线了 [Pix2Text 在线文档](https://pix2text.readthedocs.io)。


Update 2024.03.30:**V1.0.2.3** Released

Major changes:

* Fixed the issue caused by `merge_line_texts`, see details at: https://github.com/breezedeus/Pix2Text/issues/84.
* Optimized the post-processing logic to handle some abnormal sequences.

主要变更:

* 修复 `merge_line_texts` 带来的错误,具体见:https://github.com/breezedeus/Pix2Text/issues/84 。
* 优化了后处理逻辑,处理部分不正常的序列。

Update 2024.03.18:**V1.0.2.2** Released

Major changes:

* The previously used `output_logits` argument is incompatible with transformers < 4.38.0, replaced by the `output_scores` argument. https://github.com/breezedeus/Pix2Text/issues/81
* Fixed a bug in `serve.py` that was not compatible with the new pix2text version.

主要变更:

* 之前使用的 `output_logits` 参数不兼容 transformers < 4.38.0,换为 `output_scores` 参数。 https://github.com/breezedeus/Pix2Text/issues/81
* 修复 `serve.py` 中未兼容新版接口的 bug。

Update 2024.03.15:**V1.0.2.1** Released

Major Changes:

* Fixed mishandling of LaTeX expressions during post-processing, such as replacing `\rightarrow` with `arrow`.
* Added `rec_config` parameter to `.recognize_text()` and `.recognize_formula()` methods for passing additional parameters for recognition.

主要变更:

* 修复对 LaTeX 表达式进行后处理时引入的误操作,如 `\rightarrow` 被替换为 `arrow`。
* 对 `.recognize_text()` 和 `.recognize_formula()` 加入了 `rec_config` 参数,以便传入用于识别的额外参数。

Update 2024.03.14:**V1.0.2** Released

Major Changes:

* Optimized the recognition process, improving the recognition of boundary punctuation that may have been missed before.
* Enhanced the LaTeX recognition results by restoring the formula tags to the formulas.
* Adjusted the output format of the recognition results, adding the `return_text` parameter to control whether to return only text or more detailed information. When returning more detailed information, confidence score `score` and position information `position` will also be provided. Thanks to [hiroi-sora](https://github.com/hiroi-sora) for the suggestion: https://github.com/breezedeus/Pix2Text/issues/67.

主要变更:

* 优化了识别的逻辑,以前可能漏识的边界标点现在可以比较好的识别。
* 对 Latex 识别结果进行了优化,把公式的 tag 还原到公式中。
* 调整了识别结果的输出格式,增加了参数 `return_text` 来控制结果是只返回文本还是更丰富的信息。当返回更丰富信息时,会返回置信度 `score` 以及位置信息 `position`。感谢 [hiroi-sora](https://github.com/hiroi-sora) 的建议:https://github.com/breezedeus/Pix2Text/issues/67 。

1.0.1

主要变更:

* 修复在 CUDA 环境下使用 `LatexOCR` 时出现的错误,具体见:https://github.com/breezedeus/Pix2Text/issues/65#issuecomment-1973037910 ,感谢 [MSZ-006NOC](https://github.com/MSZ-006NOC)。

1.0

主要变更:

* 数学公式识别(MFR)模型使用新架构,在新的数据集上训练,获得了 SOTA 的精度。具体说明请见:[Pix2Text V1.0 新版发布:最好的开源公式识别模型 | Breezedeus.com](https://www.breezedeus.com/article/p2t-v1.0)。

0.3

主要变更:

* 支持识别 **`80+` 种语言**,详细语言列表见 [支持的语言列表](./README_cn.md支持的语言列表);

* 模型自动下载增加国内站点;

* 优化对检测 boxes 的合并逻辑。

0.2.3.3

主要变更:

* fix: bugfixed from [hiroi-sora](https://github.com/hiroi-sora) , thanks much.

0.2.3.2

主要变更:
* fix: 去掉 `consts.py` 无用的 `CATEGORY_MAPPINGS`。

Page 1 of 3

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.