Openparse

Latest version: v0.5.5

Safety actively analyzes 622940 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 2

0.5.5

What's Changed
* [Memory Leak Fix] Create Fitz Pdf From Bytestream by Filimoa in https://github.com/Filimoa/open-parse/pull/39


**Full Changelog**: https://github.com/Filimoa/open-parse/compare/v0.5.4...v0.5.5

0.5.4

What's Changed
* 29 [minor tweak to mashihua's branch] by Filimoa in https://github.com/Filimoa/open-parse/pull/32
* Fix bug with parse.py by mashihua in https://github.com/Filimoa/open-parse/pull/29

New Contributors
* mashihua made their first contribution in https://github.com/Filimoa/open-parse/pull/29

**Full Changelog**: https://github.com/Filimoa/open-parse/compare/v0.5.3...v0.5.4

0.5.3

Minor bug fixes

What's Changed
* Update pymupdf.md by ada-lovecraft in https://github.com/Filimoa/open-parse/pull/20
* update the cookbooks link by brianjking in https://github.com/Filimoa/open-parse/pull/24
* fix: Fix sequence item 2: expected str instance, NoneType found exception when table output is set to markdown. by ic-xu in https://github.com/Filimoa/open-parse/pull/27

New Contributors
* ada-lovecraft made their first contribution in https://github.com/Filimoa/open-parse/pull/20
* brianjking made their first contribution in https://github.com/Filimoa/open-parse/pull/24
* ic-xu made their first contribution in https://github.com/Filimoa/open-parse/pull/27

**Full Changelog**: https://github.com/Filimoa/open-parse/compare/v0.5.2...v0.5.3

0.5.2

Features

- Better version display
- Fixed pytorch device bug. Thanks jinmang2
- Add global config to set pytorch device

0.5.1

Bug Fixes
- Fixed type hinting bug for python < 3.10

**Full Changelog**: https://github.com/Filimoa/open-parse/compare/v0.5.0...v0.5.1

0.5.0

What's Changed
- SemanticProcessing! This is the recommended processing pipeline.
- Add optional annotations to the pdf draw functions
- Fixed reading order bug

Breaking Changes
1. Renaming
- `Node.aggregate_position` renamed to `Node.reading_order`.
- `RemoveStubs` to `RemoveNodesBelowNTokens`
2. Refactored processing pipelines to use a class to promote ease of reuse

Previously
python
from openparse import ProcessingStep, default_pipeline, Node
from typing import List


class CustomCombineTables(ProcessingStep):
def process(self, nodes: List[Node]) -> List[Node]:
return nodes


copy the default pipeline (or create a new one)
custom_pipeline = default_pipeline.copy()
custom_pipeline.append(CustomCombineTables())

parser = openparse.DocumentParser(
table_args={"parsing_algorithm": "pymupdf"}, processing_pipeline=custom_pipeline
)
custom_10k = parser.parse(meta10k_path)


Now becomes

python
from openparse import processing, Node
from typing import List


class CustomCombineTables(processing.ProcessingStep):
def process(self, nodes: List[Node]) -> List[Node]:
return nodes


copy the default pipeline (or create a new one)
custom_pipeline = processing.BasicIngestionPipeline()
custom_pipeline.append_transform(CustomCombineTables())

parser = openparse.DocumentParser(
table_args={"parsing_algorithm": "pymupdf"}, processing_pipeline=custom_pipeline
)
custom_10k = parser.parse(meta10k_path)

3. `openai` and `numpy` as now required dependencies, will likely split this out in the future.

**Full Changelog**: https://github.com/Filimoa/open-parse/compare/v0.4.1...v0.5.0

Page 1 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.