Latest version: v0.0.4
The information on this page was curated by experts in our Cybersecurity Intelligence Team.
A Python package for token-aware HTML chunking that preserves structure and attributes, with optional cleaning and attribute length control.
No known vulnerabilities found