- Update `REGEX_NO_ALPHANUM_CHARS` regex to better handle non-Latin characters
1.4.55
- Updated `cloudpathlib` to version `~0.18.0`
- Remove `transformers` and `langchain` from `model-requirements.txt`
- Add a development guide to the `README.md`
1.4.54
- Fix OCR loading for gulfim and handle string page numbers
1.4.53
- Fix `parse_raw_numeric` function to handle a negative sign with multiple delimiters
1.4.52
- Fix `duplicate_text_for_spanned_cells=False` case
1.4.51
- Add `duplicate_text_for_spanned_cells` option in `TableTag._build_df`