- Fix division-by-zero bug introduced in the previous release
1.2.7
- Fix bugs
- Improve computation of image metrics on noisy documents
- Modify row detection for borderless tables in order to account for merged cells
- Implement Adaptive Run Length Smoothing Algorithm in order to isolate text areas in images
1.2.6
- Fix bugs related to OCR / table content extraction
1.2.5
- Fix bug in line detection
- Fix bug in cell creation
- Optimize algorithm performance
1.2.4
- Improve processing of tables with dotted lines
- Add detection of semi-bordered cells in tables
- Update borderless table algorithm
- Speed improvements and code optimization (2 to 4x faster depending on inputs)
1.2.3
- Add HTML representation to extracted tables
- Call OCR only on pages/images containing tables
- Bump Pillow requirement to address vulnerabilities