Changes
✨ New Features
- [FEAT] Fix resource accounting in PyRunner jaychia (2567)
- [FEAT] Add `.str.count_matches()` Vince7778 (2580)
- [FEAT]: streaming json universalmind303 (2582)
- [FEAT]: `embedding.cosine_distance` function universalmind303 (2526)
- [FEAT] Enable buffered iteration on plans jaychia (2566)
- [FEAT] Streaming CSV Reads colin-ho (2565)
- [FEAT] Support Reading Iceberg Merge-on-Read Position Deletes kevinzwang (2563)
- [FEAT]: daft sql universalmind303 (2558)
- [FEAT] Initializes daft-sql and defines the daft.sql(..) function. RCHowell (2559)
- [FEAT] Add image mode casting Vince7778 (2562)
- [FEAT] Add tracing to local execution engine samster25 (2556)
- [FEAT] Create file when writing dataframe with no rows kevinzwang (2540)
- [FEAT] Add string tokenize expression Vince7778 (2503)
- [FEAT]: support optional rowgroups to `read_parquet` universalmind303 (2534)
- [FEAT] Add hashjoin, sort, and hashagg ops to new executor colin-ho (2530)
👾 Bug Fixes
- [BUG] Fix `.str.length()` on Unicode strings Vince7778 (2579)
- [BUG] Fix table filters with scalar mask colin-ho (2542)
- [BUG] Enable sorting bool columns colin-ho (2529)
📖 Documentation
- [FEAT] Add `.str.count_matches()` Vince7778 (2580)
- [DOCS]: small update to "scaling-up" universalmind303 (2577)
- [DOCS] quickstart-revision avriiil (2124)
- [DOCS] Update Ray Dataset link kevinzwang (2564)
- [FEAT] Add image mode casting Vince7778 (2562)
- [DOCS] Fix notebook CI jaychia (2544)
- [FEAT] Add string tokenize expression Vince7778 (2503)
- [DOCS] Do not generate PDFs jaychia (2539)
- [DOCS] Build PDF and htmlzip formats of docs for offline consumption jaychia (2538)
🧰 Maintenance
- [CHORE] Remove daft-execution colin-ho (2553)
- [CHORE] Enable all features for rust-analyzer Vince7778 (2560)
- [CHORE] Add TPC-H questions 11-22 to benchmarks (currently skipped) kevinzwang (2299)
- [CHORE]: move minhash to daft-functions universalmind303 (2518)
⬆️ Dependencies
- Bump async-compression from 0.4.10 to 0.4.12 dependabot (2548)