Txtai

Latest version: v8.4.0

Safety actively analyzes 714736 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 6 of 9

4.3.1

This release adds the following new features, improvements and bug fixes.

Bug Fixes
--------------------------
- Fix word embeddings regression with batch transformation (245)

4.3.0

This release adds the following new features, improvements and bug fixes.

New Features
--------------------------
- Add notebook covering txtai embeddings index file structure (237)
- Add Image Hash pipeline (240)
- Add support for custom SQL functions in embeddings queries (241)
- Add notebook for Embeddings SQL functions (243)
- Add notebook for near-duplicate image detection (244)

Improvements
--------------------------
- Rename SQLException to SQLError (232)
- Refactor API instance into a separate package (233)
- API should raise an error if attempting to modify a read-only index (235)
- Add last update field to index metadata (236)
- Update transcription pipeline to use AutoModelForCTC (238)

Bug Fixes
--------------------------
- Ensure limit always set in embeddings search/batchsearch (234)
- Fix issue with parsing multiline SQL statements bug (242)

4.2.1

This release adds the following new features, improvements and bug fixes.

Bug Fixes
--------------------------
- Fixed mislabeled API config definition (231)

4.2.0

This release adds the following new features, improvements and bug fixes.

New Features
--------------------------
- Add notebook for workflow notifications (225)
- Add default and custom docker configurations (226)
- Create docker configuration for AWS Lambda (228)
- Add support for loading/storing embedding indexes on cloud storage (229)

Improvements
--------------------------
- Add support for SQL || operator (223)
- Add flag to disable loading index data in API (230)

Bug Fixes
--------------------------
- Modify database decoder methods to check for None (220)
- Modify embeddings search to make return type consistent when index initialized and not initialized (221)
- Embeddings index returning malformed JSON errors in certain situations (222)
- Check for empty documents input before indexing (224)

4.1.0

This release adds the following new features, improvements and bug fixes.

New Features
--------------------------
- Add entity extraction pipeline (203)
- Add workflow scheduling (206)
- Add workflow search task to API (210)
- Add Console Task (215)
- Add Export Task (216)
- Add notebook for workflow scheduling (218)

Improvements
--------------------------
- Default documentation theme using system preference (197)
- Improve multi-user experience for workflow application (198)
- Documentation improvements (200)
- Add social preview image for documentation (201)
- Add links to txtai in all example notebooks (202)
- Add limit parameter to API search method (208)
- Add documentation on local API instances (209)
- Add shorthand syntax for creating workflow tasks in API (211)
- Accept functions as workflow task actions in API (213)

Bug Fixes
--------------------------
- Object detection model fails to load additional models (204)
- Update unit tests to limit cpu usage for word vector tests (207)
- Add better error handling around unindexed embedding instances (212)
- Fix issue when workflow task generates no output (214)
- Add lock to API search methods (217)

4.0

To quantify the changes, the code base increased by 50% with 36 resolved issues, by far the biggest release of txtai. These changes were designed to be fully backward compatible but keep in mind it is a new major release.

[What's new in txtai 4.0](https://colab.research.google.com/github/neuml/txtai/blob/master/examples/24_Whats_new_in_txtai_4_0.ipynb) covers all the changes with detailed examples. The [documentation site](https://neuml.github.io/txtai) has also been refreshed.

New Features
--------------------------
- Store text content (168)
- Add option to index dictionaries of content (169)
- Add SQL support for generating combined embeddings + database queries (170)
- Add reindex method to embeddings (171)
- Add index archive support (172)
- Add close method to embeddings (173)
- Update API to work with embeddings + database search (176)
- Add content option to tabular pipeline (177)
- Update workflow example to support embeddings content (179)
- Add index metadata to embeddings config (180)
- Add object storage (183)
- Aggregate partial query results when clustering (184)
- Add function parameter to embeddings reindex (185)
- Add support for user defined column aliases (186)
- Use SQL bracket notation to support multi word and more complex JSON path expressions (187)
- Support SQLite 3.22+ (190)
- Add pre-computed vector support (192)
- Change document/object inserts to only keep latest record (193)
- Update documentation with 4.0 changes (196)

Improvements
--------------------------
- Modify workflow to select batches with slices (158)
- Add tensor support to workflows (159)
- Read YAML config if provided as a file path (162)
- Make adding pipelines to API easier (163)
- Process task actions concurrently (164)
- Add tensor workflow notebook (167)
- Update default ANN parameters (174)
- Require Python 3.7+ (175)
- Consistently name embeddings id fields (178)
- Add txtai __version__ attribute (181)
- Refresh notebooks for 4.0 (188)
- Modify embeddings to only iterate over input documents once (189)
- Improve efficiency of vector transformations (191)

Bug Fixes
--------------------------
- Add thread lock around API write calls (160)
- Expose caption and objects pipeline via API (161)
- Change pickle calls to use protocol supporting lowest Python version (182)
- HFOnnx expects ORT provider bug (195)

Page 6 of 9

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.