Txtai

Latest version: v8.4.0

Safety actively analyzes 723685 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 5 of 9

5.1.0

This release adds new model support for the translation pipeline, OpenAI Whisper support in the transcription pipeline and ARM Docker images. Topic modeling was also updated with improvements, including how to use BM25/TF-IDF indexes to drive topic models.

See below for full details on the new features, improvements and bug fixes.

New Features
--------------------------
- Multiarch docker image (324)
- Add notebook covering classic topic modeling with BM25 (360)

Improvements
--------------------------
- Read authentication parameters from storage task (332)
- Update scoring algorithms (351)
- Add config option for list of stopwords to ignore with topic generation (352)
- Allow for setting custom translation model path (355)
- Update caption pipeline to call image-to-text pipeline (361)
- Update transcription pipeline to call automatic-speech-recognition pipeline (362)
- Only pass tokenizer to pipeline when necessary (363)
- Improve default max length logic for text generation (364)
- Update transcription notebook (365)
- Update translation notebook (366)
- Move mkdocs dependencies from docs.yml to setup.py (368)

Bug Fixes
--------------------------
- GitHub Actions build error with torch 1.12 on macOS (300)
- SQLite JSON support not built into Python Windows builds < 3.9 (356)
- Use tags field in application.add (359)
- Fix issue with Application autosequencing (367)

5.0

New Features
--------------------------
- Add scoring-based search (327)
- Add notebook demonstrating functionality of individual embeddings components (328)
- Add SQL expression columns (338)
- Add semantic graph component (339)
- Add notebook covering Semantic Graphs (341)
- Add graph documentation (343)
- Allow custom ann, database and graph instances (344)

Improvements
--------------------------
- Clarify embeddings.save documentation (325)
- Modify embeddings search candidate default logic (326)
- Update console to conditionally import library (333)
- Update ANN package to make terminology more consistent (334)
- Support non-text document elements in Applications (335)
- Update workflow documentation to note generator execution (336)
- Update audio transcription notebook to include example with OpenAI Whisper (345)

Bug Fixes
--------------------------
- Calling scoring.index with no tokens parsed results in error (337)
- Fix cached_path error with transformers v4.22 (340)
- Fix docker command "--it". Thank you to lipusz! (346)
- Error loading compressed indexes in console bug (347)

5.0.0

🎈🎉🥳 We're excited to announce the release of txtai 5.0! 🥳🎉🎈

_Thank you to the txtai community! Please remember to ⭐ txtai!_

4.6.0

🎈🎉🥳 txtai turns 2 🎈🎉🥳

_We're excited to release the 25th version of txtai marking it's 2 year anniversary. Thank you to the txtai community. Please remember to ⭐ txtai!_

txtai 4.6 is a large but backwards compatible release! This release adds better integration between embeddings and workflows. It also adds a number of significant performance improvements and bug fixes.

New Features
--------------------------
- Add transform workflow action to application (281)
- Add ability to resolve workflows within applications (290)
- OFFSET in sql query statement (293)
- Add webpage summary image generation notebook (299)
- Add notebook on running txtai with native code (304)
- Add mmap parameter to Faiss (308)
- Add indexing guide to docs (312)

Improvements
--------------------------
- Consume generator outputs in workflow tasks (291)
- Update pipeline workflow notebook (292)
- Update tabular notebook (297)
- Lower required version of Pillow library to prevent unnecessary upgrades (303)
- Embeddings vector batch improvements (309)
- Use single constant for current pickle protocol (310)
- Move quantize config param to Faiss (311)
- Update documentation with new demo and diagrams (313)
- Improve embeddings performance with large query limits (318)

Bug Fixes
--------------------------
- ModuleNotFoundError: No module named 'transformers.hf_api' (274)
- Dependency issue with ONNX and Protobuf (285)
- The key should be writable instead of path. Thank you to csnelsonchu! (287)
- Fix breaking change in build script from mkdocstrings bug (289)
- Index id sync issue when inserting multiple data types (text, documents, objects) into Embeddings (294)
- Labels pipeline outputs changed with transformers 4.20.0 (295)
- Tabular pipeline throws error when processing list fields (296)
- txtai load testing (305)
- Add cloud config to application.upsert method (306)

4.5.0

This release adds the following new features, improvements and bug fixes.

New Features
--------------------------
- Add scripts to train bashsql query translation model (271)
- Add QA database example notebook (272)
- Add CITATION file (273)

Improvements
--------------------------
- Improve efficiency of external vectors (275)
- Refactor vectors package to improve code reuse (276)
- Add logic to detect external vectors method (277)

Bug Fixes
--------------------------
- Fix summary pipeline issue with transformers>=4.19.0 (278)

4.4.0

This release adds the following new features, improvements and bug fixes.

New Features
--------------------------
- Add semantic search explainability (248)
- Add notebook covering model explainability (249)
- Add txtai console (252)
- Add sequences pipeline (261)
- Add scripts to train query translation models (265)
- Add query translation logic in embeddings searches (266)
- Add notebook for query translation (269)

Improvements
--------------------------
- Update HFTrainer to support sequence-sequence models (262)

Bug Fixes
--------------------------
- Unit tests failing with tokenizers>= 0.12 (253)
- Running default.config.yml returns TypeError: register() got an unexpected keyword argument 'ids' (256)
- Unit tests failing with transformers==4.18.0 (258)
- Update precommit to use latest version of psf black (259)

Page 5 of 9

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.