Spark-nlp

Latest version: v5.5.1

Page 1 of 23

5.5.0

========
----------------
New Features & Enhancements
----------------
* Introduced QWEN2Transformer (14188)
* Introduced MiniCPM (14205)
* Introduced NLLB (14209)
* Implemented Nomic embeddings (14217)
* Introduced CamemBertForZeroShotClassification annotator (14354)
* Implemented Mxbai Embeddings (14355)
* Introduced AlbertForZeroShotClassification (14361)
* Introduced Phi-3 (14373)
* Implemented Starcoder2 for causal language modeling (14358)
* Integrated llama.cpp (14364)
* Implemented SnowFlake (14353)
* Introduced ONNX support to vision annotators (14356)
* Introduced ONNX and OpenVINO support to annotators that were missing it (14359)
* Added OpenVINO install instructions (14382)
* Exported notebooks for release candidate (14393)


========

5.4.2

========
----------------
New Features & Enhancements
----------------
* Added demo notebook for Image Classification Annotators
* Added aggressiveMatching parameter to DateMatcher and MultiDateMatcher annotators
* Added aggressiveMatching parameter to DocumentSimilarityRanker annotator


========

5.4.1

========
----------------
New Features & Enhancements
----------------
* Added support for loading duplicate models in Spark NLP, allowing multiple models from the same annotator to be loaded simultaneously.
* Updated the README for better coherence and added new pages to the website.
* Added support for a stop IDs list to halt text generation in Phi, Mistral, and Llama annotators.
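To illustrate what a stop IDs list does during text generation, here is a minimal, library-free sketch. It is not the Spark NLP implementation or API: `generate_until_stop`, `next_token`, and `count_up` are hypothetical names, and the choice to leave the stop token out of the output is an assumption made for this example only.

```python
from typing import Callable, Iterable, List


def generate_until_stop(
    next_token: Callable[[List[int]], int],
    prompt_ids: List[int],
    stop_ids: Iterable[int],
    max_new_tokens: int = 16,
) -> List[int]:
    """Append tokens until a stop ID is produced or the budget is spent."""
    stop = set(stop_ids)          # set lookup keeps the per-token check cheap
    output = list(prompt_ids)     # copy so the caller's prompt is not mutated
    for _ in range(max_new_tokens):
        token = next_token(output)
        if token in stop:
            break                 # assumption: the stop token is not emitted
        output.append(token)
    return output


# Toy stand-in for a language model: emits the last token ID plus one.
def count_up(ids: List[int]) -> int:
    return ids[-1] + 1


result = generate_until_stop(count_up, [1, 2], stop_ids=[5])
print(result)
```

With an empty stop list, generation in this sketch ends only when `max_new_tokens` is reached.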

----------------
Bug Fixes
----------------
* Fixed the default model names for Phi2 and Mistral AI annotators.

========

5.4.0

========
----------------
New Features & Enhancements
----------------
* Added OpenVINO Runtime integration for various models, enabling enhanced inference performance. (14246)
* Added Python APIs to incorporate OpenVINO support. (14242)
* Introduced support for ONNX models and average pooling in ONNX-based annotators. (14245)
* Implemented MPNet for token classification. (14244)
* Added support for MistralAI LLM and LLAMA2. (14243)
* Improved caching mechanisms in Streamlit demos. (14241)
* Enhanced model cards and README documentation for Models Hub. (14240)
* Added OpenVINO GPU dependencies. (14236)
* Locked macOS version for runners and added missing SBT setup. (14235)
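For readers unfamiliar with the average pooling mentioned above, the following is a rough, library-free sketch of the usual idea: token vectors are averaged into one sentence vector, skipping padded positions. That the ONNX-based annotators pool in exactly this mask-aware way is an assumption, and `average_pool` is a hypothetical helper, not a Spark NLP function.

```python
from typing import List, Sequence


def average_pool(
    token_vectors: Sequence[Sequence[float]],
    attention_mask: Sequence[int],
) -> List[float]:
    """Mean of the token vectors whose mask entry is non-zero."""
    # Keep only real tokens; padded positions carry a mask value of 0.
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("attention mask excludes every token")
    dim = len(kept[0])
    # Average each embedding dimension across the kept tokens.
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


# Two real tokens and one padded position (mask 0) that is ignored.
pooled = average_pool([[1.0, 2.0], [3.0, 4.0], [9.0, 9.0]], [1, 1, 0])
print(pooled)
```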

----------------
Bug Fixes
----------------
* Fixed bugs in Colab notebooks. (14239)
* Resolved issues with BERT backend and broken annotators. (14238)
* Corrected LLAMA2 position ID and generation bug. (14237)


========

5.3.3

========
----------------
New Features & Enhancements
----------------
* **NEW:** Introduce UAEEmbeddings for sentence embeddings using Universal AnglE Embedding, aimed at improving semantic textual similarity tasks
* Improve processing of the CoNLL-U format for Dependency Parser training, including better handling of multiword tokens and of missing uPos values
* Add example notebook for `DocumentCharacterTextSplitter`
* Add example notebook for `DeBertaForZeroShotClassification`
* Add example notebooks for `BGEEmbeddings` and `MPNetEmbeddings`
* Add example notebook for `MPNetForQuestionAnswering`
* Add example notebook for `MPNetForSequenceClassification`
* Implement a cache mechanism for `metadata.json` to avoid unnecessary downloads

----------------
Bug Fixes
----------------
* Fix a bug in serializing ONNX models that lack a `.onnx_data` file
* Delete redundant `Multilingual_Translation_with_M2M100.ipynb` notebook entries
* Fix Colab link for the M2M100 notebook


========

5.3.2

========
----------------
Bug Fixes
----------------
* Fix and add notebooks to import models from Hugging Face
* Add ONNX and TensorFlow notebooks
* Fix XlnetForSequenceClassification and add XlnetForTokenClassification
* Rename DistilBertForZeroShotClassification
* Add missing notebooks
* Add MPNetEmbeddings to annotator
* Fix XLMRoBertaForQuestionAnswering, XLMRoBertaForTokenClassification, and XLMRoBertaForSequenceClassification: revert the change in tfFile naming that caused exceptions when loading and saving the models
* Fix documentation for sparknlp.start()

========