Llama-assistant

Latest version: v0.1.41

Safety actively analyzes 722491 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 2

0.1.41

https://github.com/user-attachments/assets/077c8692-e148-46aa-a659-7171a1bbc165

๐Ÿ”ง Changes
- [x] Utilize llama cpp KV cache mechanism to make faster inference. See (1)
- [x] Summarize the chat history when it is about to exceeds the context length
- [x] Recursive check and update missing setting from te DEFAULT CONFIG
- [x] Add validators (type, min, max value) for input fields in the setting dialog

(1) llama cpp's KV cache check prefix of your chat history to reuse the K-V cache. For example:
- Generated sequence so far = "ABCDEF"
- If we modify the chat history somehow like: "ABCDXT". Then it matches prefix and reuses the cache for "ABCD" and newly computes the Key and Value vectors for "XT", then generates new responses.
-> So we need to make the most of this mechanism by keep the history prefix as fixed as possible

0.1.40

![Llama Assistant v0.1.40](https://github.com/user-attachments/assets/07315fd1-7706-45b4-bc91-f9641c60edc0)

๐Ÿ”ง Changes
- ๐Ÿ’ฌ Supported continue conversation.
- ๐Ÿ” Added RAG support with [LlamaIndex](https://github.com/run-llama/llama_index).
- โš™๏ธ Added model settings.
- ๐Ÿ“ Added markdown support.
- โŒ› Added loading text animation while downloading the models and generating answers.
- ๐Ÿ”„ Fixed chatbox scrolling issue.


Thank gallegi for adding these features! ๐Ÿ‘

0.1.32

![Llama Assistant v0.1.28](https://github.com/user-attachments/assets/8e311537-57d4-41f0-b27c-2197b6c15ee5)

๐Ÿ”ง Changes
- ๐Ÿ”„ Replace Whisper implementation with [pywhispercpp](https://github.com/vietanhdev/pywhispercpp).
- ๐Ÿ–ฅ๏ธ Add built versions:
- ๐ŸŽ MacOS
- ๐ŸชŸ Windows
- ๐Ÿง Linux

0.1.28

![Llama Assistant v0.1.28](https://github.com/user-attachments/assets/8e311537-57d4-41f0-b27c-2197b6c15ee5)

๐Ÿ”ง Changes
- ๐ŸŽ™๏ธ๐Ÿ”ฅ **Add offline STT support:** WhisperCPP. The base model is downloaded from Hugging Face. Your audio is transcribed locally on your machine. ๐Ÿ”ฅ

0.1.26

![Llama Assistant v0.1.26](https://github.com/user-attachments/assets/8e311537-57d4-41f0-b27c-2197b6c15ee5)

๐Ÿ”ง Changes
- Add binary build for MacOS with PyInstaller.

0.1.24

![Llama Assistant v0.1.20](https://github.com/user-attachments/assets/8e311537-57d4-41f0-b27c-2197b6c15ee5)

๐Ÿ› Bugfixes
- ๐Ÿ–ฑ๏ธ Fixed: wrong cursor position when inserting text.
- ๐Ÿ’ฅ Fixed: crashing when changing shortcut.
- โŒจ๏ธ Fixed: wrong shortcut keys in macOS.
- ๐Ÿ™ˆ Hide "Copy Result" or "Clear" after clearing the chat.
- ๐Ÿ”„ Handle the last response correctly.

๐Ÿ”ง Changes
- ๐Ÿงน Clear results + input when clicking "Clear".
- โŒจ๏ธ Type "clear" or "cls" to clear.
- ๐Ÿšซ Prevent action buttons when no data is input.

Page 1 of 2

ยฉ 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.