Added
- `llm_wrapper.llama_cpp_http_chat` & `llm_wrapper.llama_cpp_http`; llama.cpp low-level HTTP API support
- `llm_wrapper.llama_response` class with flexible handling across API specs
- `window` init param for for `embedding.pgvector.MessageDB`, to limit message storage per history key
Changed
- Deprecated `first_choice_text` & `first_choice_message` methods in favor of `first_choice_text` attributes on response objects
- Clarify set quite setup docs