Added:
- Install instructions for Debian, Ubuntu, ArchLinux, Manjaro, macOS.
- Server: LlamaCppParams, CandleParams, LLMParams.
- Examples: sync_demo.py, async_demo.py, langchain_sync_demo.py, langchain_async_demo.py
Changed:
- Client examples.
- Server: data/msg of type LLMParams.
- Examples use context size 512.
- Insomnia examples use context size 1024.
- Package `mlipy` renamed to `mli`.
- Streaming sync and async langchain functions.
- Less noisy output on token generation.