Added:
- New high-level Python API
- Low-level C API calls from `llama.h`, `llava.h`, `clip.h`, `ggml.h`
- `completions`: high-level function for LLMs / VLMs
- `text_completions`: low-level function for LLMs
- `clip_completions`: low-level function for CLIP-based VLMs
- WIP: `mllama_completions`: low-level function for Mllama-based VLMs
Changed:
- All examples
Removed:
- `llama_generate` function
- `llama_cpp_cli`
- `llava_cpp_cli`
- `minicpmv_cpp_cli`