* Updates the bindings to work with the new llama.cpp API from https://github.com/ggerganov/llama.cpp/pull/370
* Adds two separate interfaces - `LlamaInference` which is similar to the bindings in v0.1.8 and the lower level `LlamaContext` (currently untested)
* The old bindings are still present in `PyLlama.cpp` but is currently not compiled and will be removed at a later date