**Release notes**
- Fixed signed integer division so it matches Python/NumPy
- Added a TDMA (tridiagonal matrix algorithm) solver (by dionhaefner)
- Fixed many bugs and added Python tests to verify the fixes
- Added support for `np.take()` and `np.put()`
- Added support for fancy indexing of vectors, such as `a[[3,2,1]] = a[[1,2,5]]`
- Added OpenMP and OpenCL implementations of *some* LAPACK functions (as extension methods)
- Added a conjugate gradient solver
- Removed the dependency on Cheetah
- Improved the compiler cache hit rate by implementing literals as variables in OpenMP and OpenCL kernels
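
For reference, the NumPy behaviour that the integer division, `np.take()`/`np.put()`, and fancy indexing entries target can be sketched as follows. This uses plain NumPy only; the array values are illustrative assumptions and are not taken from the project's test suite.

```python
import numpy as np

# Signed integer division: Python and NumPy round toward negative infinity
# (floor division), whereas C truncates toward zero.
a = np.array([-7, 7, -7, 7])
b = np.array([2, 2, -2, -2])
quotient = a // b  # [-4, 3, 3, -4]

# np.take gathers elements by index; np.put scatters values in place.
v = np.arange(10, 20)
taken = np.take(v, [3, 2, 1])  # [13, 12, 11]
w = np.zeros(6, dtype=int)
np.put(w, [0, 2, 4], [7, 8, 9])  # w becomes [7, 0, 8, 0, 9, 0]

# Fancy indexing on both sides of an assignment, as in the note above.
# The right-hand side is evaluated in full before anything is written.
x = np.arange(10)
x[[3, 2, 1]] = x[[1, 2, 5]]  # x becomes [0, 5, 2, 1, 4, 5, 6, 7, 8, 9]
```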
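
As background for the TDMA entry, here is a minimal sketch of the tridiagonal matrix (Thomas) algorithm in plain NumPy. The function name `tdma_sketch` and its argument layout are assumptions made for illustration; they are not the signature of the solver added in this release.

```python
import numpy as np

def tdma_sketch(a, b, c, d):
    """Solve a tridiagonal system with the Thomas algorithm (hypothetical helper).

    a: sub-diagonal (a[0] is unused), b: main diagonal,
    c: super-diagonal (c[-1] is unused), d: right-hand side.
    Assumes no pivoting is needed, e.g. a diagonally dominant system.
    """
    n = len(d)
    cp = np.zeros(n)
    dp = np.zeros(n)

    # Forward sweep: eliminate the sub-diagonal.
    cp[0] = c[0] / b[0]
    dp[0] = d[0] / b[0]
    for i in range(1, n):
        denom = b[i] - a[i] * cp[i - 1]
        cp[i] = c[i] / denom
        dp[i] = (d[i] - a[i] * dp[i - 1]) / denom

    # Back substitution.
    x = np.zeros(n)
    x[-1] = dp[-1]
    for i in range(n - 2, -1, -1):
        x[i] = dp[i] - cp[i] * x[i + 1]
    return x

# Illustrative 4x4 diagonally dominant system.
sub = np.array([0.0, -1.0, -1.0, -1.0])
diag = np.array([4.0, 4.0, 4.0, 4.0])
sup = np.array([-1.0, -1.0, -1.0, 0.0])
rhs = np.array([1.0, 2.0, 3.0, 4.0])
solution = tdma_sketch(sub, diag, sup, rhs)
```

The result can be checked against a dense solve, for example by building the full matrix with `np.diag` and comparing with `np.linalg.solve`.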
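
Similarly, the conjugate gradient method can be sketched in a few lines of NumPy. This is a textbook version for symmetric positive-definite matrices; the name `conjugate_gradient_sketch` and the `tol` and `max_iter` parameters are hypothetical and do not describe the API shipped in this release.

```python
import numpy as np

def conjugate_gradient_sketch(A, b, tol=1e-10, max_iter=1000):
    """Solve A x = b for symmetric positive-definite A (hypothetical helper)."""
    x = np.zeros(len(b), dtype=float)
    r = b - A @ x          # initial residual
    p = r.copy()           # initial search direction
    rs_old = r @ r
    for _ in range(max_iter):
        if np.sqrt(rs_old) < tol:
            break
        Ap = A @ p
        alpha = rs_old / (p @ Ap)      # step length along p
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs_old) * p  # next A-conjugate direction
        rs_old = rs_new
    return x

# Illustrative 2x2 symmetric positive-definite system.
A_spd = np.array([[4.0, 1.0], [1.0, 3.0]])
b_vec = np.array([1.0, 2.0])
x_cg = conjugate_gradient_sketch(A_spd, b_vec)
```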