release-note-bugfix
+ [Fix: copy function was missing in MaxPoolingCudnn](https://github.com/sony/nnabla-ext-cuda/pull/391)
release-note-core
+ [Fused weight decay into Solver update / Fix SgdW / Add Lamb solver](https://github.com/sony/nnabla-ext-cuda/pull/388)
+ [Enable customizing std types and functions with memory allocation](https://github.com/sony/nnabla-ext-cuda/pull/385)
release-note-op-layer
+ [Feature/20210608 cholesky decomposition](https://github.com/sony/nnabla-ext-cuda/pull/390)
+ [Speedup SortCuda with cub::DeviceSegmentedRadixSort](https://github.com/sony/nnabla-ext-cuda/pull/384)
release-note-utility
+ [Auto restart DALI iterator at StopIteration](https://github.com/sony/nnabla-ext-cuda/pull/389)
+ [Support NVTX range events](https://github.com/sony/nnabla-ext-cuda/pull/386)
Install the latest nnabla by:
pip install nnabla
pip install nnabla-ext-cuda102 For CUDA version 10.2 users
You can also install the cuda extension with specific versions from one of the following. [See also FAQ](https://nnabla.readthedocs.io/en/latest/python/install_on_linux.html#faq)