Performance optimizations
* Improved convolution performance on future Intel(R) Xeon Phi(TM) processors that support the AVX512_4FMAPS and AVX512_4VNNIW instruction groups.
* Improved convolution performance on Intel(R) Xeon(R) processors that support the Intel(R) AVX512 instruction set.
* Improved performance of GEMM-based convolutions for small minibatches.
* Improved performance of the Winograd convolution algorithm on Intel(R) Xeon Phi(TM) processors.
New functionality
* Added backpropagation support for dilated convolution.
* Extended the eltwise primitive with support for square, abs, square root, linear, bounded ReLU, soft ReLU, and logistic operations.
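
For orientation, the element-wise operations listed above can be sketched as plain scalar functions. This is a minimal reference sketch, not the library's implementation or API: the function names are hypothetical, and the roles of the `alpha` and `beta` parameters (slope and offset for linear, upper bound for bounded ReLU) are assumptions based on the conventional definitions of these operations.

```cpp
#include <algorithm>
#include <cmath>

// Hypothetical scalar reference definitions (names are illustrative only).

// square: x^2
inline float eltwise_square(float x) { return x * x; }

// abs: |x|
inline float eltwise_abs(float x) { return std::fabs(x); }

// square root: sqrt(x), meaningful for x >= 0
inline float eltwise_sqrt(float x) { return std::sqrt(x); }

// linear: alpha * x + beta (assumed parameter roles)
inline float eltwise_linear(float x, float alpha, float beta) {
    return alpha * x + beta;
}

// bounded ReLU: clamp x to [0, alpha] (alpha assumed to be the upper bound)
inline float eltwise_bounded_relu(float x, float alpha) {
    return std::min(alpha, std::max(0.0f, x));
}

// soft ReLU: log(1 + exp(x))
inline float eltwise_soft_relu(float x) { return std::log1p(std::exp(x)); }

// logistic: 1 / (1 + exp(-x))
inline float eltwise_logistic(float x) { return 1.0f / (1.0f + std::exp(-x)); }
```

The parameters are declared as `float` here to match the data type that op descriptor parameters use as of this release.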
Usability improvements
* Added macOS\* support.
Breaking changes to the API
* All real-valued parameters of op descriptors now have the `float` data type (previously `double`). This change breaks C API backward compatibility for the sum primitive. Please refer to commit 0bbb22e878a679ca870dc139b5c85e60e5ab78d3 for details. The C++ API maintains backward compatibility.
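
Code that keeps real-valued parameters in `double` may need an explicit conversion before calling the C API. The sketch below shows one way to do that, assuming the affected values are stored as a vector of `double` (for example, scaling factors for the sum primitive, which is itself an assumption about which parameters are affected). The helper name `to_float_params` is hypothetical and not part of the library.

```cpp
#include <vector>

// Hypothetical helper: narrow double-precision parameters to float so they
// can be passed to an interface that now expects `float`.
// Note: the conversion may lose precision for values that are not exactly
// representable in single precision.
inline std::vector<float> to_float_params(const std::vector<double> &params) {
    std::vector<float> out;
    out.reserve(params.size());
    for (double p : params) {
        out.push_back(static_cast<float>(p));
    }
    return out;
}
```

The resulting vector's `data()` pointer can then be supplied wherever a `const float *` is expected; consult the referenced commit for the exact signatures that changed.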
Thanks to the contributors
This release contains contributions from many Intel(R) Performance Libraries developers as well as Yu Yang (reyoung), Vladimir Mironov (vamironov), Nishant Patel (nbpatel), Leona Cook (indie), Jayaram Bobba (jbobba), and Elena Gvozdeva. We would also like to thank everyone who asked questions and reported issues.
\* Other names and brands may be claimed as the property of others.