oneDNN

Latest version: v2025.0.0

2.5.4

This is a patch release containing the following changes to v2.5.3:
* Improved performance for batch normalization for tbb/threadpool (421a2cef07e2fe730a8ee6bbd0c55ad7e154eb3c, 7b7b763e8e264ec46bba1772d28aad07abf20d50)
* Fixed implicit conversion from double to float in examples (866b9ac4429d2a3e9751546ba101d0df11cfb519)
* Fixed issue in int8 matmul primitive for specific shapes (035c2d42e99e79956e4fe833f01b7b6e5509913c, 9a1bf19b40ed5493d8bbcc3ef5cb4d276a85e78e)
* Fixed performance regression for matmul primitive with binary post op and broadcast (dcd61efe83cb5b64e60fa6d294320a34fb8734c3, 31dec32e71624e9c7f6b4da39a0d2da757b06906)
* Fixed performance regression in binary primitive when using NHWC layout (228493c38711bf62d4dd4b534af890c2ae6b2ad1)

2.5.3

This is a patch release containing the following changes to v2.5.2:
* Fixed accuracy issue in GELU post-op (3ff2c3d6cfe1bd0b2eee015f97c6fe0515f14bff)
* Added ability to enable code only on non-x64 systems (ff7ae00c074ea09048bcc27f905385d1f6a6b830)
* Fixed issue in reorder primitive on non-x64 systems (5917860513ec94274a157d9642695393858cd205)
* Fixed build issue on OSX11 and older cmake (d9c8bbeb884ab916d47aa57b0bec9abbd7f89542)
* Fixed assert in reorder primitive (79090bc2907c7b201aa5ecd9a7ffe3a347fe2444)
* Documentation fixes (d29075852b8713c487253e7c664b2c78e3663327, ee7eacb012b95a677c3c1dae0d05e730d99f180e, 543b8f86f4659d36dddcff1d2bddb810d3740889)
* Fixed potential division by zero in example for binary primitive (2fffd96b7461ec1088b271e2df9d1fe077d45ff0)
* Fixed SIGFPE issue in reorder primitive (8c291fca11b08786948a1e198c7bca051e525347)
* Fixed potential size overflow in inner product primitive (c10f74a0e71d79ad06d813e403e63e2ee0e2b260)
* Added logic to reduce the number of threads (tasks spawned for threadpool) for small shapes (8f885e76a2221565ababd3718a2c1441b1300780, 405398994009fb97ea137a7e300a494489c29bc7, 49ec406751d2ba03e9166d36aec94e4d6dd236bd, 2977360e146148f17d60861829a41c674856e8f6)
* Fixed SEGFAULT issue in matmul primitive (62c1170d7741c261722167d06f28d7f5e18d14ee, a993d522ff68f310186b73c5e1ec473c221c7869)
* Added bf16 support for sum post-op (3d2c37e4b069d4188741c1a8c40e6eb7404e68a2)
* Added fp:precise compiler flag for Intel Compiler identified as IntelLLVM (1558a4bfd894d73f55030d06df73584af71525d6)
* Fixed issue in bf16 convolution primitive when fused with binary (b379fd9c3715af38fb2067f39aea19fa90191024)
* Fixed issue in backward depthwise convolution (d5e4122f6429cb73312620c9100a65a0ad66a0a7, f5cac2346d6198a958f50c8be7cbf968191018aa, eeaa19c4e87c2ee96a50a0df0dadd8d045e94774)
* Fixed SEGFAULT in int8 convolution with eltwise post-op (32a629fef18b087554dabcaa983d0158654b2fe3)
* Fixed NaN issue in bf16 backward inner product (0c5e49205d63f05b72779c2e2f9419bb42144e64)
* Fixed performance regression for binary with broadcast (f79b03072dbdc373ce3a5435c41d899ddee9eddb, 58ce3c1de0e8e8c387275f0d649a3a26b726c640)
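
The "potential size overflow" item above belongs to a general class of bug: a product of tensor dimensions that no longer fits the integer type holding it. The C++17 sketch below illustrates that class only. It is not oneDNN code, and the helper names `checked_element_count` and `overflows_int32` are hypothetical.

```cpp
#include <cstdint>
#include <limits>
#include <optional>
#include <vector>

// Hypothetical helper (not part of oneDNN): number of elements in a tensor,
// accumulated in 64-bit arithmetic with an explicit overflow check.
std::optional<std::int64_t> checked_element_count(
        const std::vector<std::int64_t> &dims) {
    std::int64_t count = 1;
    for (std::int64_t d : dims) {
        if (d < 0) return std::nullopt; // negative dimensions are invalid
        if (d == 0) return std::int64_t {0}; // empty tensor
        // Reject the multiplication if count * d would exceed INT64_MAX.
        if (count > std::numeric_limits<std::int64_t>::max() / d)
            return std::nullopt;
        count *= d;
    }
    return count;
}

// True if the element count cannot be stored in a signed 32-bit integer.
bool overflows_int32(const std::vector<std::int64_t> &dims) {
    const auto count = checked_element_count(dims);
    return !count || *count > std::numeric_limits<std::int32_t>::max();
}
```

A shape such as 65536 x 65536 has 2^32 elements, so any size computed in a 32-bit `int` would wrap, while the 64-bit accumulation above reports it correctly.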

graph-v0.4.1
This is a patch release containing the following changes to [graph-v0.4](https://github.com/oneapi-src/oneDNN/releases/tag/graph-v0.4):
* Upgraded oneDNN to [v2.5.2](https://github.com/oneapi-src/oneDNN/releases/tag/v2.5.2) (b557b497, 6aae6f7a)
* Enabled MatMul + Div + Add fusions (effa3350, 3f5a8f7a, f9ffcc5c)
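
To make the MatMul + Div + Add fusion concrete, the reference sketch below computes all three operations in a single pass over the output instead of materializing intermediate tensors. It is an unoptimized illustration and not the oneDNN Graph implementation. The function name `matmul_div_add` is hypothetical, and the operand shapes (a scalar divisor and a full-size addend) are assumptions chosen for brevity.

```cpp
#include <cstddef>
#include <vector>

// Reference sketch: dst = (A x B) / divisor + addend, all row-major.
// A is M x K, B is K x N, addend and dst are M x N.
// Assumption: the divisor is a scalar and the addend has the output shape.
std::vector<float> matmul_div_add(const std::vector<float> &a,
        const std::vector<float> &b, const std::vector<float> &addend,
        float divisor, std::size_t M, std::size_t K, std::size_t N) {
    std::vector<float> dst(M * N);
    for (std::size_t m = 0; m < M; ++m) {
        for (std::size_t n = 0; n < N; ++n) {
            float acc = 0.0f;
            for (std::size_t k = 0; k < K; ++k)
                acc += a[m * K + k] * b[k * N + n];
            // Div and Add are applied while the accumulator is still at hand,
            // so no intermediate M x N tensor is written and re-read.
            dst[m * N + n] = acc / divisor + addend[m * N + n];
        }
    }
    return dst;
}
```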

2.5.2

This is a patch release containing the following changes to v2.5.1:
* Fixed performance regression in binary primitive with broadcast (b9721743614f9dcb477a86d82fc19a96dc7e5736, ff751229eeb7ff546491f54b1060c03ec241c673)
* Fixed issue with SYCL device properties initialization (cabc5ca62e1b109161bc7cfccaa0ca5ba1f7b639, 095f13e77b9440307b152590f61c6e21e0c026a5)
* Fixed issue in matmul primitive with zero points (3157354dd5498fa83f2d8da17b25138d78a5c13b)
* Fixed segmentation fault in depthwise convolution primitive for shapes with huge spatial size for processors with Intel AVX-512 support (68347644ace88ef9dc7bcbba928674e3f9ac1b08, 1d2addcf5a11a6bf034001e809cbea1c89942f0f)
* Fixed issue in forward convolution primitive for processors with Intel AVX2 support (d691137c245efab99651c95387e1713d3cf91fb7)
* Fixed performance regression on GPUs with SYCL runtime (d8364e5b4c88f27143894bb7835c65eb22770e16)

graph-v0.4
This is a technical preview for oneDNN Graph API based on [oneDNN v2.5](https://github.com/oneapi-src/oneDNN/releases/tag/v2.5).

Functionality
* Introduced bf16 inference support.
* Introduced multi-head attention (MHA) fusion supported by oneDNN Graph compiler with optimized code generation (experimental).
* Updated API to comply with [oneDNN Graph API specification v0.9](https://spec.oneapi.io/onednn-graph/latest/index.html).

Known Issues and Limitations
* Some subgraphs might not be recognized as a partition even if they match the general pattern description, due to internal implementation details.
* The weight’s opaque layout can be queried only from a compiled partition, which requires tensor shapes to be known at compilation time.
* MHA fusion is not activated on machines without AVX-512 support, as oneDNN Graph compiler generates AVX-512 and newer instructions.

Thanks to the Contributors
This release contains contributions from the project core teams as well as Jiong Gong, Chunyuan Wu, Sanchit Jain, Yiqiang Li, Yunfei Mao, Kiefer Kuah and others.

2.5.1

This is a patch release containing the following changes to v2.5:
* Improved performance of binary primitive and binary post-op with broadcast over batch and spatial dimension (6d4b092ac3aaf81cf71e85e5a639c46f942c1e5c, c4dc38a70de60a92e541e581994f6a53e90c8110, be261ab0e3dae81fbf2b41b2a4038ffb940c5c75, 3ec15b6976eab124db4a5d22a02d8d1e8a2c2001, f1c2f9f3400446addd636194305138b4e6ce8a0b)
* Fixed undefined behavior in cases where different numbers of threads are used at primitive creation and at execution (0af92ec8ad0575883c04bc14436f13a5cc02d8fa, ba2e5a95d5585d28630971eba0edf59caa3673b0, 8863e34de693072cf5d299503a2601ab4cfacabe, 57b1e7ad3d80d61b2b3f820fd08090e548acd9b7, 72b54def7d421ee57acaa60642935c8348610632, 9b394dd5e8f661a7c0582daae9dc8fc562bf8220, 2d4d88a7c7e701aacdcde164bfbc166a73e9feef, 4c3e771c109bdc09f909baf339721dea26ceb6c6, 2458105c93b5451370efee2117a137e6223a63df, 67990405dc63d01f11cc5b464e1ce0a5106e0232, edc40fd6e65ee2a7cadd33129d4873fe4e6f6ccf)
* Replaced deprecated SYCL APIs with SYCL 2020 alternatives (2c2f4a4707484e15c59adb0aac3563a2ca4f202c, a090db862cc7ae8e77365f61cb8d716b3af3af99)
* Fixed documentation formatting issues (812085dd49ffe432b49ed6b86f28c37734fc2eeb, 591a0f449295c0d971348bbcfd3a3acf454158fd, 7eadf81d3bba83f4e38044144e165213dce09234, 75a2f06b7ad30b03b0f3eee20f786b94d744a5fb, b73c8a7034b3a7b5c3e95d181c267aea0d411092, ca1eb7710121ef2bacaca79d536471abf31daf25)
* Updated Microsoft Visual Studio build instructions (add953a66fdda58237aad5c57b93e106886b3b45, 42b9904847ae8d206a87fdb0222708a4334b676a)
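
For readers unfamiliar with "broadcast over batch and spatial dimension" in the binary primitive item above, the sketch below shows the semantics for one common case: an NCHW tensor of shape {N, C, H, W} combined with a per-channel tensor of shape {1, C, 1, 1}. It is a plain reference loop, not oneDNN's optimized kernel, and the function name `binary_add_per_channel` is hypothetical.

```cpp
#include <cstddef>
#include <vector>

// Reference (unoptimized) binary add: src0 has shape {N, C, H, W} in NCHW
// order, src1 has shape {1, C, 1, 1}. The single value src1[c] is reused
// (broadcast) for every batch index and every spatial position of channel c.
std::vector<float> binary_add_per_channel(const std::vector<float> &src0,
        const std::vector<float> &src1, std::size_t N, std::size_t C,
        std::size_t H, std::size_t W) {
    std::vector<float> dst(src0.size());
    for (std::size_t n = 0; n < N; ++n)
        for (std::size_t c = 0; c < C; ++c)
            for (std::size_t hw = 0; hw < H * W; ++hw) {
                const std::size_t off = (n * C + c) * H * W + hw;
                dst[off] = src0[off] + src1[c];
            }
    return dst;
}
```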

2.5

Performance Optimizations
* Intel Architecture Processors
    * Improved performance for future Intel Xeon Scalable processors (code name Sapphire Rapids). The functionality is now enabled by default and requires Linux kernel 5.16.
    * Improved performance of matmul primitive for processors with Intel AVX-512 support.
* Intel Graphics Products
    * Introduced initial optimizations for future Xe Architecture graphics (code name Ponte Vecchio).
    * Improved pooling and layer normalization primitives performance.
* AArch64-based Processors
    * Improved softmax and logsoftmax primitives performance with Arm Compute Library (ACL).

Functionality
* Introduced support for compiler with [SYCL 2020 standard support](https://www.khronos.org/registry/SYCL/specs/sycl-2020/html/sycl-2020.html).
* Introduced support for the ICX/ICPX and DPCPP compiler drivers distributed with Intel oneAPI DPC++ Compiler on Windows.

Usability
* Added compile-time option to manage the set of supported instruction set architectures on Intel64/AMD64 processors. See [`DNNL_ENABLE_PRIMITIVE_CPU_ISA`](https://oneapi-src.github.io/oneDNN/dev_guide_build_options.html#dnnl-enable-primitive-cpu-isa) for more details. This feature further reduces the binary footprint.
* Added environment variables and build options with `ONEDNN` prefix.
* Introduced support for QNX operating system.
* Introduced support for RISC-V architecture.
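
One way an application or wrapper might honor both the new `ONEDNN`-prefixed environment variables and the older `DNNL`-prefixed ones is sketched below in C++17. The helper `get_prefixed_env` is hypothetical, and the lookup order (new prefix first, old prefix as fallback) is an assumption made for illustration, not a statement of how the library resolves the two.

```cpp
#include <cstdlib>
#include <optional>
#include <string>

// Hypothetical helper (not a oneDNN API): read a setting such as "VERBOSE"
// under the newer ONEDNN_ prefix first, then fall back to the DNNL_ prefix.
// Assumption: the newer prefix takes precedence when both are set.
std::optional<std::string> get_prefixed_env(const std::string &name) {
    static const char *const prefixes[] = {"ONEDNN_", "DNNL_"};
    for (const char *prefix : prefixes) {
        const std::string key = std::string(prefix) + name;
        if (const char *value = std::getenv(key.c_str()))
            return std::string(value);
    }
    return std::nullopt; // neither variable is set
}
```

With only `DNNL_VERBOSE=1` exported, `get_prefixed_env("VERBOSE")` yields `"1"`; once `ONEDNN_VERBOSE=2` is also exported, the same call yields `"2"`.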

Breaking Changes
* The Intel MKL-DNN compatibility API is removed. See [Transition from Intel MKL-DNN to oneDNN](https://oneapi-src.github.io/oneDNN/dev_guide_transition_to_dnnl.html) page for instructions on moving to the new API.
* Updated minimal supported ACL version to 21.11 (was 21.08).

Deprecated Functionality
* Support for Intel Xeon Phi processors is deprecated and will be removed in the next release.
* Support for SYCL 1.2.1 (aka SYCL 2017 standard) is deprecated and will be removed in future releases.

Thanks to the Contributors
This release contains contributions from the project core team as well as Aaron Franke @aaronfranke, Arthur Mitrano @aaraujom, Crefeda Rodrigues @cfRod, Diana Bite @diaena, Joel Dippold @jedippold, Joe Konno @thac0, Jonathan Deakin @jondea, Luke Ireland @LukeIreland1, Mark Ryan @markdryan, Mesut Meterelliyoz @mmeterel, Michel Migdal @Michoumichmich, Nathan John Sircombe @nSircombe, Pablo Romero @pablorcum, Peter Caday @petercad, Sergey Razumovskiy @srazumov, and Tsao Zhong @CaoZhongZ. We would also like to thank everyone who asked questions and reported issues.

2.5rc

This is a release candidate for oneDNN v2.5. Please provide feedback and submit defect reports via [GitHub issues](https://github.com/oneapi-src/oneDNN/issues/new/choose).

Performance Optimizations
* Intel Architecture Processors
    * Improved performance for future Intel Xeon Scalable processors (code name Sapphire Rapids). The functionality is disabled by default and should be enabled via [CPU dispatcher control](https://oneapi-src.github.io/oneDNN/dev_guide_cpu_dispatcher_control.html).
    * Improved performance of matmul primitive for processors with Intel AVX-512 support.
* Intel Graphics Products
    * Introduced initial optimizations for future Xe Architecture graphics (code name Ponte Vecchio).
    * Improved pooling and layer normalization primitives performance.
* AArch64-based Processors
    * Improved softmax primitive performance with Arm Compute Library (ACL).

Functionality
* Introduced support for compiler with [SYCL 2020 standard support](https://www.khronos.org/registry/SYCL/specs/sycl-2020/html/sycl-2020.html).
* Introduced support for the ICX/ICPX and DPCPP compiler drivers available in the Intel oneAPI DPC++ Compiler.

Usability
* Added compile-time option to manage the set of supported instruction set architectures on Intel64/AMD64 processors. See [`DNNL_ENABLE_PRIMITIVE_CPU_ISA`](https://oneapi-src.github.io/oneDNN/dev_guide_build_options.html#dnnl-enable-primitive-cpu-isa) for more details. This feature further reduces the binary footprint.
* Added environment variables and build options with `ONEDNN` prefix.
* Introduced support for QNX operating system.
* Introduced support for RISC-V architecture.

Breaking Changes
* The Intel MKL-DNN compatibility API is removed. See [Transition from Intel MKL-DNN to oneDNN](https://oneapi-src.github.io/oneDNN/dev_guide_transition_to_dnnl.html) page for instructions on moving to the new API.

Deprecated Functionality
* Support for Intel Xeon Phi processors is deprecated and will be removed in the next release.
* Support for SYCL 1.2.1 (aka SYCL 2017 standard) is deprecated and will be removed in future releases.

Thanks to the Contributors
This release contains contributions from the project core team as well as Aaron Franke @aaronfranke, Arthur Mitrano @aaraujom, Crefeda Rodrigues @cfRod, Diana Bite @diaena, Joel Dippold @jedippold, Joe Konno @thac0, Jonathan Deakin @jondea, Luke Ireland @LukeIreland1, Mark Ryan @markdryan, Mesut Meterelliyoz @mmeterel, Michel Migdal @Michoumichmich, Nathan John Sircombe @nSircombe, Pablo Romero @pablorcum, Peter Caday @petercad, Sergey Razumovskiy @srazumov, and Tsao Zhong @CaoZhongZ. We would also like to thank everyone who asked questions and reported issues.
