Numba-dpex

Latest version: v0.23.0

0.23.0

Fixed
* Array alignment problem for stack arrays allocated for kernel arguments. (1357)
* Issues 892 and 906, caused by incorrect code generation for indexing (1377)
* Generation of `KernelHasReturnValueError` error inside `KernelDispatcher`. (1394)
* Issue 1390: broken support for slicing into `dpctl.tensor.usm_ndarray` in kernels (1425)
* Support for Wheels package on Windows (1430)
* Incorrect mangled name for kernel function arguments (1443)
* Remove artifacts from conda/wheel packages residing in root level (1450)
* GDB tests to work properly on Intel Max GPU (1451)
* Improper wheels installation on unsupported platforms (1452)
* Ref-counting of Python object temporaries in unboxing code (1454)
* Segfault caused by using `malloc` to allocate `NRT_MemInfo`. Replaced with Numba's NRT `alloc` (1458)
* Incorrect package name in README.md (1463)

Added
* A new overloaded `dimensions` attribute for all index-space id classes (1359)
* Support for `AtomicRef` creation using multi-dimensional arrays (1367)
* Support for linearized indexing functions inside a JIT compiled kernel (1368)
* Improved documentation: overview (1341), kernel programming guide (1388), API docs (1414), config options (1415), comparison with SYCL API (1417)
* New `PrivateArray` class in `kernel_api` to replace `dpex.private.array` (1370, 1377)
* Support for the `libsyclinterface::DPCTLKernelArgType` enum for specifying the type of kernel args instead of hard-coding it (1382)
* New indexing unit tests for kernel_api simulator and JIT compiled modes (1378)
* New unit tests to verify all `kernel_api` features usable inside `device_func` (1391)
* A `sycl::local_accessor`-like API (`kernel_api.LocalAccessor`) for numba-dpex kernel (1331)
* Specialization support for `device_func` decorator (1398)
* Support for all `kernel_api` functions inside the `numba_dpex.kernel` decorator. (1400)
* Support for dpnp 0.15 (1434, 1464)
* Improvements to pyproject.toml configs to build numba-dpex from source. (1449)
* Load the `SPV_INTEL_variable_length_array` SPIR-V extension to support arrays in the private address space on Intel Max GPU. (1451)
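
For readers unfamiliar with the term, the linearized indexing mentioned in the list above maps a multi-dimensional work-item id to a single flat offset. The following is a minimal plain-Python sketch of that idea, assuming the row-major convention used by SYCL (the last dimension varies fastest). `linearize` is a hypothetical helper written for illustration; it is not part of the numba-dpex API.

```python
def linearize(index, extents):
    """Flatten a multi-dimensional index into a row-major linear offset.

    Hypothetical illustration only; not a numba-dpex function.
    """
    if len(index) != len(extents):
        raise ValueError("index and extents must have the same rank")
    linear = 0
    for i, n in zip(index, extents):
        if not 0 <= i < n:
            raise IndexError(f"index {i} out of range for extent {n}")
        # Shift the running offset by this dimension's extent, then add the id.
        linear = linear * n + i
    return linear


# Index (1, 2, 3) in a (4, 5, 6) range: 1*5*6 + 2*6 + 3 = 45
print(linearize((1, 2, 3), (4, 5, 6)))
```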

Changed
* Default inline threshold value set to `2` from `None`. (1385)
* Port parfor kernel templates to `kernel_api` (1416, 1424)
* Use `SPIRVKernelDispatcher` for parfor kernel dispatch (1435, 1448)
* All examples use the latest dpctl API (1431)
* Minimum required dpctl version is now 0.16.1
* Minimum required numba version is now 0.59.0 (1462)
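
The new minimum versions listed above can be checked before upgrading. The sketch below uses only the standard library and assumes the distributions are installed under the names `dpctl` and `numba`. The helpers `parse`, `meets_minimum`, and `check_environment` are hypothetical names for this illustration, and the version parsing is deliberately simplified (pre-release and local suffixes are ignored).

```python
from importlib import metadata

# Minimums taken from the 0.23.0 "Changed" entries above.
MINIMUMS = {"dpctl": "0.16.1", "numba": "0.59.0"}


def parse(version):
    """Return the leading numeric release segment, e.g. '0.59.0rc1' -> (0, 59, 0).

    Simplification: pre-release and local-version suffixes are ignored.
    """
    parts = []
    for piece in version.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, required):
    """True if the installed version is at least the required version."""
    return parse(installed) >= parse(required)


def check_environment(minimums=MINIMUMS):
    """Map each package to True/False, or None if it is not installed."""
    report = {}
    for name, required in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = None
            continue
        report[name] = meets_minimum(installed, required)
    return report


print(check_environment())
```

A value of `None` in the report means the package was not found in the current environment; `False` means it is installed but older than the minimum.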

Removed
* OpenCL-like kernel API functions (1420)
* `func` decorator (replaced by `device_func`) (1400)
* `numba_dpex.experimental.kernel` and `numba_dpex.experimental.device_func` (1400)

0.22.0

Fixed
* Bug in boxing a DpnpNdArray from parent (1155)
* Strided layouts and F-contiguous layouts supported in experimental kernel (1178)
* Barrier call code-generation on OpenCL CPU devices (1280, 1310)
* Importing numba-dpex can break numba execution (1267)
* Overhead on launching numba_dpex.kernel functions (1236)

Added
* Support for dpctl.SyclEvent data type inside dpjit (1134)
* Support for kernel_api.Range and kernel_api.NdRange inside dpjit (1148)
* DPEX_OPT: a numba-dpex-specific optimization level config option (1158)
* Uploading wheels packages to anaconda (1160)
* flake8 eradicate linter option (1177)
* Support dpctl.SyclEvent.wait call inside dpjit (1179)
* Creation of sycl event and queue inside dpjit (1193, 1190, 1218)
* Experimental kernel dispatcher for kernel compilation (1178, 1205)
* Experimental target context for SPIRV codegen (1213, 1225)
* GDB test cases in public CI (1209)
* Async kernel submission option (1219, 1249)
* A new literal type to store IntEnum as Literal types (1227)
* SYCL-like memory enum classes to the experimental module (1239)
* call_kernel function to launch kernels (1260)
* Experimental overloads for an AtomicRef class and fetch_* methods (1257, 1261)
* New device-specific USMNdArrayModel for USMNdArray and DpnpNdArray types (1293)
* Experimental atomic load, store and exchange operations (1297)
* kernel_api module to simulate kernel functions in pure Python (1304, 1326)
* Experimental implementation of group barrier operation (1280)
* Experimental atomic compare_exchange implementation (1312)
* Experimental group index class (1310)
* OpenSSF scorecard (1320)
* Experimental feature index overload methods (1323)
* Experimental feature group index overload methods (1330)
* API Documentation for kernel API (1332)

Changed
* Switch to dpc++ compiler for building numba-dpex (1210)
* Versioneer and pytest configs into pyproject.toml (1212)
* numba-dpex can be imported even if no SYCL device is detected by dpctl (1272)

Removed
* Kernel launch params as lists/tuple. Only Range/NdRange supported (1251)
* DEFAULT_LOCAL_SIZE global constant (1291)
* Functions to invoke spirv-tools utilities from spirv_generator (1292)
* Incomplete vectorize decorator from numba-dpex (1298)
* Support for Numba 0.57 (1307)
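
Since kernel launch parameters are now limited to Range and NdRange, it may help to recall the arithmetic behind an nd-range. The sketch below is plain Python and assumes the SYCL rule that each global extent must be a multiple of the corresponding local (work-group) extent. `work_group_counts` is a hypothetical helper for illustration, not a numba-dpex function.

```python
def work_group_counts(global_range, local_range):
    """Return the number of work-groups per dimension for an nd-range.

    Hypothetical illustration of the SYCL divisibility rule.
    """
    if len(global_range) != len(local_range):
        raise ValueError("global and local ranges must have the same rank")
    counts = []
    for g, l in zip(global_range, local_range):
        if l <= 0 or g % l != 0:
            raise ValueError(
                f"global extent {g} is not a multiple of local extent {l}"
            )
        counts.append(g // l)
    return tuple(counts)


# A 1024 x 64 global range split into 16 x 8 work-groups.
print(work_group_counts((1024, 64), (16, 8)))  # (64, 8)
```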

Deprecated
* OpenCL-like kernel API functions in numba_dpex.ocldecl module

0.21.4

Fixed
* Remove dead code to silence Coverity errors. (1163)

0.21.3

Fixed
* Pin CI conda channels (1133)
* Mangled kernel name generation (1112)

Added
* Support for running tests on single-precision GPUs (1143)
* Initial work on Coverity scan CI (1128)
* Python 3.11 support (1123)
* Security policy (1117)
* scikit-build to build native extensions (1107, 1116, 1127, 1139, 1140)

Changed
* Rename helper function to clearly indicate its usage (1145)
* The data model used by the DpnpNdArray type for kernel functions (1118)

Removed
* Support for Python 3.8 (1113)

0.21.2

Fixed
* Bugs (1068, 774) in atomic addition caused by improper floating point atomic emulation. (1103)

Changed
* Updated documentation and user guides (1097, 879)

Removed
* Dependency on `spirv-tools` (1103, 1108)
* Floating point atomic add emulation using `atomic_ops.cl` (1103)
* `NUMBA_DPEX_ACTIVATE_ATOMICS_FP_NATIVE` configuration option (1103)

0.21.1

Changed
* Improved support for `queue` keyword in dpnp array constructor overloads (1083)
* Improved reduction kernel example (1089)

Fixed
* Update Itanium CXX ABI Mangler reference (1080)
* Update sourceware references in docstrings (1081)
* Typo in error messages of kernel interface (1082)
