Taichi

Latest version: v1.7.3

Safety actively analyzes 723685 Python packages for vulnerabilities to keep your Python projects secure.

Page 4 of 23

1.5.0

Deprecation Notice
- ndarray no longer accepts field_dim, replaced by the ndim argument.
- [RFC] Deprecate ti.cc backend in favor of TiRT and its C API, if you have any concerns please let us know at https://github.com/taichi-dev/taichi/issues/7629
New features
AOT
- Taichi Runtime (TiRT) now supports Apple's Metal API and OpenGL ES for compatibility on old mobile platforms. Now Taichi programs can be deployed to any mainstream consumer devices.
NOTE Taichi program deployment on mobile platforms is experimental. Please contact us at contacttaichi.graphics for long-term services.
- Taichi AOT now fully supports float16 dtype.
Ndarray
- Out of bound check is now supported on ndarrays
Improvements
Python Frontend
We now support returning a struct on LLVM-based backends (CPU and CUDA backend). The struct can contain vectors and matrices, and it can also nest with other structs. Here's an example.
Python
s0 = ti.types.struct(a=ti.math.vec3, b=ti.i16)
s1 = ti.types.struct(a=ti.f32, b=s0)

ti.kernel
def foo() -> s1:
return s1(a=1, b=s0(a=ti.math.vec3(100, 0.2, 3), b=1))

print(foo()) {'a': 1.0, 'b': {'a': [100.0, 0.2, 3.0], 'b': 1}}

Performance
- Support atomic operation on half2 for CUDA backend (with compute capability > 60). You can enable this with ti.init(half2_vectorization=True). This feature could effectively accelerate the Nerf training process, please refer to [this repo](https://github.com/taichi-dev/taichi-nerfs) for details.
GGUI
- GGUI now has no computing backend restrictions! You can now use Metal, OpenGL, AMDGPU, or DirectX 11, in addition to CPU, CUDA, Vulklan that's previously suported by GGUI.
- GGUI now has been validated on mesa's software rasterizer lavapipe, you can utilize this solution for headless server visualization, or on servers with no graphics capabilities (such as A100)
- Add the fps_limit option which adjusts the maximal frame rate in GGUI.

Full changelog:

Highlights:
- **AMDGPU backend**
- Enable shared array on amdgpu backend (7403) (by **Zeyu Li**)
- Add print kernel amdgcn (7357) (by **Zeyu Li**)
- Add amdgpu backend profiler (7330) (by **Zeyu Li**)
- **Aot module**
- Let AOT kernel inherit CallableBase and use LaunchContextBuilder (by **lin-hitonami**)
- Deprecate element shape and field dim for AOT symbolic args (7100) (by **Haidong Lan**)
- **Bug fixes**
- Fix copy_from() of StructField (7294) (by **Yi Xu**)
- Fix caching same loop invariant global vars inside nested fors (7285) (by **Lin Jiang**)
- Fix num_splits in parallel_struct_for (7121) (by **Yi Xu**)
- Fix ret_type and cast_type of UnaryOpStmt in Scalarize (7082) (by **Yi Xu**)
- **Documentation**
- Update GGUI docs with correct API (7525) (by **pengyu**)
- Fix typos and improve example code in data_oriented_class.md (7520) (by **pengyu**)
- Update gui_system.md, remove unnecessary example (7487) (by **NextoneX**)
- Fix typo in API doc (7511) (by **pengyu**)
- Update math_module (7405) (by **Zhao Liang**)
- Update hello_world.md (7400) (by **Zhao Liang**)
- Update debugging.md (7401) (by **Zhao Liang**)
- Update hello_world.md (7380) (by **Zhao Liang**)
- Update type.md (7376) (by **Zhao Liang**)
- Update kernel_function.md (7375) (by **Zhao Liang**)
- Update hello_world.md (7369) (by **Zhao Liang**)
- Update hello_world.md (7368) (by **Zhao Liang**)
- Update data_oriented_class.md (6790) (by **Zhao Liang**)
- Update hello_world.md (7367) (by **Zhao Liang**)
- Update kernel_function.md (7364) (by **Zhao Liang**)
- Update hello_world.md (7354) (by **Zhao Liang**)
- Update llvm_sparse_runtime.md (7323) (by **Gabriel Vainer**)
- Update profiler.md (7358) (by **Zhao Liang**)
- Update kernel_function.md (7356) (by **Zhao Liang**)
- Update tut.md (7352) (by **Gabriel Vainer**)
- Update type.md (7350) (by **Zhao Liang**)
- Update hello_world.md (7337) (by **Zhao Liang**)
- Update append docstring (7265) (by **Zhao Liang**)
- Update ndarray.md (7236) (by **Gabriel Vainer**)
- Update llvm_sparse_runtime.md (7215) (by **Zhao Liang**)
- Remove doc tutorial (7198) (by **Olinaaaloompa**)
- Rename tutorial doc (7186) (by **Zhao Liang**)
- Update tutorial.md (7176) (by **Zhao Liang**)
- Update math_module.md (7175) (by **Zhao Liang**)
- Update debugging.md (7173) (by **Zhao Liang**)
- Fix C++ tutorial does not display on doc site (7174) (by **Zhao Liang**)
- Update doc regarding dynamic index (7148) (by **Yi Xu**)
- Move glossary to top level (7118) (by **Zhao Liang**)
- Update type.md (7038) (by **Zhao Liang**)
- Fix docstring (7065) (by **Zhao Liang**)
- **Error messages**
- Allow IfExp on matrices when the condition is scalar (7241) (by **Lin Jiang**)
- Remove deprecations in ti.ui in 1.6.0 (7229) (by **Lin Jiang**)
- Remove deprecated ti.linalg.sparse_matrix_builder in 1.6.0 (7228) (by **Lin Jiang**)
- Remove deprecations in ASTTransformer in 1.6.0 (7226) (by **Lin Jiang**)
- Remove deprecated a.atomic_op(b) in Taichi v1.6.0 (7225) (by **Lin Jiang**)
- Remove deprecations in taichi/__init__.py in v1.6.0 (7222) (by **Lin Jiang**)
- Raise error when using deprecated ifexp on matrices (7224) (by **Lin Jiang**)
- Better error message when creating sparse snodes on backends that do not support sparse (7191) (by **Lin Jiang**)
- Raise errors when using metal sparse (7113) (by **Lin Jiang**)
- **GUI**
- GGUI use shader "factory" (GGUI rework n/N) (7271) (by **Bob Cao**)
- **Intermediate representation**
- Unified type system for internal operations (6337) (by **daylily**)
- **Language and syntax**
- Keep ti.pyfunc (7530) (by **Lin Jiang**)
- Type check assignments between tensors (7480) (by **Yi Xu**)
- Fix pylance warnings raised by ti.static (7437) (by **Zhao Liang**)
- Deprecate arithmetic operations and fill() on ti.Struct (7456) (by **Yi Xu**)
- Fix pylance warnnings by ti.random (7439) (by **Zhao Liang**)
- Fix pylance types warning (7417) (by **Zhao Liang**)
- Add better error message for dynamic snode (7238) (by **Zhao Liang**)
- Simplify the swizzle generator (7216) (by **Zhao Liang**)
- Remove the deprecated dynamic_index switch (7195) (by **Yi Xu**)
- Remove deprecated packed switch (7104) (by **Yi Xu**)
- Raise errors when using the packed switch (7125) (by **Yi Xu**)
- Fix cannot use taichi in REPL (7114) (by **Zhao Liang**)
- Remove deprecated ti.Matrix.rotation2d() (7098) (by **Yi Xu**)
- Remove filename kwarg in aot Module save() (7085) (by **Ailing**)
- Remove sourceinspect deprecation warning message (7081) (by **Zhao Liang**)
- Make slicing a single row/column of a matrix return a vector (7068) (by **Yi Xu**)
- **Miscellaneous**
- Strictly check ndim with external array (7126) (by **Haidong Lan**)

Full changelog:
- [cc] Add deprecation notice for cc backend (7651) (by **Ailing**)
- [misc] Cherry pick struct return related commits (7575) (by **Haidong Lan**)
- [Lang] Keep ti.pyfunc (7530) (by **Lin Jiang**)
- [bug] Fix symbol conflicts with taichi_cpp_tests (7528) (by **Zhanlue Yang**)
- [bug] Fix numerical issue with TensorType'd arithmetics (7526) (by **Zhanlue Yang**)
- [aot] Enable Metal AOT test (7461) (by **PENGUINLIONG**)
- [Doc] Update GGUI docs with correct API (7525) (by **pengyu**)
- [misc] Implement KernelCompialtionManager::clean_offline_cache (7515) (by **PGZXB**)
- [ir] Except shared array from demote atomics pass. (7513) (by **Haidong Lan**)
- [bug] Fix error with windows-clang compilation for cuda_runtime.cu (7519) (by **Zhanlue Yang**)
- [misc] Deprecate field dim and update deprecation warnings (7491) (by **Haidong Lan**)
- [build] Fix build failure without nvcc (7521) (by **Ailing**)
- [Doc] Fix typos and improve example code in data_oriented_class.md (7520) (by **pengyu**)
- [aot] Kernel argument count limit (7518) (by **PENGUINLIONG**)
- [Doc] Update gui_system.md, remove unnecessary example (7487) (by **NextoneX**)
- [AOT] [llvm] Let AOT kernel inherit CallableBase and use LaunchContextBuilder (by **lin-hitonami**)
- [llvm] Let the offline cache record the type info of arguments and return values (by **lin-hitonami**)
- [ir] Separate LaunchContextBuilder from Kernel (by **lin-hitonami**)
- [Doc] Fix typo in API doc (7511) (by **pengyu**)
- [aot] Build Runtime C-API by default (7508) (by **PENGUINLIONG**)
- [bug] Fix run_tests.py --with-offline-cache (7507) (by **PGZXB**)
- [vulkan] Support printing constant strings containing % (7499) (by **魔法少女赵志辉**)
- [ci] Fix nightly version number, 2nd try (7501) (by **Proton**)
- [aot] Fixed memory leak in metal backend (7500) (by **PENGUINLIONG**)
- [ci] Fix nightly version number issue (7498) (by **Proton**)
- [example] Remove cv2, cairo dependency (7496) (by **Zhao Liang**)
- [type] Let Type * be serializable (by **lin-hitonami**)
- [ci] Second attempt at permission check for ghstack landing (7490) (by **Proton**)
- [docs] Reword words of warning about building from source (7488) (by **Anselm Schüler**)
- [lang] Fixed double release of Metal command buffer (7484) (by **PENGUINLIONG**)
- [ci] Switch Android bots lock redis to bot-master (7482) (by **Proton**)
- [ci] Status check of ghstack CI bot (7479) (by **Proton**)
- [Lang] Type check assignments between tensors (7480) (by **Yi Xu**)
- [doc] Fix typo in ndarray.md (7476) (by **Chenzhan Shang**)
- [opt] Enable half2 optimization for atomic_add operations on CUDA backend (7465) (by **Zhanlue Yang**)
- [Lang] Fix pylance warnings raised by ti.static (7437) (by **Zhao Liang**)
- Let the LaunchContextBuilder manage the result buffer (by **lin-hitonami**)
- [ci] Fix nightly build failure, and minor improvements (7475) (by **Proton**)
- [ci] Fix duplicated names in aot tests (7471) (by **Ailing**)
- [lang] Improve float16 support from Taichi type system (7402) (by **Zhanlue Yang**)
- [Lang] Deprecate arithmetic operations and fill() on ti.Struct (7456) (by **Yi Xu**)
- [misc] Add out of bound check for ndarray (7458) (by **Ailing**)
- [aot] Remove graph kernel interfaces (7466) (by **PENGUINLIONG**)
- [llvm] Let the RuntimeContext use the host result buffer (by **lin-hitonami**)
- [gui] Fix 3d line drawing & add test (7454) (by **Bob Cao**)
- [lang] Fixed texture assertions (7450) (by **PENGUINLIONG**)
- [aot] Fixed header generator (7455) (by **PENGUINLIONG**)
- [aot] AOT module convention GfxRuntime140 (7440) (by **PENGUINLIONG**)
- [misc] Add an explicit error in cc backend codegen for dynamic indexing (7449) (by **Ailing**)
- [ci] Lower C++ tests concurrency (7451) (by **Proton**)
- [aot] Properly handle texture attributes (7433) (by **PENGUINLIONG**)
- [Lang] Fix pylance warnnings by ti.random (7439) (by **Zhao Liang**)
- [ir] Get the StructType of the kernel parameters (by **lin-hitonami**)
- [ci] Report failure (not throwing exception) when C++ tests fail (7435) (by **Proton**)
- [llvm] Allocate the result buffer from preallocated memory (by **lin-hitonami**)
- [vulkan] Fix GGUI and vulkan swapchain on AMD drivers (7382) (by **Bob Cao**)
- [autodiff] Handle return statement (7389) (by **Mingrui Zhang**)
- [misc] Remove unnecessary functions of gfx::AotModuleBuilderImpl (7425) (by **PGZXB**)
- [bug] Fix offline_cache::clean_offline_cache_files (ti cache clean) (7426) (by **PGZXB**)
- [test] Refactor C++ tests runner (7421) (by **Proton**)
- [ci] Adjust perfmon GPU freq (7429) (by **Proton**)
- [misc] Remove AotModuleParams::enable_lazy_loading (7424) (by **PGZXB**)
- [aot] Use graphs.json instead of TCB (7392) (by **PENGUINLIONG**)
- [refactor] Introduce KernelCompilationManager (7409) (by **PGZXB**)
- [IR] Unified type system for internal operations (6337) (by **daylily**)
- [lang] Add is_lvalue() to Expr to check writeback_binary operand (7414) (by **魔法少女赵志辉**)
- [bug] Fix get_error_string ret type typo (7418) (by **Zeyu Li**)
- [aot] Reorganize graph argument creation process (7412) (by **PENGUINLIONG**)
- [Amdgpu] Enable shared array on amdgpu backend (7403) (by **Zeyu Li**)
- [Lang] Fix pylance types warning (7417) (by **Zhao Liang**)
- [aot] Simplify device capability assignment (7407) (by **PENGUINLIONG**)
- [Doc] Update math_module (7405) (by **Zhao Liang**)
- [ci] Lock GPU frequency in perf benchmarking (7413) (by **Proton**)
- [ci] Add 'Needed single revision' workaround to all tasks (7408) (by **Proton**)
- [Doc] Update hello_world.md (7400) (by **Zhao Liang**)
- [refactor] Introduce KernelCompiler and implement spirv::KernelCompiler (7371) (by **PGZXB**)
- [Amdgpu] Add print kernel amdgcn (7357) (by **Zeyu Li**)
- [Doc] Update debugging.md (7401) (by **Zhao Liang**)
- [refactor] Disable ASTSerializer::allow_undefined_visitor (7391) (by **PGZXB**)
- [amdgpu] Enable llvm FpOpFusion option on AMDGPU backend (7398) (by **Zeyu Li**)
- [aot] Add test for shared array (7387) (by **Ailing**)
- [vulkan] Change command list submit error message & misc device API cleanups (7395) (by **Bob Cao**)
- [bug] Fix arch_uses_spirv (7399) (by **PGZXB**)
- [gui] Fix ggui & vulkan swapchain sizes on HiDPI displays (7394) (by **Bob Cao**)
- [Doc] Update hello_world.md (7380) (by **Zhao Liang**)
- [aot] Remove support for depth24stencil8 format on Metal (7377) (by **PENGUINLIONG**)
- [bug] Add DeviceCapabilityConfig to offline cache key (7384) (by **PGZXB**)
- [Doc] Update type.md (7376) (by **Zhao Liang**)
- [refactor] Remove dependencies on Callable::program in cpp tests (7373) (by **PGZXB**)
- [lang] Experimental support of conjugate gradient solver (7035) (by **pengyu**)
- [aot] Metal interop APIs (7366) (by **PENGUINLIONG**)
- [Doc] Update kernel_function.md (7375) (by **Zhao Liang**)
- [gui] Add `fps_limit` for GGUI (7374) (by **Bob Cao**)
- [Doc] Update hello_world.md (7369) (by **Zhao Liang**)
- [aot] Fix blockers in static library build with XCode (7365) (by **PENGUINLIONG**)
- [vulkan] Remove GLFW from Vulkan rhi dependency (7351) (by **Bob Cao**)
- [misc] Remove useless semicolon in llvm_program.h (7372) (by **PGZXB**)
- [Doc] Update hello_world.md (7368) (by **Zhao Liang**)
- [Amdgpu] Add amdgpu backend profiler (7330) (by **Zeyu Li**)
- [lang] Stop broadcasting scalar cond in select statements (7344) (by **魔法少女赵志辉**)
- [bug] Fix validation erros due to inactive VK_KHR_16bit_storage (7360) (by **Zhanlue Yang**)
- [aot] Support texture in Metal (7363) (by **PENGUINLIONG**)
- [Doc] Update data_oriented_class.md (6790) (by **Zhao Liang**)
- [Doc] Update hello_world.md (7367) (by **Zhao Liang**)
- [refactor] Introduce lang::CompiledKernelData (7340) (by **PGZXB**)
- [bug] Fix matrix initialization error with numpy.floating data (7362) (by **Zhanlue Yang**)
- [Doc] Update kernel_function.md (7364) (by **Zhao Liang**)
- [test] [amdgpu] Fix bug with allocs bb in function body (7308) (by **Zeyu Li**)
- [Doc] Update hello_world.md (7354) (by **Zhao Liang**)
- [aot] Fixed C-API docs (7361) (by **PENGUINLIONG**)
- [refactor] Remove dependencies on Callable::program in lang::CompiledGraph::run (7288) (by **PGZXB**)
- [DOC] Update llvm_sparse_runtime.md (7323) (by **Gabriel Vainer**)
- [Doc] Update profiler.md (7358) (by **Zhao Liang**)
- [Doc] Update kernel_function.md (7356) (by **Zhao Liang**)
- [aot] Improve Taichi C++ wrapper implementation (7347) (by **PENGUINLIONG**)
- [Doc] Update tut.md (7352) (by **Gabriel Vainer**)
- [ci] Add doc snippet CI requirements (7355) (by **Proton**)
- [amdgpu] Update device memory free (7346) (by **Zeyu Li**)
- [Doc] Update type.md (7350) (by **Zhao Liang**)
- [aot] Enable 16-bit dtype support for Taichi AOT (7315) (by **Zhanlue Yang**)
- [example] Re-implement the Cornell Box demo with shorter lines of code (7252) (by **HK-SHAO**)
- [aot] AOT CI refactorization (7339) (by **PENGUINLIONG**)
- [llvm] Let the kernel return struct (by **lin-hitonami**)
- [Doc] Update hello_world.md (7337) (by **Zhao Liang**)
- [ci] Reduce doc test concurrency (7336) (by **Proton**)
- [ir] Refactor result fetching (by **lin-hitonami**)
- [ir] Get the offsets of elements in StructType (by **lin-hitonami**)
- [misc] Delete test.py (7332) (by **Bob Cao**)
- [vulkan] More subgroup operations (7328) (by **Bob Cao**)
- [vulkan] Add vulkan profiler (7295) (by **Haidong Lan**)
- [refactor] Move TaichiLLVMContext::runtime_jit_module and TaichiLLVMContext::create_jit_module() to LlvmRuntimeExecutor (7320) (by **PGZXB**)
- [refactor] Remove dependencies on LlvmProgramImpl::get_llvm_context() in TaskCodeGenLLVM (7321) (by **PGZXB**)
- [ci] Checkout with privileged token when landing ghstack PRs (7331) (by **Proton**)
- [ir] Add fields to StructType (by **lin-hitonami**)
- [gui] Remove renderable reuse & make renderable immediate (7327) (by **Bob Cao**)
- [Gui] GGUI use shader "factory" (GGUI rework n/N) (7271) (by **Bob Cao**)
- [bug] Fix u64 field cannot be assigned value >= 2 ** 63 (7319) (by **Lin Jiang**)
- [type] Let the compute type of quant uint be unsigned int (by **lin-hitonami**)
- [doc] Replace slack with discord (7318) (by **yanqingzhang**)
- [refactor] Change print statement to warnings.warn in taichi.lang.util.warning (7301) (by **Jett Chen**)
- [ci] ChatOps: ghstack land (7314) (by **Proton**)
- [refactor] Remove TaichiLLVMContext::lookup_function_pointer() (7312) (by **PGZXB**)
- [misc] Update MSVC flags (7254) (by **Bob Cao**)
- [doc] [ci] Cover code snippets in docs (7309) (by **Proton**)
- [refactor] Remove dependencies on LlvmProgramImpl::get_llvm_context() in KernelCodeGen (7289) (by **PGZXB**)
- [rhi] Device upload readback functions (7278) (by **Bob Cao**)
- [aot] Fixed external project inclusion (7297) (by **PENGUINLIONG**)
- [Doc] Update append docstring (7265) (by **Zhao Liang**)
- [refactor] Remove dependencies on Callable::program in lang::get_hashed_offline_cache_key (7287) (by **PGZXB**)
- [ci] [amdgpu] Enable amdgpu backend python unit tests (7293) (by **Zeyu Li**)
- [Bug] Fix copy_from() of StructField (7294) (by **Yi Xu**)
- [ci] Adapt new Android phone behavior (7306) (by **Proton**)
- [Bug] Fix caching same loop invariant global vars inside nested fors (7285) (by **Lin Jiang**)
- [amdgpu] Part5 enable the api of amdgpu (7202) (by **Zeyu Li**)
- [amdgpu] Enable struct for on amdgpu backend (7247) (by **Zeyu Li**)
- [misc] Update external/asset which was accidentally downgraded in 7248 (7284) (by **Lin Jiang**)
- [amdgpu] Update runtime module (7248) (by **Zeyu Li**)
- [llvm] Remove unused argument 'arch' in LlvmProgramImpl::get_llvm_context (7282) (by **Lin Jiang**)
- [misc] Remove deprecated kwarg in rw_texture type annotations (7267) (by **Ailing**)
- [ci] Tolerate duplicates when registering version (7281) (by **Proton**)
- [gui] Fix GGUI destruction order (7279) (by **Bob Cao**)
- [doc] Rename /doc/ndarray_android to /doc/tutorial (7273) (by **Lin Jiang**)
- [llvm] Unify the llvm context of host and device (7249) (by **Lin Jiang**)
- [misc] Fix manylinux2014 warning not printing (7270) (by **Proton**)
- [ci] Building: add complete PATH set for conda (7268) (by **Proton**)
- [autodiff] Support rsqrt operator (7259) (by **Mingrui Zhang**)
- [ci] Update pre-commit repos version (7257) (by **Proton**)
- [refactor] Fix "const CompileConfig *" to "const CompileConfig &" (Part2) (7253) (by **PGZXB**)
- [refactor] Fix "const CompileConfig *" to "const CompileConfig &" (7243) (by **PGZXB**)
- [aot] Added third-party render thread task injection for Unity (7151) (by **PENGUINLIONG**)
- [aot] Support statically linked C-API library on MacOS (7207) (by **Zhanlue Yang**)
- [gui] Force GGUI to go through host memory (nuking interops) (7218) (by **Bob Cao**)
- [Error] Allow IfExp on matrices when the condition is scalar (7241) (by **Lin Jiang**)
- [bug] Fix the parity of the RNG (7239) (by **Lin Jiang**)
- [Lang] Add better error message for dynamic snode (7238) (by **Zhao Liang**)
- [DOC] Update ndarray.md (7236) (by **Gabriel Vainer**)
- [Error] Remove deprecations in ti.ui in 1.6.0 (7229) (by **Lin Jiang**)
- [Doc] Update llvm_sparse_runtime.md (7215) (by **Zhao Liang**)
- [lang] Add validation checks for subscripts to reject negative indices (7212) (by **Zhanlue Yang**)
- [refactor] Remove legacy num_bits and acc_offsets from AxisExtractor (7227) (by **Yi Xu**)
- [Error] Remove deprecated ti.linalg.sparse_matrix_builder in 1.6.0 (7228) (by **Lin Jiang**)
- [Error] Remove deprecations in ASTTransformer in 1.6.0 (7226) (by **Lin Jiang**)
- [misc] Export DeviceAllocation into Python & support devalloc in field_info (7233) (by **Bob Cao**)
- [gui] Use templated bulk copy to simplify VBO preperation (7234) (by **Bob Cao**)
- [rhi] Add create_image_unique stub & misc RHI bug fixes (7232) (by **Bob Cao**)
- [opengl] Fix GLFW global context issue (7230) (by **Bob Cao**)
- [examples] Remove dependency on `ti.u8` compute type for ngp (7220) (by **Bob Cao**)
- [refactor] Remove Kernel::offload_to_executable (7210) (by **PGZXB**)
- [opengl] RW image binding & FP16 support (7219) (by **Bob Cao**)
- [Error] Remove deprecated a.atomic_op(b) in Taichi v1.6.0 (7225) (by **Lin Jiang**)
- [Error] Remove deprecations in taichi/__init__.py in v1.6.0 (7222) (by **Lin Jiang**)
- [Error] Raise error when using deprecated ifexp on matrices (7224) (by **Lin Jiang**)
- [refactor] Remove legacy BitExtractStmt (7221) (by **Yi Xu**)
- [amdgpu] Part4 link bitcode file (7180) (by **Zeyu Li**)
- [example] Reorganize example oit_renderer (7208) (by **Lin Jiang**)
- [aot] Fix ndarray aot with information from type hints (7214) (by **Ailing**)
- [gui] Fix wide line support on macOS (7205) (by **Bob Cao**)
- [Lang] Simplify the swizzle generator (7216) (by **Zhao Liang**)
- [refactor] Split constructing and compilation of lang::Function (7209) (by **PGZXB**)
- [doc] Fix netlify build command (7217) (by **Ailing**)
- [ci] M1 buildbot release tag (7213) (by **Proton**)
- [misc] Remove unused task_funcs (7211) (by **PGZXB**)
- [refactor] Program::this_thread_config() -> Program::compile_config() (7199) (by **PGZXB**)
- [doc] Fix format issues of windows debugging (7197) (by **Olinaaaloompa**)
- [aot] More OpenGL interop in C-API (7204) (by **PENGUINLIONG**)
- [metal] Disable a kernel test in offline cache to unblock CI (7154) (by **Ailing**)
- [ci] Switch Windows build script to build.py (6993) (by **Proton**)
- [misc] Update submodule taichi_assets (7203) (by **Lin Jiang**)
- [mac] Use ObjectLinkingLayer instead of RTDyldObjectLinkingLayer for aarch64 mac (7201) (by **Ailing**)
- [misc] Remove unused Program::jit_evaluator_id (7200) (by **PGZXB**)
- [misc] Remove legacy latex generation (7196) (by **Yi Xu**)
- [Lang] Remove the deprecated dynamic_index switch (7195) (by **Yi Xu**)
- [bug] Fix check_matched() failure with Ndarray holding TensorType'd element (7178) (by **Zhanlue Yang**)
- [Doc] Remove doc tutorial (7198) (by **Olinaaaloompa**)
- [bug] Fix example circle-packing (7194) (by **Lin Jiang**)
- [aot] C-API opengl runtime interop (7120) (by **damnkk**)
- [Error] Better error message when creating sparse snodes on backends that do not support sparse (7191) (by **Lin Jiang**)
- [example] Fix ti gallery close warning (7187) (by **Zhao Liang**)
- [lang] Interface refactors for MatrixType and VectorType (7143) (by **Zhanlue Yang**)
- [aot] Find Taichi in python wheel (7181) (by **PENGUINLIONG**)
- [gui] Update circles rendering to use quads (7163) (by **Bob Cao**)
- [Doc] Rename tutorial doc (7186) (by **Zhao Liang**)
- [ir] Fix gcc cannot compile inline template specialization (7179) (by **Lin Jiang**)
- [Doc] Update tutorial.md (7176) (by **Zhao Liang**)
- [aot] Replace std::exchange with local implementation for C++11 (7170) (by **PENGUINLIONG**)
- [ci] Fix near cache urls (missing comma) (7158) (by **Proton**)
- [docs] Create windows_debug.md (7164) (by **Bob Cao**)
- [Doc] Update math_module.md (7175) (by **Zhao Liang**)
- [aot] FindTaichi CMake module to help outside project integration (7168) (by **PENGUINLIONG**)
- [aot] Removed unused archs in C-API (7167) (by **PENGUINLIONG**)
- [Doc] Update debugging.md (7173) (by **Zhao Liang**)
- [refactor] Remove dependencies on Program::this_thread_config() in irpass::constant_fold (7159) (by **PGZXB**)
- [Doc] Fix C++ tutorial does not display on doc site (7174) (by **Zhao Liang**)
- [aot] C++ wrapper for memory slice and memory allocation with host access (7171) (by **PENGUINLIONG**)
- [aot] Fixed ti_get_last_error signature (7165) (by **PENGUINLIONG**)
- [misc] Log to stderr instead of stdout (7166) (by **PENGUINLIONG**)
- [aot] C-API get version wrapper (7169) (by **PENGUINLIONG**)
- [doc] Fix spelling of "paticle_field" (7024) (by **Xiang (Kevin) Li**)
- [misc] Remove useless Program::sync (7160) (by **PGZXB**)
- [doc] Update accelerate_python.md to use ti.max (7161) (by **Tao Jin**)
- [doc] Add doc ndarray (7157) (by **Olinaaaloompa**)
- [mac] Add .dylib and .cmake to built wheel (7156) (by **Ailing**)
- [refactor] Remove dependencies on Program::this_thread_config() in some tests (7155) (by **PGZXB**)
- [refactor] Remove dependencies on Program::this_thread_config() in llvm backends codegen (7153) (by **PGZXB**)
- [Lang] Remove deprecated packed switch (7104) (by **Yi Xu**)
- [example] Update quaternion arithmetics in fractal_3d_ggui (7139) (by **Zhao Liang**)
- [doc] Update field.md (Fields advanced) (6867) (by **Gabriel Vainer**)
- [ci] Use make_changelog.py to generate the full changelog (7152) (by **Lin Jiang**)
- [refactor] Rename Callable::*arg* to Callable::*param* (7133) (by **PGZXB**)
- [aot] Introduce new AOT deployment tutorial (7144) (by **PENGUINLIONG**)
- [bug] Unify error message matching with/without validation layers for CapiTest.FailMapDeviceOnlyMemory (7110) (by **Zhanlue Yang**)
- [lang] Remove redundant TensorType expansion for function returns (7124) (by **Zhanlue Yang**)
- [lang] Sign python library for Apple M1 (7138) (by **PENGUINLIONG**)
- [gui] Fix particle size limits (7149) (by **Bob Cao**)
- [lang] Migrate TensorType expansion in MatrixType/VectorType from Python code to Frontend IR (7127) (by **Zhanlue Yang**)
- [aot] Support texture arguments for AOT kernels (7142) (by **Zhanlue Yang**)
- [metal] Retain Metal commandBuffers & build command buffers directly (7137) (by **Bob Cao**)
- [rhi] Update `create_pipeline` API and add support of VkPipelineCache (7091) (by **Bob Cao**)
- [autodiff] Support grad in ndarray (6906) (by **PhrygianGates**)
- [Doc] Update doc regarding dynamic index (7148) (by **Yi Xu**)
- [refactor] Remove dependencies on Program::this_thread_config() in spirv::lower (7134) (by **PGZXB**)
- [Misc] Strictly check ndim with external array (7126) (by **Haidong Lan**)
- [ci] Run test when pushing to rc branches (7146) (by **Lin Jiang**)
- [refactor] Remove dependencies on Program::this_thread_config() in KernelCodeGen (7086) (by **PGZXB**)
- [ci] Disable backward_cpp on macOS (7145) (by **Proton**)
- [gui] Fix scene line renderable (7131) (by **Bob Cao**)
- [refactor] Remove useless Kernel::from_cache_ (7132) (by **PGZXB**)
- [cpu] Reuse VirtualMemoryAllocator for CPU ndarray memory allocation (7128) (by **Ailing**)
- [Lang] Raise errors when using the packed switch (7125) (by **Yi Xu**)
- [ci] Temporarily disable ad_external_array on Metal (7136) (by **Bob Cao**)
- [Error] Raise errors when using metal sparse (7113) (by **Lin Jiang**)
- [aot] AOT compat test in workflow (7033) (by **damnkk**)
- [Lang] Fix cannot use taichi in REPL (7114) (by **Zhao Liang**)
- [lang] Free ndarray memory when it's GC-ed in Python (7072) (by **Ailing**)
- [lang] Migrate TensorType expansion for FuncCallExpression from Python code to Frontend IR (6980) (by **Zhanlue Yang**)
- [amdgpu] Part2 add runtime (6482) (by **Zeyu Li**)
- [refactor] Remove dependencies on Program::this_thread_config() in codegen_cc.cpp (7088) (by **PGZXB**)
- [refactor] Remove dependencies on Program::this_thread_config() in gfx::run_codegen (7089) (by **PGZXB**)
- [Bug] Fix num_splits in parallel_struct_for (7121) (by **Yi Xu**)
- [Doc] Move glossary to top level (7118) (by **Zhao Liang**)
- [metal] Update Metal RHI impl & add support for shared arrays (7107) (by **Bob Cao**)
- [ci] Update amdgpu ci (7117) (by **Zeyu Li**)
- [refactor] Move Kernel::lower() outside the taichi::lang::Kernel (7048) (by **PGZXB**)
- [amdgpu] Part1 add codegen (6469) (by **Zeyu Li**)
- [Aot] Deprecate element shape and field dim for AOT symbolic args (7100) (by **Haidong Lan**)
- [refactor] Remove Program::current_ast_builder() (7075) (by **PGZXB**)
- [aot] Switch Metal to SPIR-V codegen (7093) (by **PENGUINLIONG**)
- [Lang] Remove deprecated ti.Matrix.rotation2d() (7098) (by **Yi Xu**)
- [doc] Modified some errors in the function examples (7094) (by **welann**)
- [ci] More Windows git hacks (7102) (by **Proton**)
- [Lang] Remove filename kwarg in aot Module save() (7085) (by **Ailing**)
- [aot] Rename device capability atomic_i64 to atomic_int64 for consistency (7095) (by **PENGUINLIONG**)
- [Lang] Remove sourceinspect deprecation warning message (7081) (by **Zhao Liang**)
- [example] Remove gui warning message (7090) (by **Zhao Liang**)
- [refactor] Remove unnecessary Kernel::arch (7074) (by **PGZXB**)
- [refactor] Remove unnecessary parameter of irpass::scalarize (7087) (by **PGZXB**)
- [Bug] Fix ret_type and cast_type of UnaryOpStmt in Scalarize (7082) (by **Yi Xu**)
- [lang] Migrate TensorType expansion for TextureOpExpression from Python code to Frontend IR (6968) (by **Zhanlue Yang**)
- [lang] Migrate TensorType expansion for ReturnStmt from Python code to Frontend IR (6946) (by **Zhanlue Yang**)
- [doc] Update ndarray deprecation warning to 1.5.0 (7083) (by **Haidong Lan**)
- [amdgpu] Update amdgpu module call (7022) (by **Zeyu Li**)
- [amdgpu] Add convert addressspace pass related unit test (7023) (by **Zeyu Li**)
- [ir] Let real function return nested StructType (by **lin-hitonami**)
- [ir] Replace FuncCallExpression with FrontendFuncCallStmt (by **lin-hitonami**)
- [example] Update gallery images (7053) (by **Zhao Liang**)
- [Doc] Update type.md (7038) (by **Zhao Liang**)
- [misc] Bump version to v1.5.0 (7077) (by **Lin Jiang**)
- [rhi] Update Stream `new_command_list` API (7073) (by **Bob Cao**)
- [Doc] Fix docstring (7065) (by **Zhao Liang**)
- [ci] Workaround windows checkout 'Needed a single revision' issue (7078) (by **Proton**)
- [Lang] Make slicing a single row/column of a matrix return a vector (7068) (by **Yi Xu**)

1.4.1

Highlights:

Full changelog:
- [ci] Tolerate duplicates when registering version (7281) (by **Proton**)
- [misc] Fix manylinux2014 warning not printing (7270) (by **Proton**)
- [misc] Bump version to 1.4.1 (by **Lin Jiang**)
- [misc] Update submodule taichi_assets (7203) (by **Lin Jiang**)
- [bug] Fix example circle-packing (7194) (by **Lin Jiang**)

1.4.0

Deprecation Notice
- Support for sparse SNodes on the Metal backend has been removed.
- ti.Matrix.rotation2d() has been removed.
- The packed switch in ti.init() has been removed.
- The dynamic_index switch in ti.init() is now deprecated and will be removed in v1.5.0. See the feature introduction below for details.
- Slicing from a single row/column of a matrix (e.g.a[x, a:b]) now returns a vector instead of a matrix.

New features
AOT

Taichi AOT is officially available in Taichi v1.4.0, along with a native Taichi Runtime (TiRT) library taichi_c_api. Native applications can now load compiled AOT modules and launch Taichi kernels without a Python interpreter.

In this release, TiRT has stabilized the Vulkan backend on desktop platforms and Android. You can find prebuilt TiRT binaries on the release page. You can refer to a comprehensive tutorial on the doc site; the detailed TiRT C-API documentation is available at https://docs.taichi-lang.org/docs/taichi_core.

Ndarray
Taichi ndarray is now formally released in v1.4.0. The ndarray is an array object that holds contiguous multi-dimensional data to allow easy exchange with external libraries. See documentation for more details.

Dynamic index
Before v1.4.0, when you wanted to access a vector/matrix with a runtime variable instead of a compile-time constant, you had to set ti.init(dynamic_index=True). However, that option only works for LLVM-based backends (CPU & CUDA) and may slow down runtime performance because all matrices are affected. Starting from v1.4.0, that option is no longer needed. You can use variable indices whenever necessary on all backends without affecting the performance of those matrices with only constant indices.

Improvements
Performance
- The compilation speed has been optimized by ~2x.

Example list & ti gallery
Since v1.0.0, we have been enriching our taichi example collection, bringing the number of demos in the gallery window from eight to twelve. Run ti gallery to check out some new demos!
![image](https://user-images.githubusercontent.com/90667349/212647128-8421782f-b18c-4fec-8392-812c95406e16.png)
Bug fixes
- Incorrect behavior of struct fors on sparse SNodes in certain cases has been fixed. (7121)
- CUDA will no longer allocate extra device memory when performing `to_numpy()` and `from_numpy()`. (7008)
- StructType is now allowed as a type hint to ti.func. (6964)
- Incorrect recompilation caused by filling in a matrix field with the same matrix has been fixed. (6951)
- Matrix type inference has been fixed. (6928)
- Getting 64-bit data from ndarrays in the Python scope is now handled correctly. (6836)
- Name collision problem in ti.dataclass has been fixed. (6737)

Highlights:
- **Aot module**
- Deprecate element shape and field dim for AOT symbolic args (7100) (by **Haidong Lan**)
- **Bug fixes**
- Fix num_splits in parallel_struct_for (7121) (by **Yi Xu**)
- Fix ret_type and cast_type of UnaryOpStmt in Scalarize (7082) (by **Yi Xu**)
- Fix getting 64-bit data from ndarray in Python scope (6836) (by **Yi Xu**)
- Avoid overwriting global tmp with dynamic_index=True (6820) (by **Yi Xu**)
- **Build system**
- Deprecate export_core (7028) (by **Zhanlue Yang**)
- **Command line interface**
- Add "ti cache clean" command to clean the offline cache files manually (6937) (by **PGZXB**)
- **Documentation**
- Update tutorial.md (7176) (by **Zhao Liang**)
- Update math_module.md (7175) (by **Zhao Liang**)
- Update debugging.md (7173) (by **Zhao Liang**)
- Fix C++ tutorial does not display on doc site (7174) (by **Zhao Liang**)
- Update doc regarding dynamic index (7148) (by **Yi Xu**)
- Move glossary to top level (7118) (by **Zhao Liang**)
- Update type.md (7038) (by **Zhao Liang**)
- Fix docstring (7065) (by **Zhao Liang**)
- Remove packed mode in doc (7030) (by **Zhao Liang**)
- Minor doc update (6952) (by **Zhao Liang**)
- Glossary (6101) (by **Olinaaaloompa**)
- Update dac (6875) (by **Gabriel Vainer**)
- Update faq.md (6921) (by **Zhao Liang**)
- Update dataclass.md (6876) (by **Gabriel Vainer**)
- Update the documentation about Dynamic SNode (6752) (by **Lin Jiang**)
- Stop mentioning packed mode (6755) (by **Yi Xu**)
- **Error messages**
- Raise errors when using metal sparse (7113) (by **Lin Jiang**)
- Do not show warning when the offline cache path does not exist (7005) (by **PGZXB**)
- **GUI**
- Support colored texts (7036) (by **Dunfan Lu**)
- **Intermediate representation**
- Allow a maximum of 12 SNode indices (6901) (by **Dunfan Lu**)
- **Language and syntax**
- Raise errors when using the packed switch (7125) (by **Yi Xu**)
- Fix cannot use taichi in REPL (7114) (by **Zhao Liang**)
- Remove deprecated ti.Matrix.rotation2d() (7098) (by **Yi Xu**)
- Remove filename kwarg in aot Module save() (7085) (by **Ailing**)
- Remove sourceinspect deprecation warning message (7081) (by **Zhao Liang**)
- Make slicing a single row/column of a matrix return a vector (7068) (by **Yi Xu**)
- Deprecate the dynamic_index switch (7071) (by **Yi Xu**)
- Add irpass::eliminate_immutable_local_vars() test cases for TensorType (7043) (by **Zhanlue Yang**)
- Fix gui docstring (7003) (by **Zhao Liang**)
- Support dynamic indexing in spirv (6990) (by **Yi Xu**)
- Support dynamic indexing in metal (6985) (by **Yi Xu**)
- Support LU sparse solver on CUDA backend (6967) (by **pengyu**)
- Fix struct type problem (6949) (by **Zhao Liang**)
- Add warning message when converting dynamic snode to numpy (6853) (by **Zhao Liang**)
- Deprecate sourceinspect dependency (6894) (by **Zhao Liang**)
- Warn users if ndarray size is out of int32 boundary (6846) (by **Yi Xu**)
- Remove the real_matrix switch (6885) (by **Yi Xu**)
- Enable real_matrix and real_matrix_scalarize by default (6801) (by **Zhanlue Yang**)
- Raise an error for the semantic change of transpose() (6813) (by **Yi Xu**)
- Add bool type in python as an alias to i32 (6742) (by **daylily**)
- Add deprecation warning for the removal of the packed switch (6753) (by **Yi Xu**)
- **Metal backend**
- Raise deprecate warning and error when using sparse snodes on metal (6739) (by **Lin Jiang**)
- **Miscellaneous**
- Strictly check ndim with external array (7126) (by **Haidong Lan**)
- Refactored flattend_values() to avoid potential conflicts in flattened statements (6749) (by **Zhanlue Yang**)

Full changelog:
- [Doc] Update tutorial.md (7176) (by **Zhao Liang**)
- [aot] (cherry-pick) Removed unused archs in C-API (7167), FindTaichi CMake module to help outside project integration (7168) (7177) (by **PENGUINLIONG**)
- [docs] Create windows_debug.md (7164) (by **Bob Cao**)
- [Doc] Update math_module.md (7175) (by **Zhao Liang**)
- [Doc] Update debugging.md (7173) (by **Zhao Liang**)
- [Doc] Fix C++ tutorial does not display on doc site (7174) (by **Zhao Liang**)
- [doc] Fix spelling of "paticle_field" (7024) (by **Xiang (Kevin) Li**)
- [doc] Update accelerate_python.md to use ti.max (7161) (by **Tao Jin**)
- [aot] Fixed ti_get_last_error signature (7165) (by **PENGUINLIONG**)
- [example] Update quaternion arithmetics in fractal_3d_ggui (7139) (by **Zhao Liang**)
- [doc] Add doc ndarray (7157) (by **Olinaaaloompa**)
- [doc] Update field.md (Fields advanced) (6867) (by **Gabriel Vainer**)
- [ci] Use make_changelog.py to generate the full changelog (7152) (by **Lin Jiang**)
- [aot] Introduce new AOT deployment tutorial (7144) (by **PENGUINLIONG**)
- [Doc] Update doc regarding dynamic index (7148) (by **Yi Xu**)
- [Misc] Strictly check ndim with external array (7126) (by **Haidong Lan**)
- [ci] Run test when pushing to rc branches (7146) (by **Lin Jiang**)
- [ci] Disable backward_cpp on macOS (7145) (by **Proton**)
- [gui] Fix scene line renderable (7131) (by **Bob Cao**)
- [Lang] Raise errors when using the packed switch (7125) (by **Yi Xu**)
- [cpu] Reuse VirtualMemoryAllocator for CPU ndarray memory allocation (7128) (by **Ailing**)
- [ci] Temporarily disable ad_external_array on Metal (7136) (by **Bob Cao**)
- [Error] Raise errors when using metal sparse (7113) (by **Lin Jiang**)
- [misc] Cherry-pick 7072 into rc-v1.4.0 (7135) (by **Ailing**)
- [aot] Rename device capability atomic_i64 to atomic_int64 for consistency (7095) (by **PENGUINLIONG**)
- [Lang] Fix cannot use taichi in REPL (7114) (by **Zhao Liang**)
- [Bug] Fix num_splits in parallel_struct_for (7121) (by **Yi Xu**)
- [Doc] Move glossary to top level (7118) (by **Zhao Liang**)
- [Aot] Deprecate element shape and field dim for AOT symbolic args (7100) (by **Haidong Lan**)
- [Lang] Remove deprecated ti.Matrix.rotation2d() (7098) (by **Yi Xu**)
- [doc] Modified some errors in the function examples (7094) (by **welann**)
- [ci] More Windows git hacks (7102) (by **Proton**)
- [Lang] Remove filename kwarg in aot Module save() (7085) (by **Ailing**)
- [Lang] Remove sourceinspect deprecation warning message (7081) (by **Zhao Liang**)
- [example] Remove gui warning message (7090) (by **Zhao Liang**)
- [Bug] Fix ret_type and cast_type of UnaryOpStmt in Scalarize (7082) (by **Yi Xu**)
- [doc] Update ndarray deprecation warning to 1.5.0 (7083) (by **Haidong Lan**)
- [example] Update gallery images (7053) (by **Zhao Liang**)
- [Doc] Update type.md (7038) (by **Zhao Liang**)
- [Doc] Fix docstring (7065) (by **Zhao Liang**)
- [Lang] Make slicing a single row/column of a matrix return a vector (7068) (by **Yi Xu**)
- [ci] Workaround windows checkout 'Needed a single revision' issue (7078) (by **Proton**)
- [lang] Make sure ndarrays created in python frontend are initialized as zero (7060) (by **Ailing**)
- [Lang] Deprecate the dynamic_index switch (7071) (by **Yi Xu**)
- [misc] Update python package metadata (7063) (by **Proton**)
- [bug] Fixed compilation error caused by 7047 (7069) (by **PGZXB**)
- [opt] Automatically identify allocas to scalarize (7055) (by **Yi Xu**)
- [refactor] Remove ir parameter of KernelCodeGen::KernelCodeGen(Kernel *kernel, IRNode *ir) (7046) (by **PGZXB**)
- [refactor] Remove unnecessary IRNode::kernel (7047) (by **PGZXB**)
- [refactor] Remove dependencies on Program::current_ast_builder() in C++ side (7044) (by **PGZXB**)
- [ci] Version sanity check before publishing (7062) (by **Proton**)
- [ci] Make changelog generation working again (7058) (by **Proton**)
- [rhi] Update CommandList dispatch API (7052) (by **Bob Cao**)
- [aot] C-API versioning (7050) (by **PENGUINLIONG**)
- [refactor] Remove offloaded parameter of Program::compile() (7045) (by **PGZXB**)
- [lang] Migrate TensorType expansion for subscription indices from Python to Frontend IR (6942) (by **Zhanlue Yang**)
- [opt] Add ExtractPointers pass for dynamic index (7051) (by **Yi Xu**)
- [Lang] Add irpass::eliminate_immutable_local_vars() test cases for TensorType (7043) (by **Zhanlue Yang**)
- [Lang] Fix gui docstring (7003) (by **Zhao Liang**)
- [rhi] Update compute CommandList APIs (except dispatch) (7037) (by **Bob Cao**)
- [ir] Let GetElementExpression&Statement support index list (7049) (by **Lin Jiang**)
- [aot] C-API opengl runtime interop (7042) (by **PENGUINLIONG**)
- [ci] Pin pre-commit python version to 3.10 (7041) (by **Proton**)
- [opengl] Enable more gles tests in CI (7031) (by **Ailing**)
- [ci] Tuning headless demo VRAM usage (7039) (by **Proton**)
- [Build] Deprecate export_core (7028) (by **Zhanlue Yang**)
- [GUI] Support colored texts (7036) (by **Dunfan Lu**)
- [aot] Revert "C-API opengl runtime interop (7014)" (7032) (by **Proton**)
- [ci] Update pre-commit app versions (7025) (by **Proton**)
- [Doc] Remove packed mode in doc (7030) (by **Zhao Liang**)
- Revert "[opengl] Enable more gles tests in CI" (7029) (by **Ailing**)
- [build] Remove libexport_core.so dependency for Android App CI (6997) (by **Zhanlue Yang**)
- [opengl] Enable more gles tests in CI (7010) (by **Ailing**)
- [aot] C-API opengl runtime interop (7014) (by **damnkk**)
- [misc] Add macro to control amdgpu-related header file (7021) (by **Zeyu Li**)
- [bug] Fix device memory allocation for numpy array on CUDA backend (7008) (by **Zhanlue Yang**)
- [ci] Try enabling MSVC and check build times (6905) (by **Bob Cao**)
- [gfx] Update Device API: Splitting ResourceBinder into seperate Shade… (7020) (by **Proton**)
- [gfx] Revert "Update Device API: Splitting ResourceBinder into sepera… (7019) (by **Proton**)
- [amdgpu] Update amdgpu device to new API (7018) (by **Bob Cao**)
- [perf] Fix fill ndarray size problem. (6992) (by **Haidong Lan**)
- [cuda] Fix LLVM15 rsqrt perf regression (7012) (by **Haidong Lan**)
- [gfx] Update Device API: Splitting ResourceBinder into seperate ShaderResourceSet & RasterResources (6954) (by **Bob Cao**)
- [opt] Add ImmediateIRModifier to provide amortized constant-time replace_usages_with() (7001) (by **Yi Xu**)
- [amdgpu] Part0 add render hardware interface (6464) (by **Zeyu Li**)
- [Error] Do not show warning when the offline cache path does not exist (7005) (by **PGZXB**)
- [Lang] [spirv] Support dynamic indexing in spirv (6990) (by **Yi Xu**)
- [misc] Remove unnecessary CompileConfig::lazy_compilation (7009) (by **PGZXB**)
- [ci] Add C++ tests on AMDGPU RHI (6597) (by **Zeyu Li**)
- [ci] Update taichi-release-tests branch (disable QuanTaichi GOL) (7011) (by **Proton**)
- [amdgpu] Part3 update runtime module (6486) (by **Zeyu Li**)
- [opengl] Fix tests running both on opengl and vulkan (7006) (by **Ailing**)
- [ir] Record the return types to a StructType (6995) (by **Lin Jiang**)
- [lang] Get the CHI-IR struct type in python (6994) (by **Lin Jiang**)
- [ir] Change type maps to unordered maps and add mutexes (7000) (by **Lin Jiang**)
- [ir] Add struct type to CHI-IR (6982) (by **Lin Jiang**)
- [misc] Add repography activity stats (6991) (by **Proton**)
- [aot] Enable validation layers for C-API tests (6893) (by **Zhanlue Yang**)
- [opengl] Add ti.gles arch and enable tests (6988) (by **Ailing**)
- [Lang] [metal] Support dynamic indexing in metal (6985) (by **Yi Xu**)
- [opengl] Reset opengl context when taichi program resets (6987) (by **Ailing**)
- [Lang] Support LU sparse solver on CUDA backend (6967) (by **pengyu**)
- [misc] Keeping up with new python-wheel implementation (6986) (by **Proton**)
- [aot] Recover AOT CI script (6970) (by **PENGUINLIONG**)
- [lang] Migrate TensorType expansion for svd from Python code to Frontend IR (6972) (by **Zhanlue Yang**)
- [misc] Adding XCode project support (6976) (by **Bob Cao**)
- [bug] Fix taichi_ngp starting from ti example (6973) (by **Ailing**)
- [ci] Revert "Fix missing c_api.so in linux nightly" (6974) (by **Ailing**)
- [ci] Build: auto install vulkan on Linux (6969) (by **Proton**)
- [ci] Auto setup miniforge3 env when build (6966) (by **Proton**)
- [Lang] Fix struct type problem (6949) (by **Zhao Liang**)
- [aot] C-API breaking changes! (6955) (by **PENGUINLIONG**)
- [lang] Fix scalarization for PrintStmt (6945) (by **Zhanlue Yang**)
- [bug] Allow StructType as type hint to ti.func (6964) (by **Yi Xu**)
- [refactor] Remove legacy code for dynamic index (6961) (by **Yi Xu**)
- [aot] Fix rwtexture with template_args (6960) (by **Ailing**)
- [ci] Fix missing c_api.so in linux nightly (6962) (by **Ailing**)
- [lang] Migrate TensorType expansion for SNode indices from Python to Frontend IR (6934) (by **Zhanlue Yang**)
- [doc] New FAQ added (6963) (by **Olinaaaloompa**)
- [ci] Sync CI cache script & workflow (6959) (by **Proton**)
- [ci] Update release test branch, reduce running time (6944) (by **Proton**)
- [ci] Remove redundant tests (6947) (by **Proton**)
- [bug] Fix recompilation of filling a matrix field with the same matrix (6951) (by **Yi Xu**)
- [aot] Fixed C-API behavior tests (6939) (by **PENGUINLIONG**)
- [refactor] Remove _PyScopeMatrixImpl (6943) (by **Yi Xu**)
- [aot] Fix validation warning: OpImageFetch should operate on OpImage instead of OpSampledImage (6925) (by **Zhanlue Yang**)
- [CLI] Add "ti cache clean" command to clean the offline cache files manually (6937) (by **PGZXB**)
- [Doc] Minor doc update (6952) (by **Zhao Liang**)
- [ci] Fix forgotten build script paths (6941) (by **Proton**)
- [opt] Add pass eliminate_immutable_local_vars (6926) (by **Yi Xu**)
- [ci] Fix pre-commit errors (6940) (by **Proton**)
- [doc] Editorial updates (6935) (by **Olinaaaloompa**)
- [ci] Workflow Rewrite: Building on Linux (6848) (by **Proton**)
- [refactor] Remove _IntermediateMatrix and _MatrixFieldElement (6932) (by **Yi Xu**)
- [aot] C_API behavior test (6904) (by **damnkk**)
- [lang] Fix matrix type inference and remove _MatrixEntriesInitializer (6928) (by **Yi Xu**)
- [lang] Reorder sparse matrix before solving (6886) (by **pengyu**)
- [Doc] Glossary (6101) (by **Olinaaaloompa**)
- [aot] Refactor C-API error tests (6890) (by **Zhanlue Yang**)
- [doc] Update layout.md (Fields) (6868) (by **Gabriel Vainer**)
- [Doc] Update dac (6875) (by **Gabriel Vainer**)
- [lang] Support 'len' with Matrix-typed operands (6923) (by **Zhanlue Yang**)
- [doc] Update sparse.md (6908) (by **Gabriel Vainer**)
- [doc] Update performance.md (6911) (by **Gabriel Vainer**)
- [doc] Update debugging.md (6909) (by **Gabriel Vainer**)
- [doc] Update profiler.md (6910) (by **Gabriel Vainer**)
- [bug] Add GetElementExpression to offline cache key (6918) (by **PGZXB**)
- [ci] Reenable AMDGPU CI, disable OpenGL tests in AMDGPU task (6887) (by **Proton**)
- [lang] Fix accidental changes during matrix refactor (6914) (by **Yi Xu**)
- [example] Add circle-packing example (6870) (by **Zhao Liang**)
- [Doc] Update faq.md (6921) (by **Zhao Liang**)
- [misc] Show suggestion when locking metadata.lock fails (6919) (by **PGZXB**)
- [doc] New FAQs (6055) (by **Olinaaaloompa**)
- [example] Add poission disk sampling example (6852) (by **Zhao Liang**)
- [vulkan] Improve Vulkan RHI impl with lower overhead internal implementations (6912) (by **Bob Cao**)
- [doc] Link to LLVM 15 built for Visual Studio 2022 (6916) (by **PENGUINLIONG**)
- [lang] Fix issue of IfExpr with TensorTyped operands (6897) (by **Zhanlue Yang**)
- [doc] Update hello_world.md (6889) (by **Gabriel Vainer**)
- [IR] Allow a maximum of 12 SNode indices (6901) (by **Dunfan Lu**)
- [doc] Update odop.md (6874) (by **Gabriel Vainer**)
- [doc] Update external.md (6869) (by **Gabriel Vainer**)
- [Doc] Update dataclass.md (6876) (by **Gabriel Vainer**)
- [doc] Update cloth_simulation.md (6898) (by **Vissidarte-Herman**)
- [example] Update marching squares example (6851) (by **Zhao Liang**)
- [Lang] Add warning message when converting dynamic snode to numpy (6853) (by **Zhao Liang**)
- [Lang] Deprecate sourceinspect dependency (6894) (by **Zhao Liang**)
- [aot] Added C-API behavior tests (6871) (by **damnkk**)
- [aot] Gather satellite repo URLs (6860) (by **PENGUINLIONG**)
- [refactor] Remove _TiScopeMatrixImpl (6892) (by **Yi Xu**)
- [ci] Python test minor fixes (6891) (by **Proton**)
- [ir] Add ir_traits namespace to use less dynamic casts & Run CFG only ever once (6812) (by **Bob Cao**)
- [Lang] Warn users if ndarray size is out of int32 boundary (6846) (by **Yi Xu**)
- [build] Enable strip for libtaichi_c_api.so with Release Build (6845) (by **Zhanlue Yang**)
- [Lang] Remove the real_matrix switch (6885) (by **Yi Xu**)
- [build] Turn on function level linking for taichi_c_api (6840) (by **Zhanlue Yang**)
- [test] Remove tests with real_matrix=True and real_matrix_scalarize=True (6873) (by **Yi Xu**)
- [misc] Revert back to master after 6843 merged (6883) (by **Bob Cao**)
- [vulkan] Cleanup spdlog related logging from Vulkan RHI (6843) (by **Bob Cao**)
- [ci] Temporarily disable AMDGPU CI (6872) (by **Proton**)
- [Lang] Enable real_matrix and real_matrix_scalarize by default (6801) (by **Zhanlue Yang**)
- [bug] MatrixType bug fix: Fix error with static-grouped-ndrange (6839) (by **Zhanlue Yang**)
- [example] Fix jacobian example (6849) (by **Mingrui Zhang**)
- [bug] Fix flaky mass_spring_game_ggui.py on Mac M1 by setting up default values for VulkanCapabilities (6850) (by **Zhanlue Yang**)
- [example] Solve implicit fem using sparsee solver (6827) (by **pengyu**)
- [build] Migrate cmake targets from OBJECT to STATIC for libtaichi_c_api.so (6831) (by **Zhanlue Yang**)
- [Bug] Fix getting 64-bit data from ndarray in Python scope (6836) (by **Yi Xu**)
- [test] Avoid constant folding in overflow tests (6835) (by **Ailing**)
- [aot] Added C-API behavior test (6837) (by **damnkk**)
- [bug] Matrix refactor bug fix: Fix cross scope matrix operations (6822) (by **Zhanlue Yang**)
- [build] Refactored and removed RuntimeCUDA and RuntimeCUDAInjector (6830) (by **Zhanlue Yang**)
- [bug] Matrix refactor bug fix: Fix logical binary operations with TensorTyped operands (6817) (by **Zhanlue Yang**)
- [example] Add order-independent transparency example (6829) (by **Lin Jiang**)
- [opt] Re-enable constant folding when debug=True (6824) (by **Ailing**)
- [Bug] Avoid overwriting global tmp with dynamic_index=True (6820) (by **Yi Xu**)
- [bug] Matrix refactor bug fix: Fix restrictions on BinaryOp/TernaryOp operands' broadcasting (6805) (by **Zhanlue Yang**)
- [aot] C-API Device capability improvements (6773) (by **PENGUINLIONG**)
- [misc] Headers dependency cleanup from RHI (6699) (by **Bob Cao**)
- [ci] Revert "Temporarily disable desktop headless tests (6811)" (6816) (by **Proton**)
- [misc] Bump version to v1.4.0 (6804) (by **PENGUINLIONG**)
- [ci] Add AMDGPU relected ci (6743) (by **Zeyu Li**)
- [test] Remove unnecessary duplicated python runtime test runs (6808) (by **Ailing**)
- [Lang] Raise an error for the semantic change of transpose() (6813) (by **Yi Xu**)
- [refactor] Remove unnecessary checks in program (6802) (by **Ailing**)
- [vulkan] Support texture type args in aot add_kernel (6796) (by **Ailing**)
- [ci] Temporarily disable desktop headless tests (6811) (by **Proton**)
- [bug] Fix name collision in ti.dataclass (6737) (by **Yi Xu**)
- [bug] MatrixType bug fix: Add additional restrictions for unpacking a Matrix (6795) (by **Zhanlue Yang**)
- [doc] Update docstring for grad replaced (6800) (by **Mingrui Zhang**)
- [build] Add MSBuild option to setup.py (6724) (by **Bob Cao**)
- [Lang] [type] Add bool type in python as an alias to i32 (6742) (by **daylily**)
- [lang] Use less gpu memory when building sparse matrix (6781) (by **pengyu**)
- [example] Add cuda options for sparse matrix examples (6785) (by **pengyu**)
- [misc] Remove usage of deprecated num_channels/channel_format type hint in rw_texture in codebase (6791) (by **Ailing**)
- [bug] MatrixType bug fix: Fix error with BLS (6664) (by **Zhanlue Yang**)
- [vulkan] Support rw_texture in aot add_kernel (6789) (by **Ailing**)
- [bug] MatrixType bug fix: Fix error with quant (6776) (by **Yi Xu**)
- [bug] MatrixType bug fix: Fix test_ad_gdar_diffmpm (6786) (by **Yi Xu**)
- [vulkan] Deprecate num_channels and channel_format args in rw_texture type annotation (6782) (by **Ailing**)
- [misc] Remove the default potential_bug label on bug report issues (6784) (by **Ailing**)
- [bug] MatrixType bug fix: Fix error with texture (6775) (by **Yi Xu**)
- [vulkan] Make sure kernel recompiles when texture dtype changes (6774) (by **Ailing**)
- [aot] Clean up exported symbols for libtaichi_c_api.so (6140) (by **Zhanlue Yang**)
- [Misc] Refactored flattend_values() to avoid potential conflicts in flattened statements (6749) (by **Zhanlue Yang**)
- [aot] Warn the user about out-of-range access in C++ wrapper (6492) (by **PENGUINLIONG**)
- [build] Initial distributed compiling support (6762) (by **Proton**)
- [aot] Revert C-API Device capability improvements (6772) (by **PENGUINLIONG**)
- [aot] C-API Device capability improvements (6702) (by **PENGUINLIONG**)
- [aot] C-API to get available archs (6766) (by **PENGUINLIONG**)
- [doc] Update sparse matrix document (6719) (by **pengyu**)
- [autodiff] Separate non-linear operators to an individual class (6700) (by **Mingrui Zhang**)
- [bug] Fix dereferencing nullptr (6763) (by **Yi Xu**)
- [Doc] Update the documentation about Dynamic SNode (6752) (by **Lin Jiang**)
- [doc] Update dev install about clang version (6759) (by **Ailing**)
- [build] Improve TI_WITH_CUDA guards for CUDA related test cases (6698) (by **Zhanlue Yang**)
- [Lang] Add deprecation warning for the removal of the packed switch (6753) (by **Yi Xu**)
- [lang] Improve sparse matrix building on GPU (6748) (by **pengyu**)
- [aot] JSON serde (6754) (by **PENGUINLIONG**)
- [bug] MatrixType bug fix: Fix error with to_numpy() and from_numpy() (6726) (by **Zhanlue Yang**)
- [Doc] Stop mentioning packed mode (6755) (by **Yi Xu**)
- [lang] Get the length of dynamic SNode by x.length() (6750) (by **Lin Jiang**)
- [llvm] Support nested struct with matrix return value on real function (6734) (by **Lin Jiang**)
- [Metal] [error] Raise deprecate warning and error when using sparse snodes on metal (6739) (by **Lin Jiang**)
- [build] Integrate backward_cpp to test targets for enabling C++ stack trace (6697) (by **Zhanlue Yang**)
- [aot] Load AOT module from memory (6692) (6714) (by **PENGUINLIONG**)
- [ci] Add dockerfile.ubuntu-18.04.amdgpu (6736) (by **Zeyu Li**)
- [doc] Update LLVM10 -> LLVM15 in installation guide (6747) (by **Zhanlue Yang**)
- [misc] Fix warnings of taichi examples (6740) (by **PGZXB**)
- [example] Ti-example: instant ngp renderer (6673) (by **Youtian Lin**)
- [build] Use a separate prebuilt llvm15 binary for manylinux environment (6732) (by **Ailing**)

1.3

Taichi now introduces a powerful feature for developers - Argument Packs. This new functionality enables efficient caching of unchanged parameters between multiple kernel calls, which not only provides convenience when launching a kernel, but also boosts the performance.

Key Advantages
- Argument Pack: User-defined data types that encapsulate multiple parameters into a single, manageable unit.
- Buffering Capability: Store and reuse parameters that remain constant across kernel calls, reducing the overhead of repeated parameter passing.
- Device-level Caching: Taichi optimizes performance by caching argpacks directly on the device.

Usage Example

import taichi as ti
ti.init()

Defining a custom argument type using "ti.types.argpack"
view_params_tmpl = ti.types.argpack(view_mtx=ti.math.mat4, proj_mtx=ti.math.mat4, far=ti.f32)

Declaration of a Taichi kernel leveraging Argument Packs
ti.kernel
def p(view_params: view_params_tmpl) -> ti.f32:
return view_params.far

Instantiation of the argument pack
view_params = view_params_tmpl(
view_mtx=ti.math.mat4(
[[1, 0, 0, 0],
[0, 1, 0, 0],
[0, 0, 1, 0],
[0, 0, 0, 1]]),
proj_mtx=ti.math.mat4(
[[1, 0, 0, 0],
[0, 1, 0, 0],
[0, 0, 1, 0],
[0, 0, 0, 1]]),
far=1)

Executing the kernel with the Argument Pack

1.3.0

Deprecation Notice
- Using sparse data structures on the Metal backend is now deprecated. The support for Dynamic SNode has been removed in v1.3.0, and the support for Pointer/Bitmasked SNode will be removed in v1.4.0.
- The `packed` switch in `ti.init()` is now deprecated and will be removed in v1.4.0. See the feature introduction below for details.
- `ti.Matrix.rotation2d()` is now deprecated and will be removed in v1.4.0. Use `ti.math.rotation2d()` instead.
- To clearly distinguish vectors from matrices, `transpose()` on a vector is no longer allowed. If you want something like `a b.transpose()`, write `a.outer_product(b)` instead.
- Ndarray: The arguments of ndarray type annotation `element_dim`, `element_shape` and `field_dim` will be deprecated in v1.4.0. The `field_dim` is renamed to `ndim` to make it more intuitive. `element_dim` and `element_shape` will be replaced by passing a matrix type into `dtype` argument. For example, the `ti.types.ndarray(element_dim=2, element_shape=(3,3))` will be replaced by `ti.types.ndarray(dtype=ti.matrix(3,3))`.

New features

Dynamic SNode

To support variable-length fields, Taichi provides dynamic SNodes.
You can now use the dynamic SNode on fields of different data types, even struct fields and matrix fields.
You can use `x[i].append(...)` to append an element, use `x[i].length()` to get the length, and use `x[i].deactivate()` to clear the list as shown in the following code snippet.

python
pair = ti.types.struct(a=ti.i16, b=ti.i64)
pair_field = pair.field()

block = ti.root.dense(ti.i, 4)
pixel = block.dynamic(ti.j, 100, chunk_size=4)
pixel.place(pair_field)
l = ti.field(ti.i32)
ti.root.dense(ti.i, 5).place(l)

ti.kernel
def dynamic_pair():
for i in range(4):
pair_field[i].deactivate()
for j in range(i * i):
pair_field[i].append(pair(i, j + 1))
pair_field = [[],
[(1, 1)],
[(2, 1), (2, 2), (2, 3), (2, 4)],
[(3, 1), (3, 2), ... , (3, 8), (3, 9)]]
l[i] = pair_field[i].length() l = [0, 1, 4, 9]

Packed Mode

Packed mode was introduced in [v0.8.0](https://github.com/taichi-dev/taichi/releases/tag/v0.8.0) to allow users to trade runtime performance for memory usage. In v1.3.0, after the elimination of runtime overhead in common cases, packed mode has become the default mode. There's no longer any automatic padding behavior behind the scenes, so users can use fields and SNodes without surprise.

Sparse Matrix

We introduce the experimental sparse matrix and sparse solver on the CUDA backend. The API of using is the same as CPU backend. Currently, only the `f32` data type and LLT linear solver are supported on CUDA. You can only use `ti.ndarray` to compute [SpMV](https://en.wikipedia.org/wiki/Sparse_matrix%E2%80%93vector_multiplication) and linear solver operation. Float64 data type and other linear solvers are under implementation.

Improvements

Python Frontend

- Matrix slicing now supports augmented assign (e.g. +=) besides assign.

Taichi Examples

1. Our user https://github.com/Linyou contributed an excellent example on instant ngp renderer [PR #6673](https://github.com/taichi-dev/taichi/pull/6673). Run `taichi_ngp` to check it out!

[Developers only] LLVM15 upgrade

Starting from v1.3.0, Taichi has upgraded its LLVM dependency to version 15.0.0. If you're interested in contributing or simply building Taichi from source, please follow our [installation doc for developers](https://docs.taichi-lang.org/docs/master/dev_install).
Note this change has no impact on Taichi users.

Highlights
- **Documentation**
- Update the documentation about Dynamic SNode (6752) (by **Lin Jiang**)
- Stop mentioning packed mode (6755) (by **Yi Xu**)
- **Language and syntax**
- Add deprecation warning for the removal of the packed switch (6753) (by **Yi Xu**)
- **Metal backend**
- Raise deprecate warning and error when using sparse snodes on metal (6739) (by **Lin Jiang**)

Full changelog
- [aot] Revert C-API Device capability improvements (6772) (by **PENGUINLIONG**)
- [aot] C-API Device capability improvements (6702) (by **PENGUINLIONG**)
- [aot] C-API to get available archs (6766) (by **PENGUINLIONG**)
- [doc] Update sparse matrix document (6719) (by **pengyu**)
- [autodiff] Separate non-linear operators to an individual class (6700) (by **Mingrui Zhang**)
- [bug] Fix dereferencing nullptr (6763) (by **Yi Xu**)
- [Doc] Update the documentation about Dynamic SNode (6752) (by **Lin Jiang**)
- [doc] Update dev install about clang version (6759) (by **Ailing**)
- [build] Improve TI_WITH_CUDA guards for CUDA related test cases (6698) (by **Zhanlue Yang**)
- [Lang] Add deprecation warning for the removal of the packed switch (6753) (by **Yi Xu**)
- [lang] Improve sparse matrix building on GPU (6748) (by **pengyu**)
- [aot] JSON serde (6754) (by **PENGUINLIONG**)
- [bug] MatrixType bug fix: Fix error with to_numpy() and from_numpy() (6726) (by **Zhanlue Yang**)
- [Doc] Stop mentioning packed mode (6755) (by **Yi Xu**)
- [lang] Get the length of dynamic SNode by x.length() (6750) (by **Lin Jiang**)
- [llvm] Support nested struct with matrix return value on real function (6734) (by **Lin Jiang**)
- [Metal] [error] Raise deprecate warning and error when using sparse snodes on metal (6739) (by **Lin Jiang**)
- [build] Integrate backward_cpp to test targets for enabling C++ stack trace (6697) (by **Zhanlue Yang**)
- [aot] Load AOT module from memory (6692) (6714) (by **PENGUINLIONG**)
- [ci] Add dockerfile.ubuntu-18.04.amdgpu (6736) (by **Zeyu Li**)
- [doc] Update LLVM10 -> LLVM15 in installation guide (6747) (by **Zhanlue Yang**)
- [misc] Fix warnings of taichi examples (6740) (by **PGZXB**)
- [example] Ti-example: instant ngp renderer (6673) (by **Youtian Lin**)
- [build] Use a separate prebuilt llvm15 binary for manylinux environment (6732) (by **Ailing**)

1.2.2

Molten-vk version is downgraded to v1.1.10 to fix a few GGUI issues.

Full changelog:
- [build] Downgrade molten-vk version to v1.1.10 (6564) (by **Zhanlue Yang**)

Page 4 of 23

Releases

Has known vulnerabilities

Previous Next

Taichi

Page 4 of 23

1.5.0

1.4.1

1.4.0

1.3

1.3.0

1.2.2

Page 4 of 23

Links

Releases