- [doc] Minor updates in `hello.rst` (963) (by **Yuanming Hu**)
- [doc] Improve `hello.rst` (972) (by **Xiao Zhai**)
- [misc] Add a link to taichi_elements in README.md (974) (by **彭于斌**)
- [Bug] [opengl] [llvm] [metal] Fix floordiv overflow when `lhs * rhs` is large (970) (by **彭于斌**)
- [test] Size Performance Regression Tests (SPRT) (959) (by **彭于斌**)
- [Metal] Skip allocating Runtime::ListManager if no sparse SNode is used (976) (by **Ye Kuang**)
- [cuda] Improve kernel return value performance when unified memory is available (965) (by **Yuanming Hu**)
- [doc] Update `syntax.rst` and related sections (967) (by **Yuanming Hu**)
- [opengl] add TI_WITH_OPENGL env var to disable OpenGL (962) (by **彭于斌**)
- [opt] [cuda] Fix constant folding in multithreaded CUDA applications (954) (by **Yuanming Hu**)
- [ir][refactor] Avoid throwing exception in `replace_statements_with` (968) (by **xumingkuan**)
- [opengl] Use `compile_to_offloads` for IR lowering (951) (by **彭于斌**)
- [Doc] fix typo in syntax_sugars.po (960) (by **Danni**)
- [refactor][ir] Simplify ir.h to reduce compile-time (934) (by **彭于斌**)
- [lang] [refactor] deprecate boardcast_if_scalar, all use binary and unary (943) (by **彭于斌**)
- [doc] Improve `hello.rst` (955) (by **Yuanming Hu**)
- [metal] Set labels on the command buffer/encoder (950) (by **Ye Kuang**)
- [ir] Remove BasicStmtVisitor::current_struct_for (952) (by **xumingkuan**)
- [misc] README.md use shields.io to show latest release (947) (by **彭于斌**)
- [Lang][IR][refactor] Refactor SNode and support coordinate offsets (945) (by **Yuanming Hu**)