Tilelang

Latest version: v0.1.3

Safety actively analyzes 722491 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

0.1.3

What's Changed
* [Docker] Add libstdcxx-ng-12 to Dockerfiles for CUDA versions by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/160
* Add cpu jit with backend ctypes by xs-keju in https://github.com/tile-ai/tilelang/pull/154
* [Carver] Multi-Threads Compilation for Fast Auto Tuning by SiriusNEO in https://github.com/tile-ai/tilelang/pull/156
* [Refactor] Replace T.If with native Python if statement for mla paged kernel by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/162
* [Enhancement] Improve CUDA path detection by xwhzz in https://github.com/tile-ai/tilelang/pull/157
* [Refactor] Replace `T.thread_binding` with `T.get_thread_binding` in examples and test cases by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/163
* [Bugfix] Cast bool dtype into int8 in blocksparse examples by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/167
* [Example] Implement NSA Decode tilelang exampls by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/168
* [Release] Bump version to v0.1.2.post1 by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/166
* Use SS-GEMM for PV in mla by YouJiacheng in https://github.com/tile-ai/tilelang/pull/165
* [Example] Implement tilelang native sparse attention varlen example by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/170
* [Bugfix] Implement boundary check for the buffer shape with dynamic symbolic by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/173
* [AutoTune] Enable config-performance trace by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/174
* [Feat] Append Pass Context and TMA lowering configuration option by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/175
* [Feat] Introduce new caching mechanism for compiled kernels by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/176
* [Refactor] Enhance GPU Kernel Launch with Environment Thread Creation by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/178
* [Bugfix] Improve Thread Variable Handling in Layout Inference by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/179
* [Examples] Implement NSA Backward kernels by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/180
* [Enhancement] Optimize CMake build process with dynamic job count calculation by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/183
* [Bugfix] Add dynamic shape support with out_idx in Cython JIT kernel compilation by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/185
* [Dev][Bugfix] Add RMS Normalization Kernels and Fix Reduce Bug by chengyupku in https://github.com/tile-ai/tilelang/pull/188
* [Dev] Add the failed nvcc command to the exception message by penguin-wwy in https://github.com/tile-ai/tilelang/pull/189
* [Bugfix] Fix `T.copy` for scalar datatypes by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/190
* [Enhancement] Simplify GEMM example with direct kernel compilation by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/191
* [Bugfix] Make quickstart work properly on cu118 by penguin-wwy in https://github.com/tile-ai/tilelang/pull/193
* [Language] Support clamp in language by hyx1999 in https://github.com/tile-ai/tilelang/pull/192
* [Refactor] Add SetMaxNRegCollector to Improve Register Hint Handling in Warp Specialized Rewriter by chengyupku in https://github.com/tile-ai/tilelang/pull/194
* [Feature] Add TMA Store Synchronization Support by chengyupku in https://github.com/tile-ai/tilelang/pull/195
* Update expired example code. by 66RING in https://github.com/tile-ai/tilelang/pull/196
* [CMake] Add CUDA Major Version Detection for Conditional Compilation by chengyupku in https://github.com/tile-ai/tilelang/pull/197
* [Feature] Support Async Pipeline inference within if scope by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/198
* [Dev] Add new example for FlashAttention with pipelined execution by chengyupku in https://github.com/tile-ai/tilelang/pull/200
* [Enhancement] Enhancing the handling of conditional statements in the pipeline by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/201
* [Feature] Upgrade cutlass version and support fp8 T.gemm by zqh-wz in https://github.com/tile-ai/tilelang/pull/202
* [Docker] Update Dockerfiles to specify exact version of libstdcxx-ng by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/203
* [Dev] Add GQA backward example by chengyupku in https://github.com/tile-ai/tilelang/pull/205
* [LICENSE] Typo fix in LICENSE by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/208
* [Enhancement] Allow mma fallback when wgmma is not supported by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/206
* [Examples] Expand tuning configurations for FlashAttention example by chenghuaWang in https://github.com/tile-ai/tilelang/pull/204
* [Enhancement] Avoid tvm ffi handling when out_idx is specified by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/209
* [Fix] Fix K // block_K to T.ceildiv(K,block_K) and add tests by hyx1999 in https://github.com/tile-ai/tilelang/pull/210
* [Dev] Implement IfStmtBinding and MergeIfStmt transformations by chengyupku in https://github.com/tile-ai/tilelang/pull/211
* [Language] Introduce `T.reshape` and `T.view` by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/212
* [Enhancement] Improve device handling in Cython kernel adapter by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/220
* [Enhancement] Update format script to support force compare with upstream by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/221
* [Refactor] Introduce KernelParam integration across modules by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/223
* [Bugfix] Fix mismatch of shared memory layout and mma atom on Hopper by zqh-wz in https://github.com/tile-ai/tilelang/pull/224
* [Refactor] Update kernel compilation and profiling in examples by chengyupku in https://github.com/tile-ai/tilelang/pull/225
* [Examples] Add fp8 gemm 2xAcc and deepgemm example by cherichy in https://github.com/tile-ai/tilelang/pull/217
* [Doc] Add instructions for installing nightly version by xwhzz in https://github.com/tile-ai/tilelang/pull/226
* [Bugfix] Disable force inline for ldmatrix by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/227
* [Bugfix] Support duplicate tma desc declaration by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/228
* [Refactor] Rename clamp functions and enhance dtype handling in tests by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/232
* [Enhancement] Simplify kernel source extraction in JIT adapters by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/230
* [Feature] Add reduce_max corresponding tests by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/236
* [BugFix] Fix bug of missing MBarrierExpectTX by chengyupku in https://github.com/tile-ai/tilelang/pull/241
* [Refactor] Refactor for Better Layout Conflict Handling by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/240
* [Refactor] Align torch_assert_close tensor comparison with torch.testing.assert_close by xwhzz in https://github.com/tile-ai/tilelang/pull/239
* [Dev] Implement FlashAttention3 Backward by chengyupku in https://github.com/tile-ai/tilelang/pull/244
* [BugFix] Fix bug of mismatching dtype in testing by xwhzz in https://github.com/tile-ai/tilelang/pull/245
* [Enhancement] Add zero initialization option to GEMM operations by chengyupku in https://github.com/tile-ai/tilelang/pull/246
* [Enhancement][CUDA] Avoid C7508 for CUDA backend via assigning default value to `minBlocksPerMultiprocesor ` by cherichy in https://github.com/tile-ai/tilelang/pull/248
* [Feature] Add database storage for JITKernel cache with Cython and Ctypes adapters by Alex4210987 in https://github.com/tile-ai/tilelang/pull/213
* [Examples] Implement elementwise add kernel by chenghuaWang in https://github.com/tile-ai/tilelang/pull/219
* [Refactor] Phaseout LLVM Dependency by Making it Optional by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/247
* [Readme] Update Bib Citation Section by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/249
* [Enhancement] Support float variable as arguments by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/250
* add autotune to example_gemm.py by yyttt6 in https://github.com/tile-ai/tilelang/pull/252
* [Language] Introduce `T.alloc_var` to define a variable like `int var;` by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/255
* [Example] Implement Kernel Example cumsum by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/258
* [Refactor] Refactor CUDA post-processing callback registration in TileLang by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/259
* [Refactor] Move compilation outside critical section by YouJiacheng in https://github.com/tile-ai/tilelang/pull/260
* [CI] Use auditwheel to generate manylinux wheels by oraluben in https://github.com/tile-ai/tilelang/pull/251
* [Bugfix] Fix Benchmark/Example Code for Autotuning by SiriusNEO in https://github.com/tile-ai/tilelang/pull/254
* [Language] Enhance alias to support blockwise memory load by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/261
* [Bugfix] Fix auto tuning tma handling by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/263
* [Release] Bump version to 0.1.3 by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/264

New Contributors
* xs-keju made their first contribution in https://github.com/tile-ai/tilelang/pull/154
* YouJiacheng made their first contribution in https://github.com/tile-ai/tilelang/pull/165
* penguin-wwy made their first contribution in https://github.com/tile-ai/tilelang/pull/189
* hyx1999 made their first contribution in https://github.com/tile-ai/tilelang/pull/192
* 66RING made their first contribution in https://github.com/tile-ai/tilelang/pull/196
* zqh-wz made their first contribution in https://github.com/tile-ai/tilelang/pull/202
* chenghuaWang made their first contribution in https://github.com/tile-ai/tilelang/pull/204
* cherichy made their first contribution in https://github.com/tile-ai/tilelang/pull/217
* Alex4210987 made their first contribution in https://github.com/tile-ai/tilelang/pull/213
* yyttt6 made their first contribution in https://github.com/tile-ai/tilelang/pull/252
* oraluben made their first contribution in https://github.com/tile-ai/tilelang/pull/251

**Full Changelog**: https://github.com/tile-ai/tilelang/compare/v0.1.2...v0.1.3

0.1.2.post1

Why we need this post release?
The v0.1.2 prebuild package used a legacy cython file, which may lead to some bugs.

What's Changed
* [Docker] Add libstdcxx-ng-12 to Dockerfiles for CUDA versions by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/160
* Add cpu jit with backend ctypes by xs-keju in https://github.com/tile-ai/tilelang/pull/154
* [Carver] Multi-Threads Compilation for Fast Auto Tuning by SiriusNEO in https://github.com/tile-ai/tilelang/pull/156
* [Refactor] Replace T.If with native Python if statement for mla paged kernel by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/162
* [Enhancement] Improve CUDA path detection by xwhzz in https://github.com/tile-ai/tilelang/pull/157
* [Refactor] Replace `T.thread_binding` with `T.get_thread_binding` in examples and test cases by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/163
* [Bugfix] Cast bool dtype into int8 in blocksparse examples by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/167
* [Example] Implement NSA Decode tilelang exampls by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/168

New Contributors
* xs-keju made their first contribution in https://github.com/tile-ai/tilelang/pull/154

**Full Changelog**: https://github.com/tile-ai/tilelang/compare/v0.1.2...v0.1.2.post1

0.1.2

What's Changed
* [Dev] Add MLA and GQA decode examples by chengyupku in https://github.com/tile-ai/tilelang/pull/109
* [Example] Add Split-K and Stream-K Examples and move MLA from fld to mla by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/110
* [Typo] Fix a typo in gemm splitk examples by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/111
* [Typo] Fix links in installation instructions in README.md by xwhzz in https://github.com/tile-ai/tilelang/pull/112
* [Typo] Fix formatting in installation instructions in README.md by xwhzz in https://github.com/tile-ai/tilelang/pull/113
* [Benchmark] Add benchmark scripts for block sparse attention by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/114
* [Dev] Support vectorized value pack and atomicAdd for BFloat16 DType by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/116
* [Bugfix] Bugfix of pass order for hopper by chengyupku in https://github.com/tile-ai/tilelang/pull/117
* [Dev] Update MLA decode kernel by chengyupku in https://github.com/tile-ai/tilelang/pull/120
* [Example] Add GQA Example by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/118
* [Example] Implement TileLang Native Sparse Attention Kernel by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/121
* [Doc] Update README.md with new example links for Flash MLA Decoding and Native Sparse Attention by chengyupku in https://github.com/tile-ai/tilelang/pull/122
* [Example] Update GEMM FP8 Example by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/123
* [Dev] Add RetNet Linear Attention example by chengyupku in https://github.com/tile-ai/tilelang/pull/124
* [JIT] Enhance cython/ctypes wrapper for tma descriptor by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/126
* [Dev][Bugfix] Fix bug in ThreadTagChecker; Add WgmmaSync rewriter and add MHA WGMMA pipelined example by chengyupku in https://github.com/tile-ai/tilelang/pull/128
* [Dev] Remove buffer flatten when debug print a shared buffer by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/129
* [Debug] Support `T.print` for `fragment` scope by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/130
* [Example] Implememt FMHA Varlen Example by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/131
* [Refactor] Set default log level from waning into info by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/132
* [Kernel] Implement different SEQ Q/KV examples with block sparse by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/133
* [Dev][Doc] Add DeepSeek MLA Decode Example with Documentation and Performance Benchmarks by chengyupku in https://github.com/tile-ai/tilelang/pull/134
* [Doc] Update MLA Documentation by chengyupku in https://github.com/tile-ai/tilelang/pull/135
* [Debug] Improve Memory Layout Plot by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/136
* [Doc] Add MLA Decoding Performance Benchmarks and Documentation by chengyupku in https://github.com/tile-ai/tilelang/pull/137
* [Bugfix] Add missing definition for AtomicAdd by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/138
* [Dev][Doc] Enhance Flash Attention Implementation in GQA Decoding Example and Fix Typo by chengyupku in https://github.com/tile-ai/tilelang/pull/139
* [Dev] Adjust computation logic to avoid precision loss when casting acc_s from float to float16 by chengyupku in https://github.com/tile-ai/tilelang/pull/141
* [Refactor] Rename gemm fp8 example as we currently lack `T.gemm` support for fp8 by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/144
* [Enhancement] Support debug print for unsigned char datatype by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/145
* [Enhancement] Enable runtime tensor data type validation by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/146
* [Refactor] Adapt Caver to benchmark by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/148
* [Refactor] Remove BitBLAS Import Check in Benchmark by SiriusNEO in https://github.com/tile-ai/tilelang/pull/150
* [Enhancement] Optimize TileLang install scripts with Dynamic CPU Cores by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/152
* [Carver] Enhance Carver Adaptation for MatMul Benchmarking by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/153
* [Dev][Benchmark] Add MLA paged decoding example and benchmark script by chengyupku in https://github.com/tile-ai/tilelang/pull/158
* [Release] Bump Version to v0.1.2 by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/155

New Contributors
* SiriusNEO made their first contribution in https://github.com/tile-ai/tilelang/pull/150

**Full Changelog**: https://github.com/tile-ai/tilelang/compare/v0.1.1...v0.1.2

0.1.1

What's Changed
* [Doc] Update release news by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/80
* [Doc] Convert docs from rst format to Markdown format. by xwhzz in https://github.com/tile-ai/tilelang/pull/82
* [Bugfix] Bugfix of installing with develop mode by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/81
* [WHL] Support whl building for different python versions via tox by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/83
* [Refactor] Separate tilelang Pass Thread Sync (with Hopper support) from tvm by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/85
* [Backend][WebGPU] Support WebGPU WGSL code generation by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/86
* [Wheel] Support pypi build scripts for different python via tox by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/93
* [Wrap] Use a ctypes-based kernel wrapper instead of dlpack for runtime efficiency by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/95
* [Bugfix] Update Dockerfile.cu120 by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/98
* [Bugfix] Put `InjectPtxAsyncCopy` Pass behind `ThreadSync` Pass by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/97
* [Feature] Add CTypes JIT kernel support by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/100
* [Docker] Add Dockerfiles for multiple CUDA versions by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/103
* [JIT] Support Cython jit and make cython a default execution backend by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/102
* [Refactor] Phrase out torch cpp extension backend by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/104
* [Wheel] Provide a bare docker scripts to help build wheels for manylinux by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/105
* [Example] Implement simple block sparse kernel by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/106
* [Release] Bumpy version to v0.1.1 by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/107


**Full Changelog**: https://github.com/tile-ai/tilelang/compare/v0.1.0...v0.1.1

0.1.0

What's Changed
* [LICENSE] Add LICENSE for flashinfer by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/19
* [Doc] Fix installation scripts and docs for dequantize gemm by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/20
* [Doc] Use sphinx to generate docs. by xwhzz in https://github.com/tile-ai/tilelang/pull/21
* [Doc] update installation.md and readme by Cunxiao2002 in https://github.com/tile-ai/tilelang/pull/22
* [Doc] fix a typo in installation.rst by Cunxiao2002 in https://github.com/tile-ai/tilelang/pull/24
* [Doc] Remove legacy files and update reference by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/25
* [CI][Test] Add test cases for tilelang transform `AnnotateDeviceRegions` and `MakePackedAPI` by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/26
* [Doc] Create a workflow to host docs using GitHub Pages. by xwhzz in https://github.com/tile-ai/tilelang/pull/28
* [CI][Test] Add test cases for tilelang transform InjectSoftwarePipeline and FrontendLegalize by Cunxiao2002 in https://github.com/tile-ai/tilelang/pull/30
* [Bugfix] Replace thread binding detector in LayoutInference Pass by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/31
* [CI] Comprehensive Test cases Implementation of Matmul Dequantize by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/32
* [Doc] Update GitHub Actions workflow for documentation deployment and add CNAME file. by xwhzz in https://github.com/tile-ai/tilelang/pull/33
* [Refactor] Simplify interface via replacing argument thread binding of intrinsics with `KernelFrame.Current` by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/34
* [Bugfix] Reorder Passes: Place Vectorize Loop Before StorageFlatten and FlattenBuffer to Prevent Redundant Allocations by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/37
* [Doc] Update documentation structure and content by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/39
* [Doc][CI] Update GitHub Actions workflow for documentation build and deployment. by xwhzz in https://github.com/tile-ai/tilelang/pull/42
* [CI] Allow manual triggering of documentation workflow in addition to… by xwhzz in https://github.com/tile-ai/tilelang/pull/43
* [CI][Test] Add test cases for tilelang transform PipelinePlanning by Cunxiao2002 in https://github.com/tile-ai/tilelang/pull/44
* [CI][Test] Add test cases for tilelang transform `LayoutInference` and `LowerTileOp` on loop tail split functionality by tzj-fxz in https://github.com/tile-ai/tilelang/pull/29
* [Debug] Introduce `T.print` for buffer and variables logging on frontend by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/45
* [CI] Change pull request trigger to `pull_request_target` for documen… by xwhzz in https://github.com/tile-ai/tilelang/pull/48
* [Dev] Add FlashDecoding example by chengyupku in https://github.com/tile-ai/tilelang/pull/46
* [Doc] update README that tilelang has been used in AttentionEngine by smallscientist1 in https://github.com/tile-ai/tilelang/pull/50
* [Doc] Remove unnecessary layout annotation by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/49
* [CI][Test] Add test cases for tilelang kernel convolution by chengyupku in https://github.com/tile-ai/tilelang/pull/51
* [Dev] Implement test case for tilelang transformations by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/53
* [CI][Test] Add test cases for tilelang kernel FlashAttention by chengyupku in https://github.com/tile-ai/tilelang/pull/54
* [CI][Test] Add test cases for element_add by Cunxiao2002 in https://github.com/tile-ai/tilelang/pull/47
* [CI] Clean up target repository before publishing documentation. by xwhzz in https://github.com/tile-ai/tilelang/pull/55
* [CI][Test] Add test cases for tilelang transform ClusterPlanning by chengyupku in https://github.com/tile-ai/tilelang/pull/57
* [Doc] Append debug relevant testing and documentations by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/58
* [CI][Test] Add test cases for tilelang transform LowerHopperIntrin by chengyupku in https://github.com/tile-ai/tilelang/pull/59
* [Doc] Add matmul kernel tutorial with tile library by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/60
* [Dev] Separate `LoopVectorize` Pass from upstream tvm by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/62
* [Dev] Support FP8 Codegen for cuda backend by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/64
* [Dev] Add test case for bfloat16 and int4 gemm with mma by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/65
* [CI][Test] Add test cases for tilelang transform InjectFenceProxy by chengyupku in https://github.com/tile-ai/tilelang/pull/66
* [Tools] Introduce `plot_layout` to visualize the fragment layout by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/68
* [Dev] Remove unnecessary python dependencies by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/69
* [Carver] Introduce a tile-structure based cost model for auto tuning by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/70
* [Bugfix] bug fix for bitblas dependency by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/71
* [CI][Test] Add test cases for tilelang transform MultiVersionBuffer and WarpSpecialized by chengyupku in https://github.com/tile-ai/tilelang/pull/72
* [CostModel][Carver] Support Hint Recommend for Shared memory Kernel Fusion by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/73
* [Carver] Remove legacy todo items in carver's readme by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/74
* [Dev] Add mha backward example by chengyupku in https://github.com/tile-ai/tilelang/pull/77
* [Release] Bump version into v0.1.0 by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/76

New Contributors
* xwhzz made their first contribution in https://github.com/tile-ai/tilelang/pull/21
* Cunxiao2002 made their first contribution in https://github.com/tile-ai/tilelang/pull/22
* tzj-fxz made their first contribution in https://github.com/tile-ai/tilelang/pull/29
* chengyupku made their first contribution in https://github.com/tile-ai/tilelang/pull/46
* smallscientist1 made their first contribution in https://github.com/tile-ai/tilelang/pull/50

**Full Changelog**: https://github.com/tile-ai/tilelang/compare/v0.0.1...v0.1.0

0.0.1

Pre-release for the v0.0.1. Under testing, Only cuda prebuilt are provided.

What's Changed
* [Doc] Update the example figures in README by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/3
* [Doc] Replace SVG Figures with PNG due to some format issues by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/4
* [Dev][Language] Separate Base AST with Sugar Syntax by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/9
* [Dev] Enhance examples on README by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/10
* [Doc] Revert repo link by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/11
* [Dev][jit] Introduce jit for kernel functions by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/12
* Update README.md by rkinas in https://github.com/tile-ai/tilelang/pull/14
* [CI] Remove Code QL workflow by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/16
* [Doc] Add benchmark link in README by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/17
* [Release] Bump Version into 0.0.1 by LeiWang1999 in https://github.com/tile-ai/tilelang/pull/18

New Contributors
* LeiWang1999 made their first contribution in https://github.com/tile-ai/tilelang/pull/3
* rkinas made their first contribution in https://github.com/tile-ai/tilelang/pull/14

**Full Changelog**: https://github.com/tile-ai/tilelang/commits/v0.0.1

Links

Releases

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.