Hidet

Latest version: v0.3.1

Safety actively analyzes 625010 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 2

0.3.1

What's Changed
* [Version] Bump version to v0.3.1.dev by yaoyaoding in https://github.com/hidet-org/hidet/pull/361
* [Option] Add an option to disable imperative execution by serach24 in https://github.com/hidet-org/hidet/pull/362
* [Graph][Benchmark] Update benchmark function by Aalanli in https://github.com/hidet-org/hidet/pull/363
* [Compile Server] Update deps for compilation server by xinli-git in https://github.com/hidet-org/hidet/pull/365
* [Utils] Changed the multiprocessing context by destefy in https://github.com/hidet-org/hidet/pull/367
* [Dynamo] Refactoring code for Hidet remote compilation by destefy in https://github.com/hidet-org/hidet/pull/369
* [Graph][Dynamo Backend] Lshift/Rshift/Mod by Aalanli in https://github.com/hidet-org/hidet/pull/371
* [Graph][Operator] Fix reduce bug, add uint8x4 by Aalanli in https://github.com/hidet-org/hidet/pull/372
* [CompiledGraph] Add option to store dispatch table option by destefy in https://github.com/hidet-org/hidet/pull/377
* [Graph][Tensor] remove unnecessary synchronization by xiaocenxiaocen in https://github.com/hidet-org/hidet/pull/374
* [Graph][Dynamo Backend] Minor imperative run bug fix by Aalanli in https://github.com/hidet-org/hidet/pull/383
* [Graph] Fix CompiledGraph aliasing bug by Aalanli in https://github.com/hidet-org/hidet/pull/384
* [Frontend] Add mapping for `torch.sqrt` by yaoyaoding in https://github.com/hidet-org/hidet/pull/387
* [Fix][Graph] Write compiled graph to tempfile first by destefy in https://github.com/hidet-org/hidet/pull/392
* [Operators] Improving fp32 matrix multiplication on x86 CPUs by BolinSNLHM in https://github.com/hidet-org/hidet/pull/378
* [Fixbug] Fix a bug related to c/c++ integer promotion by yaoyaoding in https://github.com/hidet-org/hidet/pull/391
* [Option] Add option to set class Var id attribute to 0 by default by destefy in https://github.com/hidet-org/hidet/pull/393
* [CI] Add CI workflow and scripts by hjjq in https://github.com/hidet-org/hidet/pull/394
* [CI] Fix deadlock by hjjq in https://github.com/hidet-org/hidet/pull/395
* [Operator] Enhancements to Reduce by hjjq in https://github.com/hidet-org/hidet/pull/366
* [CI] Launch and stop compile server via workflow by hjjq in https://github.com/hidet-org/hidet/pull/396
* [Operator] Support advanced options for pooling operators by yaoyaoding in https://github.com/hidet-org/hidet/pull/399
* [Torch] Implements __torch_func__ protocol by yaoyaoding in https://github.com/hidet-org/hidet/pull/400
* [Docs] Add more documentation by yaoyaoding in https://github.com/hidet-org/hidet/pull/401
* [Fixbug] Fix a performance bug in auto-scheduler by yaoyaoding in https://github.com/hidet-org/hidet/pull/402
* [Library] Add cublas library by yaoyaoding in https://github.com/hidet-org/hidet/pull/404
* [Operator] Add `hidet.ops.matmul_cublas` operator by yaoyaoding in https://github.com/hidet-org/hidet/pull/405
* [Fusion] Allow shallow fusion of cublas operator by yaoyaoding in https://github.com/hidet-org/hidet/pull/407
* [CI] Clear op cache by hjjq in https://github.com/hidet-org/hidet/pull/406
* [Runtime] Add a new compiled format CompiledApp by yaoyaoding in https://github.com/hidet-org/hidet/pull/408
* CPU AVX implementation for Softmax, Norm by fishingguy456 in https://github.com/hidet-org/hidet/pull/357
* [CI] Reduce scope of secrets by hjjq in https://github.com/hidet-org/hidet/pull/413
* [Operator] Add a opaque operator base class by yaoyaoding in https://github.com/hidet-org/hidet/pull/414
* [IR] Support inplace operators by yaoyaoding in https://github.com/hidet-org/hidet/pull/416
* [Graph][Quantization] Multi-stage software pipelining and update parallel k rule by Aalanli in https://github.com/hidet-org/hidet/pull/364
* [CI] Trigger workflow by hjjq in https://github.com/hidet-org/hidet/pull/417
* [Scheduler] Add the fused task name to auto-scheduled kernels by yaoyaoding in https://github.com/hidet-org/hidet/pull/418
* [CI] Use cudagraph for benchmarks by hjjq in https://github.com/hidet-org/hidet/pull/419
* [CI] Remove unnecessary synchronization by hjjq in https://github.com/hidet-org/hidet/pull/420
* Update Netron viewer link by KTong821 in https://github.com/hidet-org/hidet/pull/421
* [Operator] Add cublas to matmul tune space by hjjq in https://github.com/hidet-org/hidet/pull/422
* [IR] Support integer subbyte by xiaocenxiaocen in https://github.com/hidet-org/hidet/pull/403
* [README] Fix ONNX link by dbabokin in https://github.com/hidet-org/hidet/pull/425
* [cuBLAS] Add cublas_gemm_batched and use cublasSetStream to set stream to the current stream in all cublas API calls by yudi0201 in https://github.com/hidet-org/hidet/pull/423
* [Fixbug] Fix dynamic memcpy bug by KTong821 in https://github.com/hidet-org/hidet/pull/427
* [Compile Server] Fetch repo before checking out by hjjq in https://github.com/hidet-org/hidet/pull/429
* [CI] Use slurm for runners by hjjq in https://github.com/hidet-org/hidet/pull/430
* [CI] CI migration by hjjq in https://github.com/hidet-org/hidet/pull/433
* [Fixbug] Fix graph metadata hash by KTong821 in https://github.com/hidet-org/hidet/pull/428
* [CI] Add back tests by hjjq in https://github.com/hidet-org/hidet/pull/436
* [Fix] Skip a failed test due to huggingface transformers update by yaoyaoding in https://github.com/hidet-org/hidet/pull/439
* [RC] Release candidate for version 0.3.1 by yaoyaoding in https://github.com/hidet-org/hidet/pull/442

New Contributors
* destefy made their first contribution in https://github.com/hidet-org/hidet/pull/367
* xiaocenxiaocen made their first contribution in https://github.com/hidet-org/hidet/pull/374
* fishingguy456 made their first contribution in https://github.com/hidet-org/hidet/pull/357
* KTong821 made their first contribution in https://github.com/hidet-org/hidet/pull/421
* dbabokin made their first contribution in https://github.com/hidet-org/hidet/pull/425
* yudi0201 made their first contribution in https://github.com/hidet-org/hidet/pull/423

**Full Changelog**: https://github.com/hidet-org/hidet/compare/v0.3.0...v0.3.1

0.3.0

Notes

In this release, we add more support for large language model inference, distributed inference, and quantization. We also make hidet script more stable and added more documentation for it. More operators and models are supported. See below for more details.

Frontend
* [Frontend] Dynamic shape fx trace by Aalanli in https://github.com/hidet-org/hidet/pull/294
* [Torch] Steal Pytorch weights by hjjq in https://github.com/hidet-org/hidet/pull/310
* [Dynamo Frontend] Refactor the dynamic shape support by yaoyaoding in https://github.com/hidet-org/hidet/pull/319
* [Torch][Graph][Operator] Add and fix various items for torchvision model support by hjjq in https://github.com/hidet-org/hidet/pull/347
* [Dynamo] minor enhancements to attention and register a few functions by xinli-git in https://github.com/hidet-org/hidet/pull/345

Operators and models
* [Operator] Further performance enhancements for conv2D by Aalanli in https://github.com/hidet-org/hidet/pull/290
* [Operator] Refactoring matrix multiplication implementation by yaoyaoding in https://github.com/hidet-org/hidet/pull/296
* [Model Support] Add support for wav2vec by yaoyaoding in https://github.com/hidet-org/hidet/pull/303
* [Operator] Update attention for dynamic shape by hjjq in https://github.com/hidet-org/hidet/pull/307
* [Operator] Resolve Adaptive Pool to reduce by hjjq in https://github.com/hidet-org/hidet/pull/308
* [Reduce] optimize and unify reduce operator to a single place by xinli-git in https://github.com/hidet-org/hidet/pull/311
* [Operator] optimize normalize op with vectorized load, dynamic shape and more by xinli-git in https://github.com/hidet-org/hidet/pull/316
* [Model] Add missing operators for T5 by yaoyaoding in https://github.com/hidet-org/hidet/pull/322
* [Fixbug] Reduce should perform syncthread after initializing shared memory to zero by xinli-git in https://github.com/hidet-org/hidet/pull/325
* [Models] Llama 2 support by Aalanli in https://github.com/hidet-org/hidet/pull/324
* [Models] Llama2 fix by Aalanli in https://github.com/hidet-org/hidet/pull/333
* [Operator] Composite Elementwise Operation by hjjq in https://github.com/hidet-org/hidet/pull/337
* [Operator] Add clamp/isinf/any/all op, enhance where op by yaoyaoding in https://github.com/hidet-org/hidet/pull/343
* [Torch][Operator] More torchvision model support by hjjq in https://github.com/hidet-org/hidet/pull/348
* [Operator] Add einsum by hjjq in https://github.com/hidet-org/hidet/pull/349
* [Operator][Graph][Regression] CNN optimizations by hjjq in https://github.com/hidet-org/hidet/pull/356
* [Graph] Minor bug fixes by hjjq in https://github.com/hidet-org/hidet/pull/358


Distributed inference
* [Distributed] all_reduce op and distributed info in graphs by soodoshll in https://github.com/hidet-org/hidet/pull/284
* [Distributed] Add more runtime distributed communication functions by soodoshll in https://github.com/hidet-org/hidet/pull/314
* [Fixbug] group_start and group_end should be able importable without nccl by soodoshll in https://github.com/hidet-org/hidet/pull/317

Quantization
* [Operators] preliminary symmetric weight quantization by Aalanli in https://github.com/hidet-org/hidet/pull/298
* [Quantization] Quantization API by Aalanli in https://github.com/hidet-org/hidet/pull/309
* [Quantization] fix quantization pass bug by Aalanli in https://github.com/hidet-org/hidet/pull/355


IR and passes
* [FixBug] Don't instantiate symbol for primitive functions by hjjq in https://github.com/hidet-org/hidet/pull/291
* [Fix] NCCL API mismatch and NCCL primitive fix by soodoshll in https://github.com/hidet-org/hidet/pull/301
* [Fixbug] Prevent allreduce op from being fused by soodoshll in https://github.com/hidet-org/hidet/pull/304
* [Enhancements] add a vcude device to help mitigate compile time GPU memory usage by xinli-git in https://github.com/hidet-org/hidet/pull/302
* [Task] More descriptive kernel names for nsys/ncu by Aalanli in https://github.com/hidet-org/hidet/pull/315
* [Fixbug][Hidet Script] Fix a bug that hidet script does not recognize return type by yaoyaoding in https://github.com/hidet-org/hidet/pull/329
* [Hidet script] Add `hidet.lang.types` submodule by yaoyaoding in https://github.com/hidet-org/hidet/pull/340
* [IR][Parser] Hidet IR grammar, parser and ir reconstructor by Aalanli in https://github.com/hidet-org/hidet/pull/354

Runtime
* [Runtime] Check for input tensor device by hjjq in https://github.com/hidet-org/hidet/pull/287
* [Fixbug] Is exiting fix by xinli-git in https://github.com/hidet-org/hidet/pull/293

Backends
* [Fixbug] Fix the c++ standard to c++11 for both nvcc and gcc compilers by yaoyaoding in https://github.com/hidet-org/hidet/pull/327
* [CPU][Scheduler] Use mutli-threads for autl-scheduler by yaoyaoding in https://github.com/hidet-org/hidet/pull/341

Documentation
* [Document] fix installation guide by soodoshll in https://github.com/hidet-org/hidet/pull/288
* [Docs] Update the documentation for the coming release by yaoyaoding in https://github.com/hidet-org/hidet/pull/360

Others
* [Version] Bump version to 0.3.0.dev by yaoyaoding in https://github.com/hidet-org/hidet/pull/286
* [Tools] simple benchmarking utility by Aalanli in https://github.com/hidet-org/hidet/pull/292
* [Compile Server] Support remote compilation via compilation server by yaoyaoding in https://github.com/hidet-org/hidet/pull/297
* [Compile Server] Allow the user to specify the repo and branch/tag to use by yaoyaoding in https://github.com/hidet-org/hidet/pull/300
* [Compile Server] Add a new option to specify the cuda arch by yaoyaoding in https://github.com/hidet-org/hidet/pull/305
* [Fixbug] Fix a bug in compile server by yaoyaoding in https://github.com/hidet-org/hidet/pull/306
* [Graph] Minor graph benchmark fix by Aalanli in https://github.com/hidet-org/hidet/pull/313
* [Regression] Local performance regression by hjjq in https://github.com/hidet-org/hidet/pull/321
* [Regression] Increase benchmark iters and update perf data by hjjq in https://github.com/hidet-org/hidet/pull/328
* [CI] List package versions in ci by yaoyaoding in https://github.com/hidet-org/hidet/pull/334
* [Fixbug] Clear the intermediate object files for kernel tuning by yaoyaoding in https://github.com/hidet-org/hidet/pull/339
* [Config] Add configuration file by Aalanli in https://github.com/hidet-org/hidet/pull/359


**Full Changelog**: https://github.com/hidet-org/hidet/compare/v0.2.4...v0.3.0

0.2.4

What's Changed
* [Version] Bump version to v0.2.4.dev by yaoyaoding in https://github.com/hidet-org/hidet/pull/188
* [Dynamo] module tests + operator support by AndreSlavescu in https://github.com/hidet-org/hidet/pull/148
* Refactor compilation workflow to support CPU without CUDA by LDY1998 in https://github.com/hidet-org/hidet/pull/189
* [Stack] Allow the the ulimit stack size less than expected by yaoyaoding in https://github.com/hidet-org/hidet/pull/195
* [Readme] Add platform requirements by yaoyaoding in https://github.com/hidet-org/hidet/pull/196
* [DataType] Add complex64 and complex128 data type by yaoyaoding in https://github.com/hidet-org/hidet/pull/200
* [Example] Add an example of running GPT-2 model by yaoyaoding in https://github.com/hidet-org/hidet/pull/203
* [Fusion] Use inline pass in fusion to allow template call functions with kernel params by yaoyaoding in https://github.com/hidet-org/hidet/pull/197
* [Frontend][Operator] Add missing operators for dinov2 by yaoyaoding in https://github.com/hidet-org/hidet/pull/206
* [Backend] Add openmp support by yaoyaoding in https://github.com/hidet-org/hidet/pull/208
* [Operator] Update batch_matmul to use Hidet Script by hjjq in https://github.com/hidet-org/hidet/pull/207
* [Cache] Add cache management command line interface by yaoyaoding in https://github.com/hidet-org/hidet/pull/212
* [IR] Creation-time constant fold for constant expressions by yaoyaoding in https://github.com/hidet-org/hidet/pull/209
* [Torch][Operator] Allow change torch tensor device when possible by yaoyaoding in https://github.com/hidet-org/hidet/pull/214
* [Torch][Operator] Add op mapping for torch.min/max/minimum/maximum by yaoyaoding in https://github.com/hidet-org/hidet/pull/216
* [Typo] Fix a typo in resnext.py by eltociear in https://github.com/hidet-org/hidet/pull/210
* [Operator] Adding missing operators for llama by yaoyaoding in https://github.com/hidet-org/hidet/pull/219
* [IR] Adding more support for dynamic shape on Task and FlowGraph level by yaoyaoding in https://github.com/hidet-org/hidet/pull/220
* [Torch] Add mapping for `torch.ops.aten.add` and `torch.ops.aten.cos` by yaoyaoding in https://github.com/hidet-org/hidet/pull/223
* [Operator][Backend] Add nvcc flags for faster math and update Attention schedule by hjjq in https://github.com/hidet-org/hidet/pull/221
* [CI] Always clear the cache before tests by yaoyaoding in https://github.com/hidet-org/hidet/pull/224
* fix batch_matmul for invalid mma config for sm < 80 by xinli-git in https://github.com/hidet-org/hidet/pull/227
* [Dynamic Shape] Adding more dynamic shape support by yaoyaoding in https://github.com/hidet-org/hidet/pull/228
* [CI] Add `importlib_metadata` to `requirements-dev.txt` by yaoyaoding in https://github.com/hidet-org/hidet/pull/233
* [Script] Add list comprehension support in hidet script by yaoyaoding in https://github.com/hidet-org/hidet/pull/235
* [Refactor][Dynamic Shape] Introduce SymbolVar to implement dynamic shape by yaoyaoding in https://github.com/hidet-org/hidet/pull/236
* [Script] Add pointer arthematic by yaoyaoding in https://github.com/hidet-org/hidet/pull/237
* [Operator][Torch] Add causal fmha and torch sdpa mapping by hjjq in https://github.com/hidet-org/hidet/pull/238
* [Fixbug][Pass] Fix a bug in the `inline_let_stmt` pass by yaoyaoding in https://github.com/hidet-org/hidet/pull/240
* [Options] Add option for controlling parallel build with number of jobs or memory reserved for each job by xinli-git in https://github.com/hidet-org/hidet/pull/230
* [Typo] Fix a typo by BolinSNLHM in https://github.com/hidet-org/hidet/pull/245
* [Typo] Fix minor spelling mistake by Aalanli in https://github.com/hidet-org/hidet/pull/246
* [Fixbug] Fix a bug in StmtRewriter which discard declare scope information by yaoyaoding in https://github.com/hidet-org/hidet/pull/248
* [Refactor] Adding support for compiled model by yaoyaoding in https://github.com/hidet-org/hidet/pull/247
* [Operator] batch_matmul: Remove duplicate smem declaration by hjjq in https://github.com/hidet-org/hidet/pull/249
* [Operator] Adding CPU support for matrix multiplication by BolinSNLHM in https://github.com/hidet-org/hidet/pull/251
* [Hidet Script] Allow `bind_tuple` argument in `mapping.on(...)` and `grid(...)` by yaoyaoding in https://github.com/hidet-org/hidet/pull/254
* [Hidet Script] Add `in` and `not in` expression in hidet script by yaoyaoding in https://github.com/hidet-org/hidet/pull/255
* [Codegen] Include header files as needed by yaoyaoding in https://github.com/hidet-org/hidet/pull/256
* [Operator] Add new operator "normalize" that makes a group of layers (layer norm, group norm and instance norm) faster using hidet script by xinli-git in https://github.com/hidet-org/hidet/pull/257
* [Testing][Models] Add gpt2 module in testing models by yaoyaoding in https://github.com/hidet-org/hidet/pull/252
* [Fixbug] Fix test warnings and the incompatibility of two recent PRs by yaoyaoding in https://github.com/hidet-org/hidet/pull/258
* [Operator] Add sm75 support for attention by hjjq in https://github.com/hidet-org/hidet/pull/259
* [Operator] batch_matmul: Remove unroll and reduce tuning space by hjjq in https://github.com/hidet-org/hidet/pull/260
* [Fixbug] Fix a bug when fused operator has no input by yaoyaoding in https://github.com/hidet-org/hidet/pull/263
* [Graph] Translate softmax and reduce to hidet script by Aalanli in https://github.com/hidet-org/hidet/pull/242
* [Fixbug] batch_matmul: move cc checking inside schedule by hjjq in https://github.com/hidet-org/hidet/pull/264
* [Refactor] Refactor building system and adding compiled products by yaoyaoding in https://github.com/hidet-org/hidet/pull/261
* [Fixbug] Reduce the default unroll factor to 4 by yaoyaoding in https://github.com/hidet-org/hidet/pull/266
* [Torch] Add some torch frontend mappings for roberta-base by hjjq in https://github.com/hidet-org/hidet/pull/267
* [Refactor] Remove `schedules` submodule under `hidet.graph.ops` by yaoyaoding in https://github.com/hidet-org/hidet/pull/269
* [Device] Add support for mixed cpu and cuda kernels in the same flow graph by yaoyaoding in https://github.com/hidet-org/hidet/pull/270
* [Dynamic Shape] Adding dynamic shape support for reduce by Aalanli in https://github.com/hidet-org/hidet/pull/268
* [Complex Type] Add more support for complex data type by yaoyaoding in https://github.com/hidet-org/hidet/pull/271
* [Tools] Model translator by Aalanli in https://github.com/hidet-org/hidet/pull/273
* [Model] Llama model implementation in hidet by Aalanli in https://github.com/hidet-org/hidet/pull/243
* [Operator] Add support for cross attention by hjjq in https://github.com/hidet-org/hidet/pull/275
* [Operator] Add dynamic shape support and tests for Operators. by Aalanli in https://github.com/hidet-org/hidet/pull/274
* [Fusion] Enhance the prologue epilogue fusion by yaoyaoding in https://github.com/hidet-org/hidet/pull/277
* [Drivers] Suppress OSError by hjjq in https://github.com/hidet-org/hidet/pull/278
* [Dynamic Shape] More correctness guards by Aalanli in https://github.com/hidet-org/hidet/pull/276
* [Operator] Make Convolution gemms fusible by resolving to batch_matmul by hjjq in https://github.com/hidet-org/hidet/pull/279
* [External Tasks] Move task build into method call for external kernel support by xinli-git in https://github.com/hidet-org/hidet/pull/282
* [Distributed] add nccl primitives by soodoshll in https://github.com/hidet-org/hidet/pull/280
* [Operators] Conv2d fp16 implicit gemm kernel by Aalanli in https://github.com/hidet-org/hidet/pull/283

New Contributors
* eltociear made their first contribution in https://github.com/hidet-org/hidet/pull/210
* BolinSNLHM made their first contribution in https://github.com/hidet-org/hidet/pull/245
* Aalanli made their first contribution in https://github.com/hidet-org/hidet/pull/246

**Full Changelog**: https://github.com/hidet-org/hidet/compare/v0.2.3...v0.2.4

0.2.3

What's Changed
* [Version] Bump version to v0.2.3.dev by yaoyaoding in https://github.com/hidet-org/hidet/pull/144
* [Workflow] Update workflow to use the stable version of pytorch by yaoyaoding in https://github.com/hidet-org/hidet/pull/145
* [Operator] Resolve matmul to batch_matmul when lower than sm80 by hjjq in https://github.com/hidet-org/hidet/pull/146
* [Dynamo] non-linear operator support + tests by AndreSlavescu in https://github.com/hidet-org/hidet/pull/143
* Remove tutorial msg by LDY1998 in https://github.com/hidet-org/hidet/pull/149
* [BUG] Conversion compile issue by xinli-git in https://github.com/hidet-org/hidet/pull/150
* [Dynamo] Fix dynamo tests and dump graph IR by xinli-git in https://github.com/hidet-org/hidet/pull/153
* [CI] Benchmark periodically by yaoyaoding in https://github.com/hidet-org/hidet/pull/155
* [CI] Update bench script by yaoyaoding in https://github.com/hidet-org/hidet/pull/156
* [CI] Add more env information to benchmark script by yaoyaoding in https://github.com/hidet-org/hidet/pull/157
* [CI] Remove benchmark workflow, but run it in dedicated server by yaoyaoding in https://github.com/hidet-org/hidet/pull/159
* [CI] Update benchmark script by yaoyaoding in https://github.com/hidet-org/hidet/pull/160
* [CI] Change the search space in benchmark script from 0 to 2 by yaoyaoding in https://github.com/hidet-org/hidet/pull/161
* [CI] Update benchmark script by yaoyaoding in https://github.com/hidet-org/hidet/pull/162
* [CI] Update benchmark scripts by yaoyaoding in https://github.com/hidet-org/hidet/pull/163
* [IR][Pass] Refactor the fusion implementation by yaoyaoding in https://github.com/hidet-org/hidet/pull/164
* [Dynamo] Add operator support to run UNet2DConditionModel from diffusers by xinli-git in https://github.com/hidet-org/hidet/pull/151
* [IR][Dynamic Shape] Enhance the Tensor Program IR to support dynamic shape by yaoyaoding in https://github.com/hidet-org/hidet/pull/165
* [Operator] Allow matmul_f16 fuse epilogue by yaoyaoding in https://github.com/hidet-org/hidet/pull/167
* [CI] Update benchmark script by yaoyaoding in https://github.com/hidet-org/hidet/pull/168
* [CUDA] Lazy initializing cuda context by yaoyaoding in https://github.com/hidet-org/hidet/pull/169
* [Fixbug] Allow one backend fail in benchmark script by yaoyaoding in https://github.com/hidet-org/hidet/pull/170
* [Fixbug] Use auto-scheduler for fp64 reduction by yaoyaoding in https://github.com/hidet-org/hidet/pull/171
* [Operator] Add `gather` operator and `torch.zeros`, `torch.neg` mapping by yaoyaoding in https://github.com/hidet-org/hidet/pull/174
* [CI] Update benchmark script by yaoyaoding in https://github.com/hidet-org/hidet/pull/179
* [Fixbug] Add `_stacklevel` to pytorch softmax mapping by yaoyaoding in https://github.com/hidet-org/hidet/pull/178
* [IR] Add unroll pragma for loop statement by yaoyaoding in https://github.com/hidet-org/hidet/pull/180
* [Operator] Flash Attention by hjjq in https://github.com/hidet-org/hidet/pull/175
* [Fixbug] Fix a bug in the mapping from device to its memory pool by yaoyaoding in https://github.com/hidet-org/hidet/pull/181
* [Dynamo] Small enchancements for graph dump ir and task arguments by xinli-git in https://github.com/hidet-org/hidet/pull/172
* [Docs] Update install instruction by hjjq in https://github.com/hidet-org/hidet/pull/182
* change norm to use smaller inputs to reduce running time by xinli-git in https://github.com/hidet-org/hidet/pull/185
* [IR] Add explicit unroll by yaoyaoding in https://github.com/hidet-org/hidet/pull/184
* [Runtime] Allow pass torch tensor to `PackedFunc` directly by yaoyaoding in https://github.com/hidet-org/hidet/pull/183
* Refactor codegen to separate GPU/CPU code generation by LDY1998 in https://github.com/hidet-org/hidet/pull/176
* [Pass] Support inline function by yaoyaoding in https://github.com/hidet-org/hidet/pull/186

New Contributors
* LDY1998 made their first contribution in https://github.com/hidet-org/hidet/pull/149

**Full Changelog**: https://github.com/hidet-org/hidet/compare/v0.2.2...v0.2.3

0.2.2

What's Changed
* [Version] Bump version to 0.2.2.dev by yaoyaoding in https://github.com/hidet-org/hidet/pull/118
* [Option] Add `debug_cache_tuning` option by yaoyaoding in https://github.com/hidet-org/hidet/pull/120
* [Fix] Remove lambda in shfl primitives by hjjq in https://github.com/hidet-org/hidet/pull/121
* [IR][Refactor] Refactor the functor/visitor/rewriters by yaoyaoding in https://github.com/hidet-org/hidet/pull/122
* [Fixbug] Fix bug in IR Printer by hjjq in https://github.com/hidet-org/hidet/pull/123
* [Fixbug] Fix a bug in IRModule.update_function by yaoyaoding in https://github.com/hidet-org/hidet/pull/124
* [Frontend] Fix typo by digital-nomad-cheng in https://github.com/hidet-org/hidet/pull/127
* [Operator] Add the support of using external kernels in hidet by yaoyaoding in https://github.com/hidet-org/hidet/pull/128
* [Tests] Reorganize tests files for frontends by yaoyaoding in https://github.com/hidet-org/hidet/pull/129
* [Dynamo] Added Operator Support by AndreSlavescu in https://github.com/hidet-org/hidet/pull/131
* [Fixbug] Allow grid compute to be inlined by hjjq in https://github.com/hidet-org/hidet/pull/134
* [Graph] Cast optimizations by xinli-git in https://github.com/hidet-org/hidet/pull/135
* [Fixbug] Fix a bug that map blockDim to blockIdx by yaoyaoding in https://github.com/hidet-org/hidet/pull/136
* [Fixbug] Fix a bug in rule based simplifier by yaoyaoding in https://github.com/hidet-org/hidet/pull/137
* [Workflow] Update concurrent graph of the ci workflow by yaoyaoding in https://github.com/hidet-org/hidet/pull/138
* [Runtime] Add `src_path` and `source()` members to `CompiledFunction` by yaoyaoding in https://github.com/hidet-org/hidet/pull/139
* [Runtime][IR] Support colored source code; add blockDim to extern_vars by yaoyaoding in https://github.com/hidet-org/hidet/pull/140
* [Fixbug] Convert tensor to cpu before dumping by hjjq in https://github.com/hidet-org/hidet/pull/141

New Contributors
* digital-nomad-cheng made their first contribution in https://github.com/hidet-org/hidet/pull/127
* xinli-git made their first contribution in https://github.com/hidet-org/hidet/pull/135

**Full Changelog**: https://github.com/hidet-org/hidet/compare/v0.2.1...v0.2.2

0.2.1

What's Changed
* [Version] Bump version to 0.2.1.dev by yaoyaoding in https://github.com/hidet-org/hidet/pull/73
* [CI] Prevent fork repos from running workflow by yaoyaoding in https://github.com/hidet-org/hidet/pull/74
* [Fixbug] Fix a bug in ``trace_from`` when the inputs are directly used as outputs by yaoyaoding in https://github.com/hidet-org/hidet/pull/76
* [Operator] Add reduce_f16 and squeeze as Reduce's resolve variants by hjjq in https://github.com/hidet-org/hidet/pull/75
* [IR] Input specification assertion message for valid IR check by AndreSlavescu in https://github.com/hidet-org/hidet/pull/78
* [Operator] Add conv3d, max_pool3d, avg_pool3d by hjjq in https://github.com/hidet-org/hidet/pull/79
* [Dynamo] Add the entry point registration for dynamo by yaoyaoding in https://github.com/hidet-org/hidet/pull/80
* [Fix] Update shape utility functions to expect Sequence instead of List by yaoyaoding in https://github.com/hidet-org/hidet/pull/86
* [Bugfix] 'double'->'float64' in onnx dtype conversion by soodoshll in https://github.com/hidet-org/hidet/pull/88
* [Fix] Mark the reduce fp16 operator not fusible by yaoyaoding in https://github.com/hidet-org/hidet/pull/100
* [Fixbug] Use uint64_t instead of unsigned long long for literals by yaoyaoding in https://github.com/hidet-org/hidet/pull/101
* [Fixbug] Fix a bug in the minimum and maximum operator by yaoyaoding in https://github.com/hidet-org/hidet/pull/102
* [Dynamo] Update dynamo registration after pytorch refactored that part by yaoyaoding in https://github.com/hidet-org/hidet/pull/84
* [Fixbug] Fix bugs in binary_arithmetic op and swizzle layout by hjjq in https://github.com/hidet-org/hidet/pull/104
* [Fixbug] Call fuse in reduce_fp16 operator by yaoyaoding in https://github.com/hidet-org/hidet/pull/105
* [ONNX] Fix the out of bound error in onnx slice function during importing by yaoyaoding in https://github.com/hidet-org/hidet/pull/106
* [Fixbug] Reverse map of binary operator by yaoyaoding in https://github.com/hidet-org/hidet/pull/107
* [Fixbug] Add attributes to Clip operator by yaoyaoding in https://github.com/hidet-org/hidet/pull/108
* [Fixbug] Binary arthmatic ops raise error when one is scalar on GPU by yaoyaoding in https://github.com/hidet-org/hidet/pull/109
* [Graph] Refactor forward function of FlowGraph by yaoyaoding in https://github.com/hidet-org/hidet/pull/110
* [Fixbug] Use int64 as the output of arg-reduce by yaoyaoding in https://github.com/hidet-org/hidet/pull/111
* [README] Update readme by yaoyaoding in https://github.com/hidet-org/hidet/pull/114
* [Fixbug] Fix a bug when an graph output is constant by yaoyaoding in https://github.com/hidet-org/hidet/pull/113
* [Community] Create CODE_OF_CONDUCT.md by yaoyaoding in https://github.com/hidet-org/hidet/pull/115
* [Community] Update issue templates by yaoyaoding in https://github.com/hidet-org/hidet/pull/116
* [Fixbug] Resolve the min/max function according to compute capability by yaoyaoding in https://github.com/hidet-org/hidet/pull/112
* [Workflow] Update workflow by yaoyaoding in https://github.com/hidet-org/hidet/pull/117
* [Workflow] Update publish workflow by yaoyaoding in https://github.com/hidet-org/hidet/pull/119

New Contributors
* soodoshll made their first contribution in https://github.com/hidet-org/hidet/pull/88

**Full Changelog**: https://github.com/hidet-org/hidet/compare/v0.2.0...v0.2.1

Page 1 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.