* [engine] support async and streaming by ByronHsu in https://github.com/sgl-project/sglang/pull/1614
* [Fix] Fix the style of test_large_max_new_tokens.py by merrymercy in https://github.com/sgl-project/sglang/pull/1638
* fix missing ignore_eos in v1/chat/completions by learninmou in https://github.com/sgl-project/sglang/pull/1642
* Fix ignore_eos in the OpenAI ChatCompletions API by merrymercy in https://github.com/sgl-project/sglang/pull/1645
* [Feature, Hardware] Enable SGLang on XPU GPUs via PyTorch by liangan1 in https://github.com/sgl-project/sglang/pull/1480
* Fix unit tests and type annotations by merrymercy in https://github.com/sgl-project/sglang/pull/1648
* Add an option to disable penalizer by merrymercy in https://github.com/sgl-project/sglang/pull/1651
* Add get_tokenizer function for Engine class by pjyi2147 in https://github.com/sgl-project/sglang/pull/1653
* Fix the batch_is_full check for jump-forward decoding by merrymercy in https://github.com/sgl-project/sglang/pull/1654
* Simplify the event loop and expose `--num-continuous-decode-steps` as an argument by merrymercy in https://github.com/sgl-project/sglang/pull/1652
* [doc] Add engine section in backend.md by ByronHsu in https://github.com/sgl-project/sglang/pull/1656
* [Fix] fix eos trim inconsistency by Ying1123 in https://github.com/sgl-project/sglang/pull/1650
* Add output_ids into ScheduleBatch by merrymercy in https://github.com/sgl-project/sglang/pull/1659
* [Minor] Rename no_eos_trim to no_stop_trim by Ying1123 in https://github.com/sgl-project/sglang/pull/1661
* Add a test case to test retract by merrymercy in https://github.com/sgl-project/sglang/pull/1662
* Move filter_batch out of stream_output by merrymercy in https://github.com/sgl-project/sglang/pull/1663
* Support double sparsity by andy-yang-1 in https://github.com/sgl-project/sglang/pull/1459
* Fix unit test order to balance the tasks in CI by merrymercy in https://github.com/sgl-project/sglang/pull/1665
* [Minor] Improve style by merrymercy in https://github.com/sgl-project/sglang/pull/1666
* Simplify chunked prefill by merrymercy in https://github.com/sgl-project/sglang/pull/1667
* [1/N] Remove `CacheConfig` import in all model files by ByronHsu in https://github.com/sgl-project/sglang/pull/1658
* [doc] improve engine doc and add to readme by ByronHsu in https://github.com/sgl-project/sglang/pull/1670
* [Minor] Add some utility functions by merrymercy in https://github.com/sgl-project/sglang/pull/1671
* Improve benchmark scripts by merrymercy in https://github.com/sgl-project/sglang/pull/1672
* Fix memory leak during abort by merrymercy in https://github.com/sgl-project/sglang/pull/1674
* Fix filter_batch function call by hnyls2002 in https://github.com/sgl-project/sglang/pull/1681
* Add OLMo model by janimo in https://github.com/sgl-project/sglang/pull/1676
* Add a new event loop by merrymercy in https://github.com/sgl-project/sglang/pull/1677
* Fix srt dependency by ispobock in https://github.com/sgl-project/sglang/pull/1685
* [Event] Add online meetup meeting link by Ying1123 in https://github.com/sgl-project/sglang/pull/1686
* Launch a thread to overlap CPU and GPU by merrymercy in https://github.com/sgl-project/sglang/pull/1687
* Return a per-request metric for the number of cached_tokens read by havetc in https://github.com/sgl-project/sglang/pull/1599
* Add orjson for JSON responses by michaelfeil in https://github.com/sgl-project/sglang/pull/1688
* Update README.md by merrymercy in https://github.com/sgl-project/sglang/pull/1689
* Add date to logging messages (1623) by zeng-zc in https://github.com/sgl-project/sglang/pull/1679
* Update the transformers version in CI by merrymercy in https://github.com/sgl-project/sglang/pull/1690
* Use SGLang imports for linear layer by janimo in https://github.com/sgl-project/sglang/pull/1696
* feat: optimize radix tree code by wxsms in https://github.com/sgl-project/sglang/pull/1697
* orjson: faster JSON serialization by michaelfeil in https://github.com/sgl-project/sglang/pull/1694
* Fix the failed unit tests by merrymercy in https://github.com/sgl-project/sglang/pull/1699
* Fix failed CI tests on long prompts; better error messages for embedding models by merrymercy in https://github.com/sgl-project/sglang/pull/1700
* Fix engine unit test by merrymercy in https://github.com/sgl-project/sglang/pull/1701
* Fix mixed batch for multi modal models by merrymercy in https://github.com/sgl-project/sglang/pull/1702
* Add matched_stop (token or str) to distinguish eos from stop str in finish_reason by g-drozdov in https://github.com/sgl-project/sglang/pull/1684
* Fix regex and logprob conflicts when chunked prefilling by hnyls2002 in https://github.com/sgl-project/sglang/pull/1703
* Simplify flashinfer utilities by merrymercy in https://github.com/sgl-project/sglang/pull/1704
* Add dtype for more operations by merrymercy in https://github.com/sgl-project/sglang/pull/1705
* Add grouped free operations by merrymercy in https://github.com/sgl-project/sglang/pull/1706
* Skip unnecessary penalizer by merrymercy in https://github.com/sgl-project/sglang/pull/1707
* Simplify the nan detection and greedy check in sampler by merrymercy in https://github.com/sgl-project/sglang/pull/1709
* Fix `is_all_ready` for overlap copy by merrymercy in https://github.com/sgl-project/sglang/pull/1710
* Fix the race condition in overlap mode by merrymercy in https://github.com/sgl-project/sglang/pull/1712
* Update README.md by merrymercy in https://github.com/sgl-project/sglang/pull/1713