Ray Libraries
Ray Data
🎉 New Features:
- Added read_hudi (https://github.com/ray-project/ray/pull/46273)
💫 Enhancements:
- Improved performance of DelegatingBlockBuilder (https://github.com/ray-project/ray/pull/48509)
- Improved memory accounting of pandas blocks (https://github.com/ray-project/ray/pull/46939)
🔨 Fixes:
- Fixed bug where you can’t specify a schema with write_parquet (https://github.com/ray-project/ray/issues/48630)
- Fixed bug where to_pandas errors if your dataset contains Arrow and pandas blocks (https://github.com/ray-project/ray/pull/48583)
- Fixed bug where map_groups doesn’t work with pandas data (https://github.com/ray-project/ray/pull/48287)
- Fixed bug where write_parquet errors if your data contains nullable fields (https://github.com/ray-project/ray/pull/48478)
- Fixed bug where the “Iteration Blocked Time” chart looks incorrect (https://github.com/ray-project/ray/pull/48618)
- Fixed bug where unique fails with null values (https://github.com/ray-project/ray/pull/48750)
- Fixed bug where “Rows Outputted” is 0 in the Data dashboard (https://github.com/ray-project/ray/pull/48745)
- Fixed bug where methods like drop_columns cause spilling (https://github.com/ray-project/ray/pull/48140)
- Fixed bug where async map tasks hang (https://github.com/ray-project/ray/pull/48861)
🗑️ Deprecations:
- Deprecated read_parquet_bulk (https://github.com/ray-project/ray/pull/48691)
- Deprecated iter_tf_batches (https://github.com/ray-project/ray/pull/48693)
- Deprecated meta_provider parameter of read functions (https://github.com/ray-project/ray/pull/48690)
- Deprecated to_torch (https://github.com/ray-project/ray/pull/48692)
Ray Train
🔨 Fixes:
- Fix StartTracebackWithWorkerRank serialization (48548)
📖 Documentation:
- Add example for fine-tuning Llama3.1 with AWS Trainium (48768)
Ray Tune
🔨 Fixes:
- Remove the `clear_checkpoint` function during Trial restoration error handling. (48532)
Ray Serve
🎉 New Features:
- Initial version of local_testing_mode ([48477](https://github.com/ray-project/ray/pull/48477))
💫 Enhancements:
- Handle multiple changed objects per LongPollHost.listen_for_change RPC ([48803](https://github.com/ray-project/ray/pull/48803/files))
- Add more nuanced checks for http proxy status errors ([47896](https://github.com/ray-project/ray/pull/47896))
- Improve replica access log messages to include HTTP status info and better resemble standard log format ([48819](https://github.com/ray-project/ray/pull/48819))
- Propagate replica constructor error to deployment status message and print the number of retries left ([48531](https://github.com/ray-project/ray/pull/48531))
🔨 Fixes:
- Pending requests that are cancelled before they are assigned to a replica now also return a serve.RequestCancelledError ([48496](https://github.com/ray-project/ray/pull/48496))
RLlib
💫 Enhancements:
- Release test enhancements. ([45803](https://github.com/ray-project/ray/pull/45803), [#48681](https://github.com/ray-project/ray/pull/48681))
- Make opencv-python-headless default over opencv-python ([48776](https://github.com/ray-project/ray/pull/48776))
- Reverse learner queue behavior of IMPALA/APPO: consume the oldest batches first instead of the newest, but drop the oldest batches if the queue is full. ([48702](https://github.com/ray-project/ray/pull/48702))
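
The learner queue behavior described for IMPALA/APPO (consume oldest first, drop oldest when full) can be illustrated with a small standalone sketch. This is not RLlib's implementation; the class name `LearnerQueueSketch` and its `put`/`get` methods are hypothetical and only model the ordering and eviction policy with a bounded `collections.deque`.

```python
from collections import deque


class LearnerQueueSketch:
    """Hypothetical bounded queue: FIFO consumption, oldest-first eviction."""

    def __init__(self, maxsize):
        # A deque with maxlen silently evicts from the left (oldest) on append.
        self._buf = deque(maxlen=maxsize)
        self.num_dropped = 0

    def put(self, batch):
        # Count the eviction that the append below is about to cause.
        if len(self._buf) == self._buf.maxlen:
            self.num_dropped += 1
        self._buf.append(batch)

    def get(self):
        # Consume the oldest batch still in the queue.
        return self._buf.popleft()

    def __len__(self):
        return len(self._buf)


# Five batches arrive while the learner is busy; only three fit.
q = LearnerQueueSketch(maxsize=3)
for batch_id in range(5):
    q.put(batch_id)

# The learner then drains the queue, oldest surviving batch first.
consumed = [q.get() for _ in range(len(q))]
```

In this sketch batches 0 and 1 are evicted to make room, and the remaining batches are consumed in arrival order.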
🔨 Fixes:
- Fix torch scheduler stepping and reporting. ([48125](https://github.com/ray-project/ray/pull/48125))
- Fix accumulation of results over n training_step calls within same iteration (new API stack). ([48136](https://github.com/ray-project/ray/pull/48136))
- Various other fixes: [48563](https://github.com/ray-project/ray/pull/48563), [#48314](https://github.com/ray-project/ray/pull/48314), [#48698](https://github.com/ray-project/ray/pull/48698), [#48869](https://github.com/ray-project/ray/pull/48869).
📖 Documentation:
- Upgrade examples script overview page (new API stack). ([48526](https://github.com/ray-project/ray/pull/48526))
- Enable RLlib + Serve example in CI and translate to new API stack. ([48687](https://github.com/ray-project/ray/pull/48687))
🏗 Architecture refactoring:
- Switch the new API stack on by default for APPO, IMPALA, BC, MARWIL, and CQL. ([48516](https://github.com/ray-project/ray/pull/48516), [#48599](https://github.com/ray-project/ray/pull/48599))
- Various APPO enhancements (new API stack): circular buffer ([#48798](https://github.com/ray-project/ray/pull/48798)), minor loss math fixes ([#48800](https://github.com/ray-project/ray/pull/48800)), target network update logic ([#48802](https://github.com/ray-project/ray/pull/48802)), smaller cleanups ([#48844](https://github.com/ray-project/ray/pull/48844)).
- Remove `rllib_contrib` from repo. ([48565](https://github.com/ray-project/ray/pull/48565))
Ray Core and Ray Clusters
Ray Core
🎉 New Features:
- [Core] uv runtime env support ([48479](https://github.com/ray-project/ray/pull/48479), [#48486](https://github.com/ray-project/ray/pull/48486), [#48611](https://github.com/ray-project/ray/pull/48611), [#48619](https://github.com/ray-project/ray/pull/48619), [#48632](https://github.com/ray-project/ray/pull/48632), [#48634](https://github.com/ray-project/ray/pull/48634), [#48637](https://github.com/ray-project/ray/pull/48637), [#48670](https://github.com/ray-project/ray/pull/48670), [#48731](https://github.com/ray-project/ray/pull/48731))
- [Core] GCS FT with redis sentinel (47335)
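
As a rough illustration of the uv runtime env support listed above, the sketch below passes a `uv` entry in `runtime_env` to `ray.init`. The package name `emoji`, the constant `UV_RUNTIME_ENV`, and the helper `run_with_uv_env` are placeholders chosen for this example, and it assumes the `uv` field accepts a list of requirement specifiers in the same way the `pip` field does.

```python
# Placeholder requirement specifiers; swap in whatever your tasks need.
UV_RUNTIME_ENV = {"uv": ["emoji"]}


def run_with_uv_env(runtime_env=UV_RUNTIME_ENV):
    """Start Ray with the given runtime env and run one task that uses the package."""
    # Imported lazily so the config above can be inspected without Ray installed.
    import ray

    ray.init(runtime_env=runtime_env)

    @ray.remote
    def emojize(text):
        # Resolved inside the worker's environment, which is provisioned with uv.
        import emoji

        return emoji.emojize(text)

    try:
        return ray.get(emojize.remote("Ray :thumbs_up:"))
    finally:
        ray.shutdown()
```

Calling `run_with_uv_env()` on a machine with Ray installed should install the listed packages for the workers before the task runs; the driver itself does not need `emoji` installed.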
💫 Enhancements:
- [CompiledGraphs] Refine schedule visualization (48594)
🔨 Fixes:
- [CompiledGraphs] Don't persist input_nodes in _CollectiveOperation to avoid misinterpreting DAGs (48463)
- [Core] Fix Ascend NPU discovery to support 8+ cards per node (48543)
- [Core] Make Placement Group Wildcard and Indexed Resource Assignments Consistent (48088)
- [Core] Stop the gRPC server before shutting down the Object Store (48572)
Ray Clusters
🔨 Fixes:
- [KubeRay]: Fix ConnectionError on Autoscaler CR lookups in K8s clusters with custom DNS for Kubernetes API. ([48541](https://github.com/ray-project/ray/pull/48541))
Dashboard
💫 Enhancements:
- Add global UTC timezone button in navbar with local storage (48510)
- Add memory graphs optimized for OOM debugging (48530)
- Improve tasks/actors metric naming and add graph for running tasks (48528)
- Add actor PID to dashboard (48791)
🔨 Fixes:
- Fix cell overflow in the Placement Group table (47323)
- Fix Rows Outputted being zero on Ray Data Dashboard (48745)
- Fix confusing dataset operator name (48805)
Thanks
Thanks to all those who contributed to this release!
rynewang, rickyyx, bveeramani, marwan116, simonsays1980, dayshah, dentiny, KepingYan, mimiliaogo, kevin85421, SeaOfOcean, stephanie-wang, mohitjain2504, azayz, xushiyan, richardliaw, can-anyscale, xingyu-long, kanwang, aslonnie, MortalHappiness, jjyao, SumanthRH, matthewdeng, alexeykudinkin, sven1977, raulchen, andrewsykim, zcin, nadongjun, hongpeng-guo, miguelteixeiraa, saihaj, khluu, ArturNiederfahrenhorst, ryanaoleary, ltbringer, pcmoritz, JoshKarpel, akyang-anyscale, frances720, BeingGod, edoakes, Bye-legumes, Superskyyy, liuxsh9, MengjinYan, ruisearch42, scottjlee, angelinalg