<!-- Release notes generated using configuration in .github/release.yml at v0.2.3 -->
What's Changed
We now have 2 CLI scripts for sharing Lilac datasets (via huggingface):
bash
lilac upload local/Capybara --url_or_repo=lilacai/Capybara
To download the dataset to a local project directory:
bash
lilac download lilacai/Capybara
For more details on sharing datasets, see the [Sharing Guide](https://docs.lilacml.com//datasets/dataset_share.html)
With this change, we added a new environment variable `USE_TABLE_INDEX`, useful for frozen demos. This will dramatically improve the performance of queries as we use a cached DuckDB table. This will slow down labeling, or any edits, as the table will get re-computed upon each change.
Upload / Download
* Add an upload dataset script. Some other cleanups. by nsthorat in https://github.com/lilacai/lilac/pull/1059
Bug fixes
* Fix a bug with CSV source reader for TSV files, and named columns. by nsthorat in https://github.com/lilacai/lilac/pull/1040
* Progress bar by brilee in https://github.com/lilacai/lilac/pull/1043
* Fix bug with ItemMedia not rendering media fields that are deeply nested siblings. by nsthorat in https://github.com/lilacai/lilac/pull/1044
* Fix clustering an enriched field by dsmilkov in https://github.com/lilacai/lilac/pull/1048
* Propagate filters in the group by panel by dsmilkov in https://github.com/lilacai/lilac/pull/1041
Performance
* Add indexing on database startup, flag-guarded by brilee in https://github.com/lilacai/lilac/pull/1052
UI
* Add clustering in the UI by dsmilkov in https://github.com/lilacai/lilac/pull/1045
* Add search to the cluster UI. Add some polish. by nsthorat in https://github.com/lilacai/lilac/pull/1054
* Add clusters to the schema menu. Migrate to a custom carousel component so the page doesn't freeze. by nsthorat in https://github.com/lilacai/lilac/pull/1050
Clustering
* Add `dataset.cluster(input)` where `input` can be any lambda func by dsmilkov in https://github.com/lilacai/lilac/pull/1042
* `dataset.cluster()` flattens any repeated before clustering by dsmilkov in https://github.com/lilacai/lilac/pull/1051
Lilac Garden
* Add remote computation for the jina embedding by dsmilkov in https://github.com/lilacai/lilac/pull/1049
Other Changes
* Move the import of .env.local in publish_pip to the top of the file. by nsthorat in https://github.com/lilacai/lilac/pull/1039
* fix: migrate embeddings by azure openai to openai > 1.0.0 by dechantoine in https://github.com/lilacai/lilac/pull/1053
* Streamline lilac deployment by brilee in https://github.com/lilacai/lilac/pull/1057
* Add a notebook for working with concepts from python. by nsthorat in https://github.com/lilacai/lilac/pull/1055
**Full Changelog**: https://github.com/lilacai/lilac/compare/v0.2.2...v0.2.3