Lilac

Latest version: v0.3.9


0.1.6

Not secure

Features
* Make labels toggle-able by dsmilkov in https://github.com/lilacai/lilac/pull/748
* Add HDBScan with UMAP by nsthorat in https://github.com/lilacai/lilac/pull/749
* Add lilac docker image and use a lighter 5x image for HF by dsmilkov in https://github.com/lilacai/lilac/pull/750

Bug fixes / Other
* Fix duplicate stats requests by dsmilkov in https://github.com/lilacai/lilac/pull/754
* Add CLI prompts for token in deploy script by nsthorat in https://github.com/lilacai/lilac/pull/755
* Add docker deploy instructions in `dev.md` and multi-platform build by dsmilkov in https://github.com/lilacai/lilac/pull/753

**Full Changelog**: https://github.com/lilacai/lilac/compare/v0.1.15...v0.1.6

0.1.4

Not secure

What's Changed
Other Changes
* Add `--hf_token` and don't require it when we have public datasets, or we set load on start server. by nsthorat in https://github.com/lilacai/lilac/pull/727


**Full Changelog**: https://github.com/lilacai/lilac/compare/v0.1.3...v0.1.4

0.1.3

Not secure

You can now deploy Lilac to a HuggingFace Space with just a few lines of Python, or from the CLI.

Deploy a single configuration object, and have it load entirely on the space:
```python
import lilac as ll

ll.deploy_config(
  hf_space='nsthorat-lilac/nikhil-demo',
  # Create the space if it doesn't exist.
  create_space=True,
  config=ll.Config(datasets=[
    ll.DatasetConfig(
      namespace='local',
      name='glue_ax',
      source=ll.HuggingFaceSource(dataset_name='glue', config_name='ax'))
  ]))
```


Deploy a Lilac project you've loaded locally:
```python
import lilac as ll

ll.deploy_project(
  hf_space='nsthorat-lilac/nikhil-project-demo',
  project_dir='./data',
  # This is optional. If not defined, uploads all datasets.
  datasets=['local/glue_ax'],
  # Create the space if it doesn't exist.
  create_space=True)
```


Or via the CLI:

`lilac deploy-project --project_dir='~/my_project'`

Features
* Completely overhaul the deploy scripts. by nsthorat in https://github.com/lilacai/lilac/pull/725

Other Changes
* Improve the schema view to be less overwhelming. by nsthorat in https://github.com/lilacai/lilac/pull/726

Demo
* Add the textbook quality programming dataset to the demo. Clean up old datasets. by nsthorat in https://github.com/lilacai/lilac/pull/723


**Full Changelog**: https://github.com/lilacai/lilac/compare/v0.1.2...v0.1.3

0.1.2

Not secure

Internal change
* Migrate the span values to use `__span__` and make the code back-compat by dsmilkov in https://github.com/lilacai/lilac/pull/718


**Full Changelog**: https://github.com/lilacai/lilac/compare/v0.1.1...v0.1.2

0.1.1

Not secure

Overview
- Embedding computation can now be larger-than-RAM! Computing lots of embeddings will iteratively write to a vector store.
- JSON and CSV sources are heavily optimized and go through duckdb for parsing.
- Clustering now supports semantic clustering with embeddings, using DBScan.
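
The larger-than-RAM point above comes down to processing inputs as an iterable and flushing each batch of embeddings to storage before reading the next one. The following is a minimal sketch of that pattern in plain Python, not Lilac's implementation: `InMemoryStore`, `toy_embed`, `batched`, and `embed_iteratively` are hypothetical names introduced only for illustration.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List

Vector = List[float]


def batched(items: Iterable[str], batch_size: int) -> Iterator[List[str]]:
  """Yield successive lists of at most `batch_size` items."""
  iterator = iter(items)
  while True:
    batch = list(islice(iterator, batch_size))
    if not batch:
      return
    yield batch


class InMemoryStore:
  """Hypothetical stand-in for a vector store that accepts incremental writes."""

  def __init__(self) -> None:
    self.vectors: List[Vector] = []
    self.num_writes = 0

  def add(self, vectors: List[Vector]) -> None:
    self.vectors.extend(vectors)
    self.num_writes += 1


def toy_embed(texts: List[str]) -> List[Vector]:
  """Hypothetical embedding: (character count, word count) per text."""
  return [[float(len(t)), float(len(t.split()))] for t in texts]


def embed_iteratively(
    texts: Iterable[str],
    embed: Callable[[List[str]], List[Vector]],
    store: InMemoryStore,
    batch_size: int = 2) -> None:
  """Embed `texts` batch by batch, writing each batch before reading the next."""
  for batch in batched(texts, batch_size):
    store.add(embed(batch))


store = InMemoryStore()
# A generator, so the full input is never materialized as a list.
docs = (f'document number {i}' for i in range(5))
embed_iteratively(docs, toy_embed, store, batch_size=2)
print(len(store.vectors), store.num_writes)
```

Only one batch of texts and vectors is held by the loop at a time. A real store would persist each batch to disk rather than keep it in a Python list, as this stand-in does.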

New features
* Add SQLite source and optimize the JSON and CSV sources by dsmilkov in https://github.com/lilacai/lilac/pull/710
* Add a dict source and convert `LangSmith` source to use it by dsmilkov in https://github.com/lilacai/lilac/pull/716
* Add clustering signal by dsmilkov in https://github.com/lilacai/lilac/pull/711

Performance
* Use iterables for compute_signal and compute_embedding. by nsthorat in https://github.com/lilacai/lilac/pull/706
* Write embeddings to the vector store iteratively by nsthorat in https://github.com/lilacai/lilac/pull/709
* Add SQLite source and optimize the JSON and CSV sources by dsmilkov in https://github.com/lilacai/lilac/pull/710
* Speed up the docker image build step by installing lilac from pip before installing the local wheel. by nsthorat in https://github.com/lilacai/lilac/pull/714
* Improve perf of server by removing UUID sort by dsmilkov in https://github.com/lilacai/lilac/pull/715

Bug fixes
* Fix semantic search on repeated by dsmilkov in https://github.com/lilacai/lilac/pull/704
* Fix syntax error with keyword search by dsmilkov in https://github.com/lilacai/lilac/pull/705
* Fix bug with span highlighting a repeated field by nsthorat in https://github.com/lilacai/lilac/pull/713
* Change the bootup load to be during the new FastAPI lifecycle API. by nsthorat in https://github.com/lilacai/lilac/pull/717


**Full Changelog**: https://github.com/lilacai/lilac/compare/v0.1.0...v0.1.1

0.1.0

Not secure
New Features

Lilac now supports labeling! For a detailed guide, see [Labeling a dataset](https://lilacml.com/datasets/dataset_labels.html)

Labels can be added for individual rows:

```python
dataset.add_labels(
  'good',
  row_ids=['0003076800f1471f8f4c8a1b2deda742'])
```


Or for slices of the data:

```python
dataset.add_labels(
  'short',
  filters=[
    (('text', 'text_statistics', 'num_characters'), 'less', 1000)
  ]
)
```
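
Lilac evaluates such filters itself. The sketch below only illustrates, in plain Python, what a `(path, op, value)` filter of this shape expresses: follow the path into each row and compare the value found there. The nested row layout and the helpers `get_path` and `matches` are assumptions made for illustration, not part of Lilac's API.

```python
from typing import Any, Dict, Tuple

Path = Tuple[str, ...]
Row = Dict[str, Any]


def get_path(row: Row, path: Path) -> Any:
  """Follow `path` through nested dicts; return None if any key is missing."""
  value: Any = row
  for key in path:
    if not isinstance(value, dict) or key not in value:
      return None
    value = value[key]
  return value


def matches(row: Row, path: Path, op: str, threshold: float) -> bool:
  """Evaluate one (path, op, value) filter; only 'less' is sketched here."""
  if op != 'less':
    raise ValueError(f'unsupported op in this sketch: {op}')
  value = get_path(row, path)
  return value is not None and value < threshold


# Hypothetical rows; the nested layout is an assumption for illustration.
rows = [
  {'id': 'a', 'text': {'text_statistics': {'num_characters': 120}}},
  {'id': 'b', 'text': {'text_statistics': {'num_characters': 4200}}},
  {'id': 'c', 'text': {}},
]
path = ('text', 'text_statistics', 'num_characters')
short_ids = [r['id'] for r in rows if matches(r, path, 'less', 1000)]
print(short_ids)
```

In this sketch, rows that lack the statistic are simply not selected.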

They can then be exported:

```python
short_rows = list(
  dataset.select_rows(
    ['*', 'short'],
    filters=[
      (('short', 'label'), 'exists')
    ]
  )
)
# Print the first row.
print(short_rows[0])
```


Output:

```
{
  '__rowid__': '0003076800f1471f8f4c8a1b2deda742',
  'text': 'If you want to truly experience the magic (?) of Don Dohler, then check out "Alien Factor" or maybe "Fiend", but not this. Alien Factor is actually rather imaginative considering the low budget and it\'s fairly creepy, but "Nightbeast", which I guess is sort of an updating of Alien Factor, is just plain dumb. Actors sleepwalk through their roles, especially Mr. Monotone sheriff, and the monster is some dumb Halloween-mask kind of thing instead of the wildly imaginative (but kind of stupid) looking critters from Alien Factor. A spaceship crashes on Earth and there\'s a critter inside, of course, who runs around vaporizing people. And ripping off arms, etc. And he has a cool ray gun that he uses to vaporize people too, until it gets shot out of his hand. And that\'s really about it. "Alien Factor" beats this mess hands down, if you really want to see a good Don Dohler movie, check that out instead. And RIP Don Dohler, 12/2/06.',
  'label': 'neg',
  '__hfsplit__': 'test',
  'good': {
    'label': 'true',
    'created': datetime.datetime(2023, 9, 20, 10, 16, 15, 545277)
  }
}
```

Labels can also be added via the UI:

![image](https://github.com/lilacai/lilac/assets/1100749/ddfc4846-5d52-4c14-a24a-25133d448ed1)

What's changed
* Make '.' the default project. by nsthorat in https://github.com/lilacai/lilac/pull/701

Bug fixes
* Allow `add_labels` and `remove_labels` without selection by dsmilkov in https://github.com/lilacai/lilac/pull/698
* Fix UI regression and empty `lilac.yml` (no datasets) by dsmilkov in https://github.com/lilacai/lilac/pull/700


**Full Changelog**: https://github.com/lilacai/lilac/compare/v0.0.20...v0.1.0

