Phantom-wiki

Latest version: v0.5.2

Safety actively analyzes 723217 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

0.5.2

What's Changed
* Package for release. `phantom_wiki.generate_dataset(**kwargs)` 258 and CLI commands `phantom-wiki-generate --flags` 261. Make package version dynamic 265 266
* Many flags changed in 258
* Demo notebook 252

Evaluation Pipeline Changes (independent of dataset release versions)
* Functionality to load data from local folder https://github.com/kilian-group/phantom-wiki/pull/252 and https://github.com/kilian-group/phantom-wiki/pull/246
* Async agent implementation https://github.com/kilian-group/phantom-wiki/pull/156
* Refactor agent.py to separate files https://github.com/kilian-group/phantom-wiki/pull/245
* Work on prompting with Prolog 242 249

**Full Changelog**: https://github.com/kilian-group/phantom-wiki/compare/v0.5.1...v0.5.2

0.5.1

Faster dataset generation and solution traces. No changes to contents of generated datasets.

What's Changed
* New flag `--use-multithreading` for faster phantom-wiki dataset generation https://github.com/kilian-group/phantom-wiki/pull/209. Flag works on linux and windows. Allows concurrent prolog database queries from multiple processes.
* Save solution traces to generated dataset json files https://github.com/kilian-group/phantom-wiki/pull/194.
* Save prolog facts when generating datasets https://github.com/kilian-group/phantom-wiki/pull/241
* Updates to README, python package dependencies https://github.com/kilian-group/phantom-wiki/pull/199, github workflow for pypi publish https://github.com/kilian-group/phantom-wiki/pull/227, huggingface model card https://github.com/kilian-group/phantom-wiki/pull/190, pre-commit config https://github.com/kilian-group/phantom-wiki/pull/208, license https://github.com/kilian-group/phantom-wiki/pull/212
* Cleanup https://github.com/kilian-group/phantom-wiki/pull/228

Evaluation Pipeline Changes (independent of dataset release versions)
* Improve tables and figures for paper, https://github.com/kilian-group/phantom-wiki/pull/186 .
* Add CoT https://github.com/kilian-group/phantom-wiki/pull/214, Nshot-RAG https://github.com/kilian-group/phantom-wiki/pull/181, CoT-RAG, React https://github.com/kilian-group/phantom-wiki/pull/239 support for deepseek-r1.
* Add RAG support for API-based models https://github.com/kilian-group/phantom-wiki/pull/187.
* Refactor files in `src/phantom_eval/` https://github.com/kilian-group/phantom-wiki/pull/201 https://github.com/kilian-group/phantom-wiki/pull/196.
* Refactor bash scripts in `eval/` https://github.com/kilian-group/phantom-wiki/pull/222.
* Testing how LLMs generate prolog queries to answer phantom-wiki questions https://github.com/kilian-group/phantom-wiki/pull/189 https://github.com/kilian-group/phantom-wiki/pull/235.

**Full Changelog**: https://github.com/kilian-group/phantom-wiki/compare/v0.5...v0.5.1

0.5

What's Changed
* Sample questions with backward chaining, much faster by anmolkabra in https://github.com/albertgong1/phantom-wiki/pull/174

Evaluation Pipeline Changes (independent of dataset release versions)
* Update evals to use phantom-wiki-v0.5 dataset by anmolkabra in https://github.com/albertgong1/phantom-wiki/pull/175
* PR for Rag branch by xforeverjlx in https://github.com/albertgong1/phantom-wiki/pull/171
* Merging reasoning agent R1 distill qwen 32b into main by xforeverjlx in https://github.com/albertgong1/phantom-wiki/pull/176
* Add --inf_usage_tier and update tier limits of GPT4o, Gemini by anmolkabra in https://github.com/albertgong1/phantom-wiki/pull/177


**Full Changelog**: https://github.com/albertgong1/phantom-wiki/compare/v0.4...v0.5

0.4

- 166 re-added aggregation questions to question generation pipeline. Bumped dataset version from 0.3 to 0.4 since LLM prompts will need to change.
- Changes in LLM evaluation are not tracked by releases.

Links

Releases

Has known vulnerabilities

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.