Paper-qa

Latest version: v5.20.0

Safety actively analyzes 724166 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 4 of 25

5.8.0

What's Changed
* Update all non-major dependencies by renovate in https://github.com/Future-House/paper-qa/pull/745
* Created `dev` extra for convenience by jamesbraza in https://github.com/Future-House/paper-qa/pull/750
* Update all non-major dependencies by renovate in https://github.com/Future-House/paper-qa/pull/754
* Populated `LICENSE` by jamesbraza in https://github.com/Future-House/paper-qa/pull/756
* Add partitioning func capabilities to allow doc-types-based embedding ranking by mskarlin in https://github.com/Future-House/paper-qa/pull/752
* Exposed seeding of LitQA2 read and shuffling by jamesbraza in https://github.com/Future-House/paper-qa/pull/758


**Full Changelog**: https://github.com/Future-House/paper-qa/compare/v5.7.0...v5.8.0

5.7.0

What's Changed

* Moved `README` to use `session` over `answer` by jamesbraza in https://github.com/Future-House/paper-qa/pull/741
* Moved `Docs.aadd` to support `str | os.PathLike` by jamesbraza in https://github.com/Future-House/paper-qa/pull/742
* Cleared up 'Adding Documents Manually' docs by jamesbraza in https://github.com/Future-House/paper-qa/pull/740
* Support env states with custom status functions by mskarlin in https://github.com/Future-House/paper-qa/pull/743
* Update astral-sh/setup-uv action to v4 by renovate in https://github.com/Future-House/paper-qa/pull/746
* Moved JSON summary prompt to mention score is an integer by jamesbraza in https://github.com/Future-House/paper-qa/pull/748

**Full Changelog**: https://github.com/Future-House/paper-qa/compare/v5.6.0...v5.7.0

5.6.1

**Full Changelog**: https://github.com/Future-House/paper-qa/compare/v5.6.0...v5.6.1

5.6.0

Highlights

This release is mainly a bunch of bug fixes:

- Pulling in breaks in upstream dependencies (e.g. Pydantic 2.10, aviary 0.10.1)
- Makes `GradablePaperQAEnvironment`'s evaluations robust to an empty answer or multiple answers

Due to the introduction of `Complete.NO_ANSWER_PHRASE` in https://github.com/Future-House/paper-qa/pull/726 it was requested we consider this a minor version bump, as it will impact system performance.

What's Changed

* Fixed settings `session` into `EnvironmentState`, and suppressing PyMuPDF derived `DeprecationWarning` by jamesbraza in https://github.com/Future-House/paper-qa/pull/713
* Adding assertion `gather_evidence` doesn't populate `session.answer` by jamesbraza in https://github.com/Future-House/paper-qa/pull/716
* Lock file maintenance by renovate in https://github.com/Future-House/paper-qa/pull/715
* Fixes `gather_with_concurrency` typing by maykcaldas in https://github.com/Future-House/paper-qa/pull/714
* Latest tooling dependencies by jamesbraza in https://github.com/Future-House/paper-qa/pull/719
* Lock file maintenance by renovate in https://github.com/Future-House/paper-qa/pull/718
* Fixed `EVAL_PROMPT_TEMPLATE` to handle empty string or multiple match answers by jamesbraza in https://github.com/Future-House/paper-qa/pull/724
* Address missing `GenerateAnswer` in trajectories, no answers after `Complete` tools, and better history by mskarlin in https://github.com/Future-House/paper-qa/pull/726
* Pulling in latest `aviary` for `concurrency` rename by jamesbraza in https://github.com/Future-House/paper-qa/pull/728
* Pulling in latest `aviary` for dependencies fix, and retrying flaky `test_propagate_options` more by jamesbraza in https://github.com/Future-House/paper-qa/pull/729
* Pulling in latest `ldp` for `Callback.before_rollout` by jamesbraza in https://github.com/Future-House/paper-qa/pull/734
* Documenting why we don't handle evaluation failures in `GradablePaperQAEnvironment.step` by jamesbraza in https://github.com/Future-House/paper-qa/pull/738
* Created `LitQAEvaluation.calculate_accuracy_precision` utility by jamesbraza in https://github.com/Future-House/paper-qa/pull/733
* Refreshed test cassettes, fixed flaky test `test_search`, and fixed test type ignores by jamesbraza in https://github.com/Future-House/paper-qa/pull/739
* Unpins pydantic >2.10.2 requirement, removes TYPE_CHECKING by nadolskit in https://github.com/Future-House/paper-qa/pull/725
* Lock file maintenance by renovate in https://github.com/Future-House/paper-qa/pull/737
* Alternative maybe is text by loesinghaus in https://github.com/Future-House/paper-qa/pull/717

New Contributors

* maykcaldas made their first contribution in https://github.com/Future-House/paper-qa/pull/714
* loesinghaus made their first contribution in https://github.com/Future-House/paper-qa/pull/717

**Full Changelog**: https://github.com/Future-House/paper-qa/compare/v5.5.0...v5.6.0

5.5.1

**Full Changelog**: https://github.com/Future-House/paper-qa/compare/v5.5.0...v5.5.1

5.5.0

Highlights

In all of v5 before this release, we defined the presence of 1+ answer generations not containing the substring `"cannot answer"` as the agent loop's end. However, this (suboptimally) leads to the agent loop terminating early on partial answers like "Based on the sources provided, it appears no one has done x." We realized this, and have resolved this issue by:

- No longer coupling our done condition with the substring `"cannot answer"` being not present in 1+ generated answers
- No longer implicitly depending on clients mentioning this `"cannot answer"` sentinel in the input `qa` prompt

We also fixed several (bad) bugs:

- We support parallel tool calling (2+ `ToolCall`s in one `action: ToolRequestMessage`). However, our tools (notably `gather_evidence`) are not actually concurrent-safe. Our tool schemae instructed not to call certain tools in parallel, nonetheless we observed agents specifying `gather_evidence` to be called in parallel. So now we force our tools to be non-concurrently executed to work around this race condition
- When using `LitQAEvaluation` and the same `GradablePaperQAEnvironment` 2+ times, we repeatedly added the "unsure" option to the target multiple choice question, degrading performance over time
- When using `PaperQAEnvironment` 2+ times, each `reset` was not properly wiping the `Docs` object
- The reward distribution of `LitQAEvaluation` was mixing up "unsure" reward of `0.1` with the "incorrect" reward of `-1.0`, not properly incentivizing learning

There are a bunch of other minor features, cleanups, and bugfixes here too, see the full list below.

What's Changed

* Deprecation cycle for `AgentSettings.should_pre_search` by jamesbraza in https://github.com/Future-House/paper-qa/pull/679
* Moved agent prompts to `prompts.py` by jamesbraza in https://github.com/Future-House/paper-qa/pull/681
* Refactor to remove `skip_system` from `LLMModel.run_prompt` by jamesbraza in https://github.com/Future-House/paper-qa/pull/680
* Resolving `evidence_detailed_citations` and `Answer` deprecations by jamesbraza in https://github.com/Future-House/paper-qa/pull/682
* Fixed agent prompt names and contents after 681 mess up by jamesbraza in https://github.com/Future-House/paper-qa/pull/683
* Removed `tool_names` validation for `gen_answer` being present by jamesbraza in https://github.com/Future-House/paper-qa/pull/685
* Fixing `test_evaluation` logic bugs by jamesbraza in https://github.com/Future-House/paper-qa/pull/686
* Removed `GenerateAnswer.FAILED_TO_ANSWER` as its unnecessary by jamesbraza in https://github.com/Future-House/paper-qa/pull/691
* Allowing serialized `Settings` in `get_settings` by jamesbraza in https://github.com/Future-House/paper-qa/pull/688
* Fixed LDP runner's `TRUNCATED` not calling `gen_answer`, and documented `AgentStatus` by jamesbraza in https://github.com/Future-House/paper-qa/pull/690
* Removed `gen_answer`'s dead argument `question` by jamesbraza in https://github.com/Future-House/paper-qa/pull/689
* Making sure we copy distractors by sidnarayanan in https://github.com/Future-House/paper-qa/pull/694
* Created `complete` tool to allow unsure answers by jamesbraza in https://github.com/Future-House/paper-qa/pull/684
* Added missing `test_from_question` cassette by jamesbraza in https://github.com/Future-House/paper-qa/pull/696
* Moved `fake` agent to LLM propose `complete` tool by jamesbraza in https://github.com/Future-House/paper-qa/pull/695
* Default to ordered tool calls, w env variable control by mskarlin in https://github.com/Future-House/paper-qa/pull/697
* Lock file maintenance by renovate in https://github.com/Future-House/paper-qa/pull/699
* Refactored `TestGradablePaperQAEnvironment` for DRY code by jamesbraza in https://github.com/Future-House/paper-qa/pull/702
* Fixing `PaperQAEnvironment.reset` respecting `mmr_lambda` and `text_hashes` by jamesbraza in https://github.com/Future-House/paper-qa/pull/703
* Removed `"cannot answer"` literals and added `reset` tool by jamesbraza in https://github.com/Future-House/paper-qa/pull/698
* Update all non-major dependencies by renovate in https://github.com/Future-House/paper-qa/pull/705
* Fixing `LitQAEvaluation` bugs: incorrect reward indices, not using LLM's native knowledge by jamesbraza in https://github.com/Future-House/paper-qa/pull/708
* Adding filters to paper-qa Docs by whitead in https://github.com/Future-House/paper-qa/pull/707
* Fixed mutably defaulted `NumpyVectorStore.texts` by jamesbraza in https://github.com/Future-House/paper-qa/pull/711

**Full Changelog**: https://github.com/Future-House/paper-qa/compare/v5.4.0...v5.5.0

Page 4 of 25

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.