Latest version: v0.0.1
Lean, modular reward functions for RL training with LLMs