Latest version: v0.8.0
The information on this page was curated by experts in our Cybersecurity Intelligence Team.
Automatically caption images using various LLaVA multimodal models. This tool processes images with state-of-the-art vision language models to generate accurate, high-quality captions.
No known vulnerabilities found