We’re thrilled to announce the release of IndoxJudge—our LLM Evaluation App—now available in version 0.0.1 on pip! 🎉
This powerful tool is designed to help you evaluate and compare large language models with ease. IndoxJudge features:
✨ LLM Evaluation: Dive deep into various metrics to assess your models.
🛡️ Safety Evaluator: Ensure your LLMs meet safety standards.
📚 RAG Evaluator: Assess the effectiveness of Retrieval-Augmented Generation models.
⚖️ LLM Comparison: Compare different models side by side.
Get insightful metrics and generate beautiful plots to visualize your results. Perfect for researchers, developers, and AI enthusiasts!