DeepEval

The LLM evaluation framework: pytest-like unit tests for LLMs, with hallucination detection, RAG evaluation, and 50+ built-in metrics.

GitHub Stars: 15,669
Last Commit: 1h ago
Open Issues: 271
Language: Python
