DeepEval

AI Developer Tools & Infra

The LLM evaluation framework. Pytest-like unit tests for LLMs with hallucination detection, RAG evaluation, and 50+ built-in metrics.

15,669

GitHub Stars

1h ago

Last Commit

271

Open Issues

Python

Language

Browse categories

LLMs & Foundation Models

AI Agents & Autonomous Workflows

RAG & Vector Search

Image & Video Generation

Voice & Audio AI

Code Assistants & Copilots

AI Developer Tools & Infra