Docs
Github
Blog

$ the open-source LLM evaluation framework

Get Started
Delivered by
Confident AI
Regression Testing for LLMs

LLM evaluation metrics to unit test LLM outputs in Python

Hyperparameter Discovery

Gain insights to quickly iterate towards optimal hyperparameters

Integrate with Popular Frameworks

Evaluate existing LLM applications built with other frameworks

Documentation
  • Introduction
Community
  • GitHub
  • Discord
  • Newsletter
Copyright © 2024 Confident AI Inc. Built with ❤️ and confidence.