What are AI Evals?
Explains AI evals: automated checks for non-deterministic AI outputs that use LLMs to score responses against expectations rather than exact matches.
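As a taste of what such a check can look like, here is a minimal sketch of an LLM-as-judge eval. It assumes the OpenAI Python client; the `judge` helper, the model name, and the rubric wording are illustrative assumptions, not code from the post itself.

```python
# Minimal LLM-as-judge sketch: instead of asserting an exact string match,
# ask a second model to grade the output against the stated expectation.
# Assumes the OpenAI Python client; model name and rubric are illustrative.
from openai import OpenAI

client = OpenAI()

JUDGE_PROMPT = """You are grading an AI assistant's answer.
Question: {question}
Expected qualities: {expectation}
Answer under test: {answer}

Reply with exactly PASS or FAIL."""

def judge(question: str, expectation: str, answer: str) -> bool:
    """Return True if the judge model scores the answer as meeting expectations."""
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed judge model; swap in your own
        messages=[{"role": "user", "content": JUDGE_PROMPT.format(
            question=question, expectation=expectation, answer=answer)}],
        temperature=0,  # keep the grader as deterministic as possible
    )
    return resp.choices[0].message.content.strip().upper().startswith("PASS")

# A traditional assertEqual would fail on any paraphrase;
# the judge passes any answer that correctly names Paris.
assert judge(
    question="What is the capital of France?",
    expectation="Correctly identifies Paris as the capital",
    answer="France's capital city is Paris.",
)
```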
Explores the unique challenges of testing Generative AI and Large Language Models, and how this differs from traditional software testing.
Introduces AlignEval, an app for building and automating LLM evaluators, making the process easier and more data-driven.
Call for participation in WAIT #3, a peer conference on AI in software testing, seeking experienced testers to share and evaluate real-world AI testing experiences.