3/31/2024
•
EN
Task-Specific LLM Evals that Do & Don't Work
A guide to effective and ineffective evaluation methods for LLMs on tasks like classification, summarization, and translation, including practical metrics.