6/28/2024
•
EN
Evaluating Open LLMs with MixEval: The Closest Benchmark to LMSYS Chatbot Arena
Introduces MixEval, a cost-effective LLM benchmark with high correlation to Chatbot Arena, for evaluating open-source language models.