Compare LLM performance on multiple-choice questions using Hugging Face models.
Format: Each line should have: Question,Correct Answer,Choice1,Choice2,Choice3
Question,Correct Answer,Choice1,Choice2,Choice3
๐ก Features:
Enter the delimiter used in your dataset:
Format Requirements:
โ ๏ธ Note:
Results will appear here...
Detailed results will appear here...
This tool loads and runs HuggingFace models for evaluation:
๐๏ธ How it works:
โก Performance Tips:
๐ง Supported Models: