MMLU-Pro Leaderboard 🥇
More advanced and challenging multi-task evaluation