deployed-models Models deployed on HuggingFace or RunPods. PatronusAI/glider Text Generation • 4B • Updated Jan 2, 2025 • 166 • 41
glider-eval-suite PatronusAI/glider-hh-alignment-suite Viewer • Updated Dec 6, 2024 • 178 • 3 PatronusAI/glider-mt-bench-suite Viewer • Updated Dec 6, 2024 • 2.58k • 4 PatronusAI/glider_summeval_suite Viewer • Updated Dec 4, 2024 • 6.4k • 9 PatronusAI/glider-flask-eval-suite Viewer • Updated Dec 18, 2024 • 2k • 10 • 1
hallucination-detection-results-openai PatronusAI/openai-gpt-4-turbo-covidqa-generations Viewer • Updated Jul 9, 2024 • 1k • 4 PatronusAI/openai-gpt-4o-covidqa-generations Viewer • Updated Jul 9, 2024 • 1k PatronusAI/openai-gpt-3.5-turbo-drop-generations Viewer • Updated Jul 9, 2024 • 1k • 2 PatronusAI/openai-gpt-4-turbo-drop-generations Viewer • Updated Jul 9, 2024 • 1k
BLUR A benchmark for tip-of-the-tongue search and reasoning. PatronusAI/BLUR Viewer • Updated Mar 26, 2025 • 350 • 3 • 11 Running 3 BLUR Leaderboard 🌍 3 BLUR leaderboard.
hallucination-detection-results-patronusai-lynx-70b-instruct PatronusAI/lynx-70b-instruct-covidqa-generations Viewer • Updated Jul 8, 2024 • 1k • 4 PatronusAI/lynx-70b-instruct-drop-generations Viewer • Updated Jul 8, 2024 • 1k PatronusAI/lynx-70b-instruct-financebench-generations Viewer • Updated Jul 8, 2024 • 1k • 2 PatronusAI/lynx-70b-instruct-halueval-generations Viewer • Updated Jul 8, 2024 • 10k • 2
deployed-models Models deployed on HuggingFace or RunPods. PatronusAI/glider Text Generation • 4B • Updated Jan 2, 2025 • 166 • 41
BLUR A benchmark for tip-of-the-tongue search and reasoning. PatronusAI/BLUR Viewer • Updated Mar 26, 2025 • 350 • 3 • 11 Running 3 BLUR Leaderboard 🌍 3 BLUR leaderboard.
glider-eval-suite PatronusAI/glider-hh-alignment-suite Viewer • Updated Dec 6, 2024 • 178 • 3 PatronusAI/glider-mt-bench-suite Viewer • Updated Dec 6, 2024 • 2.58k • 4 PatronusAI/glider_summeval_suite Viewer • Updated Dec 4, 2024 • 6.4k • 9 PatronusAI/glider-flask-eval-suite Viewer • Updated Dec 18, 2024 • 2k • 10 • 1
hallucination-detection-results-patronusai-lynx-70b-instruct PatronusAI/lynx-70b-instruct-covidqa-generations Viewer • Updated Jul 8, 2024 • 1k • 4 PatronusAI/lynx-70b-instruct-drop-generations Viewer • Updated Jul 8, 2024 • 1k PatronusAI/lynx-70b-instruct-financebench-generations Viewer • Updated Jul 8, 2024 • 1k • 2 PatronusAI/lynx-70b-instruct-halueval-generations Viewer • Updated Jul 8, 2024 • 10k • 2
hallucination-detection-results-openai PatronusAI/openai-gpt-4-turbo-covidqa-generations Viewer • Updated Jul 9, 2024 • 1k • 4 PatronusAI/openai-gpt-4o-covidqa-generations Viewer • Updated Jul 9, 2024 • 1k PatronusAI/openai-gpt-3.5-turbo-drop-generations Viewer • Updated Jul 9, 2024 • 1k • 2 PatronusAI/openai-gpt-4-turbo-drop-generations Viewer • Updated Jul 9, 2024 • 1k