AI benchmarks
General
- Models Table (lifearchitect.ai)
Methods
- AidanBench: Evaluating Novel Idea Generation on Open-Ended Questions (code)
- ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning. Assesses reasoning using logic-grid puzzles of tunable complexity (see the sketch below).
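ZebraLogic's core idea is that logic-grid ("zebra") puzzles scale in difficulty with the number of entities and attributes, since the search space grows as (n!)^k. The sketch below is illustrative only (not the ZebraLogic codebase): it generates a tiny positional-clue puzzle with a brute-force uniqueness check, and difficulty is tuned via n_houses and n_attributes.

```python
# Illustrative ZebraLogic-style puzzle generator (not the official code).
# Complexity is tunable: the solution space is (n_houses!)^n_attributes.
import itertools
import random

def count_solutions(clues, values):
    """Brute force: count assignments consistent with all clues."""
    count = 0
    for assignment in itertools.product(
            *[itertools.permutations(v) for v in values]):
        if all(assignment[k][h] == val for k, h, val in clues):
            count += 1
    return count

def make_puzzle(n_houses=3, n_attributes=2, seed=0):
    rng = random.Random(seed)
    # Ground truth: for each attribute, a random permutation of its values.
    values = [[f"a{k}v{i}" for i in range(n_houses)]
              for k in range(n_attributes)]
    truth = [rng.sample(v, n_houses) for v in values]  # truth[k][house]
    # Start from all positional clues ("value X is in house h"), then
    # greedily drop clues while the solution stays unique.
    clues = [(k, h, truth[k][h])
             for k in range(n_attributes) for h in range(n_houses)]
    rng.shuffle(clues)
    kept = list(clues)
    for clue in clues:
        trial = [c for c in kept if c != clue]
        if count_solutions(trial, values) == 1:
            kept = trial
    return kept, truth

clues, truth = make_puzzle(n_houses=3, n_attributes=2)
print(f"{len(clues)} clues, solution: {truth}")
```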
Task Length
- 2020-09: Ajeya Cotra: Draft report on AI timelines
- 2025-03: Measuring AI Ability to Complete Long Tasks (the 50% time-horizon metric is sketched below)
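The METR paper summarizes model capability as a "50% time horizon": fit success probability against the (log) length of time the task takes a human, then report the task length at which predicted success drops to 50%. A minimal sketch of that calculation, using made-up data and plain gradient-ascent logistic regression:

```python
# Sketch of the 50% time-horizon metric from "Measuring AI Ability to
# Complete Long Tasks". The data points below are hypothetical.
import math

# (human task length in minutes, did the model succeed?)
results = [(1, 1), (2, 1), (4, 1), (8, 1), (15, 0), (30, 1),
           (60, 0), (120, 0), (240, 0), (480, 0)]

def fit_logistic(data, lr=0.1, steps=5000):
    """Logistic regression of success on log2(task length)."""
    a, b = 0.0, 0.0  # P(success) = sigmoid(a + b * log2(minutes))
    for _ in range(steps):
        ga = gb = 0.0
        for minutes, y in data:
            x = math.log2(minutes)
            p = 1.0 / (1.0 + math.exp(-(a + b * x)))
            ga += (y - p)          # d(log-likelihood)/da
            gb += (y - p) * x      # d(log-likelihood)/db
        a += lr * ga / len(data)
        b += lr * gb / len(data)
    return a, b

a, b = fit_logistic(results)
# 50% horizon: sigmoid(a + b*x) = 0.5  =>  x = -a/b  =>  minutes = 2**(-a/b)
horizon = 2 ** (-a / b)
print(f"~{horizon:.0f}-minute 50% time horizon")
```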
Assess Specific Attributes
Various
- LMSYS: Human preference ranking leaderboard
- Tracking AI: "IQ" leaderboard
- LiveBench: A Challenging, Contamination-Free LLM Benchmark
- LLM Thematic Generalization Benchmark
Hallucination
- Vectara Hallucination Leaderboard
- LLM Confabulation (Hallucination) Leaderboard for RAG (a toy groundedness check is sketched below)
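These leaderboards score whether model outputs stay grounded in the provided source text. As a crude illustration of the signal being measured (real judges such as Vectara's HHEM are trained models, not word overlap), the sketch below flags summary sentences that share almost no content words with the source:

```python
# Toy groundedness check, illustrating the kind of signal hallucination
# leaderboards measure (real judges use trained NLI-style models).
import re

def content_words(text):
    stop = {"the", "a", "an", "is", "are", "was", "were", "of", "to",
            "in", "and", "or", "that", "it", "on", "for", "with", "as", "by"}
    return {w for w in re.findall(r"[a-z']+", text.lower()) if w not in stop}

def flag_unsupported(source, summary, threshold=0.3):
    """Return summary sentences whose content words barely overlap the source."""
    src = content_words(source)
    flagged = []
    for sent in re.split(r"(?<=[.!?])\s+", summary.strip()):
        words = content_words(sent)
        support = len(words & src) / max(len(words), 1)
        if support < threshold:
            flagged.append((sent, support))
    return flagged

source = "The company reported quarterly revenue of four billion dollars."
summary = "Revenue was four billion dollars. The CEO resigned in protest."
print(flag_unsupported(source, summary))  # flags the unsupported CEO claim
```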
Software/Coding
Visual
- 2025-03: Can Large Vision Language Models Read Maps Like a Human? (MapBench)
Creativity
- 2024-10: AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
- 2024-11: AidanBench: Evaluating Novel Idea Generation on Open-Ended Questions (code)
- 2024-12: LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context
- LLM Creative Story-Writing Benchmark
Reasoning
- ENIGMAEVAL: "reasoning" leaderboard (paper)
Assistant/Agentic
- GAIA: a benchmark for General AI Assistants
- Galileo AI Agent Leaderboard
- Smolagents LLM Leaderboard: LLMs powering agents
Science
See: Science Benchmarks