Difference between revisions of "AI benchmarks"

From GISAXS
 
* [https://www.galileo.ai/blog/agent-leaderboard Galileo AI] [https://huggingface.co/spaces/galileo-ai/agent-leaderboard Agent Leaderboard]
 
* [https://huggingface.co/spaces/smolagents/smolagents-leaderboard Smolagents LLM Leaderboard]: LLMs powering agents
 
==Science==
See: [[Science_Agents#Science_Benchmarks|Science Benchmarks]]

Latest revision as of 12:39, 13 March 2025