ai-agent-benchmark-compendium

by philschmid · Agent Tool · ★ 107

About ai-agent-benchmark-compendium

Compendium of over 50 benchmarks for evaluating AI agents, categorized into Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction.

Quick Facts

Stars107
Forks9
CategoryAgent Tool
Quality Score32.2/100
Open Issues2
Last Updated2025-10-15
Created2025-10-15
Est. Tokens~3k

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Frequently Asked Questions

What is ai-agent-benchmark-compendium?

ai-agent-benchmark-compendium is Compendium of over 50 benchmarks for evaluating AI agents, categorized into Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction.. It is categorized as a Agent Tool with 107 GitHub stars.

How do I install or use ai-agent-benchmark-compendium?

You can find installation instructions and usage details in the ai-agent-benchmark-compendium GitHub repository at github.com/philschmid/ai-agent-benchmark-compendium. The project has 107 stars and 9 forks, indicating an active community.

View on GitHub → Browse Agent Tool tools