by philschmid · Agent Tool · ★ 107
Compendium of over 50 benchmarks for evaluating AI agents, categorized into Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction.
| Metric | Value |
| --- | --- |
| Stars | 107 |
| Forks | 9 |
| Category | Agent Tool |
| Quality Score | 32.2/100 |
| Open Issues | 2 |
| Last Updated | 2025-10-15 |
| Created | 2025-10-15 |
| Est. Tokens | ~3k |
ai-agent-benchmark-compendium is a compendium of over 50 benchmarks for evaluating AI agents, categorized into Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction. It is listed as an Agent Tool and has 107 GitHub stars.
Installation instructions and usage details are in the ai-agent-benchmark-compendium GitHub repository at github.com/philschmid/ai-agent-benchmark-compendium. The project has 107 stars and 9 forks.