AgentBench

by THUDM · Agent Tool · ★ 3.2k

About AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

chatgpt · gpt-4 · llm · llm-agent

Quick Facts

Stars: 3,214
Forks: 240
Language: Python
Category: Agent Tool
License: Apache-2.0
Quality Score: 40.7/100
Open Issues: 68
Last Updated: 2026-02-08
Created: 2023-07-28
Platforms: python
Est. Tokens: ~1970k

Frequently Asked Questions

What is AgentBench?

AgentBench is a comprehensive benchmark for evaluating LLMs as agents, published at ICLR'24. It is categorized as an Agent Tool and has 3.2k GitHub stars.

What programming language is AgentBench written in?

AgentBench is primarily written in Python. Its repository topics are chatgpt, gpt-4, llm, and llm-agent.

How do I install or use AgentBench?

Installation instructions and usage details are in the AgentBench GitHub repository at github.com/THUDM/AgentBench. The project has 3.2k stars and 240 forks, which suggests broad community interest.

What license does AgentBench use?

AgentBench is released under the Apache-2.0 license, so it can be used and modified subject to that license's terms.
