Question 1

What is tool-eval-bench?

Accepted Answer

tool-eval-bench is Tool-calling quality benchmark for LLM serving stacks. 65+ deterministic scenarios testing multi-turn orchestration, safety boundaries, and structured output. Supports vLLM, LiteLLM, and llama.cpp.. It is categorized as a Agent Tool with 67 GitHub stars.

Question 2

What programming language is tool-eval-bench written in?

Accepted Answer

tool-eval-bench is primarily written in Python.

Question 3

How do I install or use tool-eval-bench?

Accepted Answer

You can find installation instructions and usage details in the tool-eval-bench GitHub repository at github.com/SeraphimSerapis/tool-eval-bench. The project has 67 stars and 7 forks, indicating an active community.

Question 4

What license does tool-eval-bench use?

Accepted Answer

tool-eval-bench is released under the MIT license, making it free to use and modify according to the license terms.

Stars	67
Forks	7
Language	Python
Category	Agent Tool
License	MIT
Quality Score	36.2/100
Last Updated	2026-05-20
Created	2026-04-17
Platforms	python
Est. Tokens	~99k

tool-eval-bench

About tool-eval-bench

Quick Facts

Compatible Skills

More Agent Tool Tools

Popular Python Agent Tools

Frequently Asked Questions