agent-eval-ts

About agent-eval-ts

agent-eval-ts Evaluation framework for TypeScript AI agents — define suites, run batch evaluations, and report accuracy, latency, cost, and more. What is this? helps you measure and compare AI agent behavior: exact output checks, semantic similarity (bag-of-words cosine by default, or your own embeddings), JSON Schema validation, tool-call sequences, latency, token usage, and cost logging. It runs locally, produces JSON / Markdown / HTML / JUnit reports, supports optional LLM-as-judge (OpenAI-compatible), caching, multi-model comparison, and regression detection against a saved baseline.

ai-agents benchmarking evaluation llm testing typescript

Quick Facts

Stars	29
Forks	192
Language	TypeScript
Category	Agent Tool
License	MIT
Quality Score	63.9/100
Last Updated	2026-05-21
Created	2026-04-10
Platforms	docker, node
Est. Tokens	~7k

Compatible Skills

These tools work well together with agent-eval-ts for enhanced workflows:

promptfoo — semantic(0.23)+complementary+shared_fw(openai)+rare_topics+same_lang+shared_platform (61%)
skill-optimizer — semantic(0.19)+complementary+rare_topics+same_lang+similar_pop+shared_platform (56%)

More Agent Tool Tools

Explore other popular agent tool tools:

superpowers ⭐ 202.7k
AutoGPT ⭐ 184.5k
ollama ⭐ 172.0k
langflow ⭐ 148.7k
langchain ⭐ 137.4k
gstack ⭐ 100.8k
skills ⭐ 95.6k
browser-use ⭐ 95.0k
autoresearch ⭐ 82.7k
deer-flow ⭐ 69.1k

View all Agent Tool tools →

Popular TypeScript Agent Tools

openclaw ⭐ 374.0k · Codex Skill
n8n ⭐ 189.3k · MCP Server
dify ⭐ 142.3k · MCP Server
gemini-cli ⭐ 104.5k · MCP Server
gstack ⭐ 100.8k · Agent Tool

Frequently Asked Questions

What is agent-eval-ts?

agent-eval-ts is Agent evaluation & benchmarking for TypeScript: test suites, LLM metrics, caching, OpenAI-compatible judge, JUnit/HTML/MD reports, Docker, GitHub Actions.. It is categorized as a Agent Tool with 29 GitHub stars.

What programming language is agent-eval-ts written in?

agent-eval-ts is primarily written in TypeScript. It covers topics such as ai-agents, benchmarking, evaluation.

How do I install or use agent-eval-ts?

You can find installation instructions and usage details in the agent-eval-ts GitHub repository at github.com/Agent-Mem-Tools/agent-eval-ts. The project has 29 stars and 192 forks, indicating an active community.

What license does agent-eval-ts use?

agent-eval-ts is released under the MIT license, making it free to use and modify according to the license terms.

View on GitHub → Browse Agent Tool tools