chinese-llm-benchmark

by jeinlee1991 · Agent Tool · ★ 6.0k

About chinese-llm-benchmark

ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.6、ernie4.5、MiniMax-M2.7、deepseek-v4、Qwen3.6、llama4、智谱GLM-5.1、MiMo-V2、LongCat、gemma4、mistral等开源大模型。不仅提供排行榜,也提供规模超200万的大模型缺陷库!方便广大社区研究分析、改进大模型。

agentic-aiartificial-intelligencellm-agentllm-evaluation

Quick Facts

Stars6,037
Forks244
CategoryAgent Tool
Quality Score31.7/100
Open Issues13
Last Updated2026-05-21
Created2023-06-04
Platformsclaude-code, gemini
Est. Tokens~2147484k

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Frequently Asked Questions

What is chinese-llm-benchmark?

chinese-llm-benchmark is ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.6、ernie4.5、MiniMax. It is categorized as a Agent Tool with 6.0k GitHub stars.

How do I install or use chinese-llm-benchmark?

You can find installation instructions and usage details in the chinese-llm-benchmark GitHub repository at github.com/jeinlee1991/chinese-llm-benchmark. The project has 6.0k stars and 244 forks, indicating an active community.

View on GitHub → Browse Agent Tool tools