by THUDM · Agent Tool · ★ 256
Towards Large Multimodal Models as Visual Foundation Agents
| Stars | 256 |
| Forks | 9 |
| Language | Python |
| Category | Agent Tool |
| License | Apache-2.0 |
| Quality Score | 39.2/100 |
| Open Issues | 16 |
| Last Updated | 2025-04-24 |
| Created | 2024-08-08 |
| Platforms | python |
| Est. Tokens | ~378k |
These tools work well together with VisualAgentBench for enhanced workflows:
Explore other popular agent tool tools:
VisualAgentBench is Towards Large Multimodal Models as Visual Foundation Agents. It is categorized as a Agent Tool with 256 GitHub stars.
VisualAgentBench is primarily written in Python. It covers topics such as gpt, llm-agent, multimodal-large-language-models.
You can find installation instructions and usage details in the VisualAgentBench GitHub repository at github.com/THUDM/VisualAgentBench. The project has 256 stars and 9 forks, indicating an active community.
VisualAgentBench is released under the Apache-2.0 license, making it free to use and modify according to the license terms.