VideoGLaMM

by mbzuai-oryx · Agent Tool · ★ 97

About VideoGLaMM

[CVPR 2025 🔥] A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

cvpr2025, foundation-models, llm-agent, lmm, vision-and-language, vision-language-model

Quick Facts

Stars: 97
Forks: 5
Language: Python
Category: Agent Tool
Quality Score: 35.25/100
Open Issues: 8
Last Updated: 2025-04-14
Created: 2024-10-31
Platforms: python
Est. Tokens: ~2718k

Frequently Asked Questions

What is VideoGLaMM?

VideoGLaMM is a large multimodal model for pixel-level visual grounding in videos (CVPR 2025). It is categorized as an Agent Tool and has 97 GitHub stars.

What programming language is VideoGLaMM written in?

VideoGLaMM is primarily written in Python. Its topics include cvpr2025, foundation-models, and llm-agent.

How do I install or use VideoGLaMM?

Installation instructions and usage details are in the VideoGLaMM GitHub repository at github.com/mbzuai-oryx/VideoGLaMM. The project has 97 stars, 5 forks, and 8 open issues.
