by jordanrendric · MCP Server · ★ 640
Claude Code Video Vision Give Claude the ability to watch and understand videos. A Claude Code plugin that extracts frames via ffmpeg and processes audio via multiple backends (Gemini API, local Whisper, or OpenAI API). Claude receives frames as images and audio transcription with timestamps — the plugin is a perception layer, not an interpretation layer.
| Stars | 640 |
| Forks | 77 |
| Language | TypeScript |
| Category | MCP Server |
| License | MIT |
| Quality Score | 55.92/100 |
| Open Issues | 10 |
| Last Updated | 2026-05-18 |
| Created | 2026-03-31 |
| Platforms | claude-code, gemini, mcp, node |
| Est. Tokens | ~1048k |
These tools work well together with claude-video-vision for enhanced workflows:
Explore other popular mcp server tools:
claude-video-vision is Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction and multimodal audio analysis. It is categorized as a MCP Server with 640 GitHub stars.
claude-video-vision is primarily written in TypeScript. It covers topics such as claude-code, claude-code-plugin, ffmpeg.
You can find installation instructions and usage details in the claude-video-vision GitHub repository at github.com/jordanrendric/claude-video-vision. The project has 640 stars and 77 forks, indicating an active community.
claude-video-vision is released under the MIT license, making it free to use and modify according to the license terms.