by jjang-ai · MCP Server · ★ 521
vMLX - JANGTQ Uber Compressed MLX Models - L2 Disk Cache (survives restart) + L1 Paged (super fast ttft) + Hybrid SSM Scheduler + Cont Batching + etc!
| Stars | 521 |
| Forks | 64 |
| Language | Python |
| Category | MCP Server |
| License | Apache-2.0 |
| Quality Score | 34.75/100 |
| Open Issues | 23 |
| Last Updated | 2026-05-22 |
| Created | 2026-02-18 |
| Platforms | mcp, python |
| Est. Tokens | ~3962k |
These tools work well together with vmlx for enhanced workflows:
Explore other popular mcp server tools:
vmlx is vMLX - JANGTQ Uber Compressed MLX Models - L2 Disk Cache (survives restart) + L1 Paged (super fast ttft) + Hybrid SSM Scheduler + Cont Batching + etc!. It is categorized as a MCP Server with 521 GitHub stars.
vmlx is primarily written in Python. It covers topics such as anthropic-api, kvcache-compression, kvcache-optimization.
You can find installation instructions and usage details in the vmlx GitHub repository at github.com/jjang-ai/vmlx. The project has 521 stars and 64 forks, indicating an active community.
vmlx is released under the Apache-2.0 license, making it free to use and modify according to the license terms.