Twitter/X

MiniCPM-V 4.6 (1.3B) launched by @OpenBMB on 2026-05-11 and uses LLaVA-UHD v4 to…

Brief

MiniCPM-V 4.6 (1.3B) is a high-resolution multimodal model from OpenBMB released 2026-05-11 that applies LLaVA-UHD v4 to cut vision encoding costs by 55%, claiming superior benchmark results versus Gemma4-E2B-it and Qwen3.5-0.8B while using just 2.5% of Qwen's token budget; OpenBMB reports 75.7 ms TTFT (2.2× faster on 3136² images) and ~1.5× token throughput on an RTX 4090, with model and demos available on Hugging Face, GitHub and Modelscope.

Why it matters

MiniCPM-V 4.6 (1.3B) launched by @OpenBMB on 2026-05-11 and uses LLaVA-UHD v4 to cut vision encoding costs by 55%, enabling native edge deployment and optimization for consumer-grade and mobile hardware.

Key details

  • OpenBMB claims MiniCPM-V 4.6 outperforms Gemma4-E2B-it and Qwen3.5-0.8B on key multimodal/Artificial Analysis benchmarks — scoring higher than Qwen3.5-0.8B while using just 2.5% of its token budget; reported TTFT = 75.7 ms (2.2× faster on 3136² images) and ~1.5× token throughput vs Qwen3.5-0.8B on a single RTX 4090.
  • Model and demos are publicly available: Hugging Face (openbmb/MiniCPM-V), GitHub (OpenBMB/MiniCPM-V), Modelscope, a Hugging Face web demo, and an app demo (links posted by OpenBMB).
Source evidence

1/5 MiniCPM-V 4.6 (1.3B) is now live 🚀🚀
High-res visual processing, optimized for consumer-grade and mobile hardware. We’ve leveraged the latest LLaVA-UHD v4 technique to cut vision encoding costs by 55%, enabling native edge deployment with extreme efficiency.
🔥 Beats Gemma4-E2B-it and Qwen3.5-0.8B across key multimodal and Artificial Analysis benchmarks — scoring higher than Qwen3.5-0.8B using just 2.5% of its token budget.
⚡ TTFT (75.7ms) 2.2x Faster than Qwen3.5-0.8B even with 3136² high-res images.
🏗️ ~1.5x Token Throughput compared with Qwen3.5-0.8B on a single RTX 4090.
Try the model here:
🤗 Hugging Face:
huggingface.co/openbmb/MiniC…
💻 GitHub:
github.com/OpenBMB/MiniCPM-V
🔭 Modelscope:
modelscope.cn/models/OpenBMB…
🌐 Web Demo:
huggingface.co/spaces/openbm…
📱 App Demo:
github.com/OpenBMB/MiniCPM-V…

Video