We’re excited to release LFM2-VL-3B, the newest and most capable addition to our family of vision LFMs (450M and 1.6B). Built on the LFM2-2.6B backbone, this 3B parameter model targets applications that require more accuracy while maintaining the speed advantage of the LFM2 architecture. It is available today on LEAP and Hugging Face.
Flexible Architecture
.png)
LFM2-VL-3B follows the recipe adopted for our previous VLMs. It builds on our most powerful dense model, LFM2-2.6B, and integrates a SigLIP2 400M NaFlex encoder. This enables image processing at native resolutions with variable aspect ratios. Its flexible architecture allows developers to balance performance and speed by adjusting the number of vision tokens per image. This offers finer control for deployment, especially in edge environments.
You can find more information about the architecture in our LFM2-VL blog post.
Broader Capabilities

LFM2-VL-3B delivers competitive results across open-source evaluations, achieving an impressive 51.8% on MM-IFEval and 71.4% on RealWorldQA. The model shows strong performance in single- and multi-image comprehension and English OCR, with low hallucination rates on the POPE benchmark.
It maintains comparable language-only knowledge benchmark scores to its backbone, LFM2-2.6B, with 30% on GPQA and 63% on MMLU. In addition, we have significantly expanded multilingual capabilities, extending visual understanding beyond English to include Japanese, French, Spanish, German, Italian, Portuguese, Arabic, Chinese, and Korean.
Open and Available
LFM2-VL-3B is now available on Hugging Face under our LFM Open License, and through our LEAP platform, making cutting-edge efficient AI accessible to developers and researchers worldwide.
The LFM2 series continues to push the boundaries of efficient AI. We're proving that with the right architecture and approach, smaller models can deliver enterprise-grade performance without the computational overhead. In the future, we will continue to scale our foundation models to bring this level of efficiency to more devices and unlock new applications.