Robuta

https://ui.adsabs.harvard.edu/abs/2025arXiv251204032K/abstract
Jina-VLM: Small Multilingual Vision Language Model - ADS
We present Jina-VLM, a 2.4B parameter vision-language model that achieves state-of-the-art multilingual visual question answering among open 2B-scale VLMs. The...

https://arxiv.org/abs/2512.04032
[2512.04032] Jina-VLM: Small Multilingual Vision Language Model
Abstract page for arXiv paper 2512.04032: Jina-VLM: Small Multilingual Vision Language Model

https://jina.ai/news/jina-vlm-small-multilingual-vision-language-model/
Jina-VLM: Small Multilingual Vision Language Model
Dec 5, 2025 - New 2B vision language model achieves SOTA on multilingual VQA, no catastrophic forgetting on text-only tasks.

https://github.com/rednote-hilab/dots.ocr
GitHub - rednote-hilab/dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language...
Multilingual Document Layout Parsing in a Single Vision-Language Model - rednote-hilab/dots.ocr