Robuta

https://arxiv.org/abs/2509.22186 [2509.22186] MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document... Abstract page for arXiv paper 2509.22186: MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing vision language model