https://embeddedvisionsummit.com/vlm-training/
Nov 14, 2025 - An intensive training session designed to introduce the latest techniques in vision-language models (VLMs) plus their integration with traditional computer...
vision language modeltrainingembeddedsummit
https://openreview.net/forum?id=Cxj4jZ7mRI&referrer=%5Bthe%20profile%20of%20Zhiyu%20Huang%5D(%2Fprofile%3Fid%3D~Zhiyu_Huang2)
Recent advancements in Vision-Language-Action (VLA) models have shown promise for end-to-end autonomous driving by leveraging world knowledge and reasoning...
a visionlanguageactionmodelend
https://www.securityinfowatch.com/ai/product/55332552/ambientai-ambientai-launches-pulsar-a-new-vision-language-model-for-physical-security
Ambient.ai has introduced Pulsar, a new vision-language model that brings agentic monitoring, investigation, and real-time decision support to enterprise...
vision language modelambientailaunchespulsar
https://hunyuanocr.org/privacy
Hunyuan OCR is a leading end-to-end OCR expert VLM powered by Hunyuan's native multimodal architecture. With 1B parameters, it achieves state-of-the-art...
vision language modelhunyuanocrendexpert
https://openreview.net/forum?id=bDkisS75zy&referrer=%5Bthe%20profile%20of%20Xingjian%20He%5D(%2Fprofile%3Fid%3D~Xingjian_He1)
Due to the limited scale and quality of video-text training corpus, most vision-language foundation models employ image-text datasets for pretraining and...
foundation modelcosaconcatenatedsamplevision
https://openreview.net/forum?id=uq5XvfLLGG&referrer=%5Bthe%20profile%20of%20Yong-Ju%20Lee%5D(%2Fprofile%3Fid%3D~Yong-Ju_Lee2)
Despite emerging efforts to enhance the safety of Vision-Language Models (VLMs), current approaches face two main shortcomings. 1) Existing safety-tuning...
vision language modelholisticsafetybenchmarkingmodeling
https://aclanthology.org/2024.findings-acl.140/
Xinnong Zhang, Haoyu Kuang, Xinyi Mou, Hanjia Lyu, Kun Wu, Siming Chen, Jiebo Luo, Xuanjing Huang, Zhongyu Wei. Findings of the Association for Computational...
vision language modelsocial medialargeprocessing