Sponsor of the Day:
Jerkmate
https://simonwillison.net/2024/Dec/24/qvq/
Trying out QvQ—Qwen’s new visual reasoning model
I thought we were done for major model releases in 2024, but apparently not: Alibaba’s Qwen team just dropped the Apache 2.0 licensed Qwen licensed (the...
new visualreasoning modeltrying
https://www.tvbeurope.com/live-production/ptzoptics-aims-to-make-video-more-actionable-with-visual-reasoning-initiative
PTZOptics aims to make video more 'actionable' with Visual Reasoning initiative - TVBEurope
Feb 20, 2026 - The initiative combines PTZOptics’ robotic camera systems with Moondream’s lightweight vision models to create video workflows that can interpret what the...
make videovisual reasoningptzopticsaimsactionable
https://www.videomaker.com/how-to/shooting/camera-movement/what-is-visual-reasoning-ai-and-how-is-it-reinventing-live-broadcast/
What is Visual Reasoning AI and how is it reinventing live broadcast? - Videomaker
Mar 12, 2026 - Visual Reasoning AI is transforming video broadcasting workflows, but what is it exactly, and how does it work?
visual reasoninglive broadcastaireinventingvideomaker
https://www.infoworld.com/article/4123202/gemini-flash-model-gets-visual-reasoning-capability.html
Gemini Flash model gets visual reasoning capability | InfoWorld
Jan 27, 2026 - Agentic Vision combines visual reasoning with code execution to ground answers in visual evidence, delivering a 5% to 10% quality boost across most vision...
gemini flashmodel getsvisual reasoningcapabilityinfoworld
https://deepmind.google/research/publications/124002/
Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning —...
multimodal foundation modelsauthoringpersonalizedvisualsensors
https://visualcommonsense.com/
VCR: Visual Commonsense Reasoning
commonsense reasoningvcrvisual
https://zzo.ai/models/nano-banana-pro
Nano Banana Pro AI Image Generator - Reasoning-Powered Visual Creation | zzo.ai
Experience Nano Banana Pro, the next-generation AI image engine powered by Gemini 3 reasoning. Perfect text rendering, character consistency, and logical...
nano banana proai image generatorpowered visualreasoningcreation
https://prior.allenai.org/projects/close
Perceptual Reasoning and Interaction Research - I Can't Believe There's No Images! Learning Visual...
Perceptual Reasoning and Interaction Research (PRIOR) is a computer vision research team within the Allen Institute for AI. PRIOR seeks to advance computer...
interaction researchperceptualreasoningbelieveimages