Sponsor of the Day:
Jerkmate
https://ollama.com/blog/new-model-scheduling
New model scheduling · Ollama Blog
Ollama now includes a significantly improved model scheduling system, reducing crashes due to out of memory issues, maximizing GPU utilization and performance,...
new modelollama blogscheduling
https://ollama.com/blog/structured-outputs
Structured outputs · Ollama Blog
Ollama now supports structured outputs making it possible to constrain a model's output to a specific format defined by a JSON schema. The Ollama Python and...
structured outputsollama blog
https://ollama.com/blog/llms-in-obsidian
Leveraging LLMs in your Obsidian Notes · Ollama Blog
This post walks through how you could incorporate a local LLM using Ollama in Obsidian, or potentially any note taking tool.
leveraging llmsollama blogobsidiannotes
https://ollama.com/blog/gpt-oss
OpenAI gpt-oss · Ollama Blog
Ollama partners with OpenAI to bring gpt-oss to Ollama and its community.
openai gpt ossollama blog
https://ollama.com/blog/vision-models
Vision models · Ollama Blog
New vision models are now available: LLaVA 1.6, in 7B, 13B and 34B parameter sizes. These models support higher resolution images, improved text recognition...
vision modelsollama blog
https://ollama.com/blog/minimax-m2
MiniMax M2 · Ollama Blog
MiniMax M2 is now available on Ollama's cloud. It's a model built for coding and agentic workflows.
minimax m2ollama blog
https://ollama.com/blog/llama3.2-vision
Llama 3.2 Vision · Ollama Blog
Llama 3.2 Vision 11B and 90B models are now available in Ollama.
llama 3 2ollama blogvision
https://ollama.com/blog/qwen3-vl
Qwen3-VL · Ollama Blog
Ollama now supports Alibaba's Qwen3-VL.
qwen3 vlollama blog
https://ollama.com/blog/web-search-subagents-claude-code
Subagents and web search in Claude Code · Ollama Blog
Ollama now supports subagents and web search in Claude Code.
web searchclaude codeollama blogsubagents
https://ollama.com/blog/image-generation
Image generation (experimental) · Ollama Blog
Generate images locally with Ollama on macOS. Windows and Linux support coming soon.
image generationollama blogexperimental
https://ollama.com/blog/windows-preview
Windows preview · Ollama Blog
Ollama is now available on Windows in preview, making it possible to pull, run and create large language models in a new native Windows experience. Ollama on...
windows previewollama blog
https://ollama.com/blog/streaming-tool
Streaming responses with tool calling · Ollama Blog
Ollama now supports streaming responses with tool calling. This enables all chat applications to stream content and also call tools in real time.
streaming responsestool callingollama blog
https://ollama.com/blog/openclaw
OpenClaw · Ollama Blog
OpenClaw is a personal AI assistant that connects your messaging apps to local AI coding agents, all running on your own device.
ollama blogopenclaw
https://ollama.com/blog/gpt-oss-safeguard
OpenAI gpt-oss-safeguard · Ollama Blog
Ollama is partnering with OpenAI and ROOST (Robust Open Online Safety Tools) to bring the latest gpt-oss-safeguard reasoning models to users for safety...
openai gpt ossollama blogsafeguard
https://ollama.com/blog/openai-compatibility
OpenAI compatibility · Ollama Blog
Ollama now has initial compatibility with the OpenAI Chat Completions API, making it possible to use existing tooling built for OpenAI with local models via...
ollama blogopenaicompatibility
https://ollama.com/blog/python-javascript-libraries
Python & JavaScript Libraries · Ollama Blog
The initial versions of the Ollama Python and JavaScript libraries are now available, making it easy to integrate your Python or JavaScript, or Typescript app...
python javascriptollama bloglibraries
https://ollama.com/blog/llama-3-is-not-very-censored
Llama 3 is not very censored · Ollama Blog
Compared to Llama 2, Llama 3 feels much less censored. Meta has substantially lowered false refusal rates. Llama 3 will refuse less than 1/3 of the prompts...
llama 3ollama blogcensored
https://ollama.com/blog/gemma2
Google Gemma 2 · Ollama Blog
Gemma 2 is now available on Ollama in 3 sizes - 2B, 9B and 27B.
google gemma 2ollama blog
https://ollama.com/blog/thinking
Thinking · Ollama Blog
Ollama now has the ability to enable or disable thinking. This gives users the flexibility to choose the model’s thinking behavior for different applications...
ollama blogthinking
https://ollama.com/blog/nvidia-spark
NVIDIA DGX Spark · Ollama Blog
The latest NVIDIA DGX Spark is here! Ollama has partnered with NVIDIA to ensure it runs fast and efficiently out-of-the-box.
nvidia dgx sparkollama blog
https://ollama.com/blog/web-search
Web search · Ollama Blog
A new web search API is now available in Ollama. Ollama provides a generous free tier of web searches for individuals to use, and higher rate limits are...
web searchollama blog
https://ollama.com/blog/how-to-prompt-code-llama
How to prompt Code Llama · Ollama Blog
This guide walks through the different ways to structure prompts for Code Llama and its different variations and features including instructions, code...
code llamaollama blogprompt
https://ollama.com/blog/minions
Minions: where local and cloud LLMs meet · Ollama Blog
Avanika Narayan, Dan Biderman, and Sabri Eyuboglu from Christopher Ré's Stanford Hazy Research lab, along with Avner May, Scott Linderman, James Zou, have...
cloud llmsollama blogminionslocalmeet
https://ollama.com/blog/coding-models
New coding models & integrations · Ollama Blog
GLM-4.6 and Qwen3-coder-480B are available on Ollama’s cloud service with easy integrations to the tools you are familiar with. Qwen3-Coder-30B has been...
new codingollama blogmodelsintegrations