https://deepinfra.com/moonshotai/Kimi-K2-Instruct-0905
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
moonshotaikimiinstructdemodeepinfra
https://deepinfra.com/Qwen/Qwen3-Next-80B-A3B-Instruct
Over the past few months, we have observed increasingly clear trends toward scaling both total parameters and context lengths in the pursuit of more powerful...
qwennextinstructdemodeepinfra
https://deepinfra.com/Qwen/Qwen3-235B-A22B-Thinking-2507
Qwen3-235B-A22B-Thinking-2507 is the Qwen3's new model with scaling the thinking capability of Qwen3-235B-A22B, improving both the quality and depth of...
qwenthinkingdemodeepinfra
https://deepinfra.com/openai/gpt-oss-120b
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose...
gpt ossopenaidemodeepinfra
https://deepinfra.com/Qwen/Qwen3-235B-A22B-Instruct-2507
Qwen3-235B-A22B-Instruct-2507 is the updated version of the Qwen3-235B-A22B non-thinking mode, featuring Significant improvements in general capabilities,...
qweninstructdemodeepinfra
https://deepinfra.com/MiniMaxAI/MiniMax-M2
MiniMax-M2 is a Mini model built for Max coding & agentic workflows with just 10 billion activated parameters. Try out API on the Web
minimaxaidemodeepinfra
https://deepinfra.com/meta-llama/Meta-Llama-3-70B-Instruct
Model Details Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative...
metallamainstructdemodeepinfra
https://deepinfra.com/deepseek-ai/DeepSeek-V3.1-Terminus
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
deepseekaiterminusdemodeepinfra
https://deepinfra.com/deepseek-ai/DeepSeek-V3.1
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
deepseekaidemodeepinfra
https://deepinfra.com/meta-llama/Meta-Llama-3-8B-Instruct
Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in...
metallamainstructdemodeepinfra
https://deepinfra.com/moonshotai/Kimi-K2-Thinking
Kimi K2 Thinking is the latest, most capable version of open-source thinking model developed by MoonshotAI. Try out API on the Web
moonshotaikimithinkingdemodeepinfra
https://deepinfra.com/Qwen/Qwen3-Coder-480B-A35B-Instruct
Qwen3-Coder-480B-A35B-Instruct is the Qwen3's most agentic code model, featuring Significant Performance on Agentic Coding, Agentic Browser-Use and other...
qwencoderinstructdemodeepinfra
https://deepinfra.com/canopylabs/orpheus-3b-0.1-ft
Orpheus TTS is a state-of-the-art, Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been finetuned to...
orpheusftdemodeepinfra
https://ai-sdk.dev/providers/ai-sdk-providers/deepinfra
Learn how to use DeepInfra's models with the AI SDK.
ai sdkprovidersdeepinfra
https://deepinfra.com/deepseek-ai/DeepSeek-V3.2-Exp
DeepSeek-V3.2-Exp is an intermediate step toward the next-generation architecture of the DeepSeek models by introducing DeepSeek Sparse Attention—a sparse...
deepseekaiexpdemodeepinfra
https://deepinfra.com/meta-llama/Meta-Llama-3.1-8B-Instruct
Meta developed and released the Meta Llama 3.1 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models...
metallamainstructdemodeepinfra
https://deepinfra.com/PaddlePaddle/PaddleOCR-VL-0.9B
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
paddlepaddlevldemodeepinfra
https://deepinfra.com/zai-org/GLM-4.6
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
orgglmdemodeepinfra
https://deepinfra.com/deepseek-ai/DeepSeek-OCR
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
deepseekaiocrdemodeepinfra
https://deepinfra.com/meta-llama/Meta-Llama-3.1-70B-Instruct
Meta developed and released the Meta Llama 3.1 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models...
metallamainstructdemodeepinfra
https://deepinfra.com/allenai/olmOCR-2-7B-1025
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
allenaidemodeepinfra
https://deepinfra.com/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
Qwen3-Coder-480B-A35B-Instruct is the Qwen3's most agentic code model, featuring Significant Performance on Agentic Coding, Agentic Browser-Use and other...
qwencoderinstructturbodemo
https://deepinfra.com/openai/gpt-oss-20b
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
gpt ossopenaidemodeepinfra