https://deepinfra.com/deepseek-ai/Janus-Pro-1B/versions
deepseek-ai/Janus-Pro-1B - Versions - DeepInfra
Janus-Pro is a novel autoregressive framework that unifies multimodal understanding and generation. It addresses the limitations of previous approaches by...
deepseek aijanus proversionsdeepinfra
https://docs.deepinfra.com/apis/reranker
Reranking - DeepInfra
Rerank a list of documents by relevance to a query.
rerankingdeepinfra
https://deepinfra.com/zai-org/GLM-4.7-Flash/versions
zai-org/GLM-4.7-Flash - Versions - DeepInfra
GLM-4.7-Flash is a 30B-A3B MoE model. As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances...
zaiglmflashversionsdeepinfra
https://deepinfra.com/dash/deployments?new=custom-llm&base_model=google%2Fgemma-2-9b-it
Dashboard - DeepInfra
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
dashboarddeepinfra
https://deepinfra.com/deepseek-ai/DeepSeek-R1-0528/api
deepseek-ai/DeepSeek-R1-0528 - API Reference - DeepInfra
The DeepSeek R1 model has undergone a minor version upgrade, with the current version being DeepSeek-R1-0528.. Full API Reference
deepseek aiapi referencedeepinfra
https://deepinfra.com/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8/versions
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 - Versions - DeepInfra
The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts...
meta llamamaverickinstructversionsdeepinfra
https://deepinfra.com/XiaomiMiMo/MiMo-V2.5-tts/api
XiaomiMiMo/MiMo-V2.5-tts - API Reference - DeepInfra
Automatically convert input text into natural and fluent speech output. You can generate natural and vivid speech content by configuring parameters such as...
tts apimimoreferencedeepinfra
https://toktab.com/deepinfra-Qwen-Qwen3-14B/
deepinfra/Qwen/Qwen3-14B Pricing - Toktab
Current pricing data for deepinfra/Qwen/Qwen3-14B - input/output token costs and context window. Free JSON API available.
deepinfraqwenpricing
https://deepinfra.com/moonshotai/Kimi-K2.5/versions
moonshotai/Kimi-K2.5 - Versions - DeepInfra
Kimi K2.5 is an open-source, native multimodal agentic model built through continual pretraining on approximately 15 trillion mixed visual and text tokens atop...
moonshotaikimiversionsdeepinfra
https://deepinfra.com/dash/deployments?new=custom-llm&base_model=zai-org%2FGLM-4.7-Flash
Dashboard - DeepInfra
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
dashboarddeepinfra
https://deepinfra.com/mistralai/Mistral-Small-3.2-24B-Instruct-2506/versions
mistralai/Mistral-Small-3.2-24B-Instruct-2506 - Versions - DeepInfra
Mistral-Small-3.2-24B-Instruct is a drop-in upgrade over the 3.1 release, with markedly better instruction following, roughly half the infinite-generation...
mistral smallmistralaiinstructversionsdeepinfra
https://deepinfra.com/Bria/expand/voice
Bria/expand - Voice - DeepInfra
Bria Expand expands images beyond their borders in high quality. Resizing the image by generating new pixels to expand to the desired aspect ratio. Trained...
briaexpandvoicedeepinfra
https://deepinfra.com/oauth/authorize
Authorize | DeepInfra
Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models.
authorizedeepinfra
https://www.getmaxim.ai/bifrost/llm-cost-calculator/provider/deepinfra/model/olmocr-7b-0725-fp8
olmOCR-7B-0725-FP8 Cost Calculator - DeepInfra | Bifrost
Calculate the cost of using olmOCR-7B-0725-FP8 from DeepInfra for Chat workloads. Input: $0.27 per 1M tokens, Output: $1.50 per 1M tokens
cost calculatorolmocrdeepinfrabifrost
https://deepinfra.com/nvidia/llama-nemotron-rerank-vl-1b-v2/voice
nvidia/llama-nemotron-rerank-vl-1b-v2 - Voice - DeepInfra
The llama-nemotron-rerank-vl-1b-v2 is a 1.7B parameter multimodal reranking model designed to evaluate and order the relevance of document images and text...
nvidiallamanemotronrerankvl
https://theconsensus.dev/company/deepinfra.html
DeepInfra - The Consensus
deepinfraconsensus
https://deepinfra.com/nvidia/NVIDIA-Nemotron-Nano-9B-v2/voice
nvidia/NVIDIA-Nemotron-Nano-9B-v2 - Voice - DeepInfra
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning...
nvidia nemotronnanovoicedeepinfra
https://docs.deepinfra.com/chat/structured-outputs
Structured Outputs - DeepInfra
Get model responses in JSON format using response_format.
structured outputsdeepinfra
https://docs.openclaw.ai/de/providers/deepinfra
DeepInfra - OpenClaw
deepinfraopenclaw