deepinfra - Robuta Search

https://deepinfra.com/deepseek-ai/Janus-Pro-1B/versions deepseek-ai/Janus-Pro-1B - Versions - DeepInfra Janus-Pro is a novel autoregressive framework that unifies multimodal understanding and generation. It addresses the limitations of previous approaches by... deepseek ai janus pro versions deepinfra https://docs.deepinfra.com/apis/reranker Reranking - DeepInfra Rerank a list of documents by relevance to a query. reranking deepinfra https://deepinfra.com/zai-org/GLM-4.7-Flash/versions zai-org/GLM-4.7-Flash - Versions - DeepInfra GLM-4.7-Flash is a 30B-A3B MoE model. As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances... zai glm flash versions deepinfra https://deepinfra.com/dash/deployments?new=custom-llm&base_model=google%2Fgemma-2-9b-it Dashboard - DeepInfra Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models. dashboard deepinfra https://deepinfra.com/deepseek-ai/DeepSeek-R1-0528/api deepseek-ai/DeepSeek-R1-0528 - API Reference - DeepInfra The DeepSeek R1 model has undergone a minor version upgrade, with the current version being DeepSeek-R1-0528.. Full API Reference deepseek ai api reference deepinfra https://deepinfra.com/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8/versions meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 - Versions - DeepInfra The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts... meta llama maverick instruct versions deepinfra https://deepinfra.com/XiaomiMiMo/MiMo-V2.5-tts/api XiaomiMiMo/MiMo-V2.5-tts - API Reference - DeepInfra Automatically convert input text into natural and fluent speech output. You can generate natural and vivid speech content by configuring parameters such as... tts api mimo reference deepinfra https://toktab.com/deepinfra-Qwen-Qwen3-14B/ deepinfra/Qwen/Qwen3-14B Pricing - Toktab Current pricing data for deepinfra/Qwen/Qwen3-14B - input/output token costs and context window. Free JSON API available. deepinfra qwen pricing https://deepinfra.com/moonshotai/Kimi-K2.5/versions moonshotai/Kimi-K2.5 - Versions - DeepInfra Kimi K2.5 is an open-source, native multimodal agentic model built through continual pretraining on approximately 15 trillion mixed visual and text tokens atop... moonshotai kimi versions deepinfra https://deepinfra.com/dash/deployments?new=custom-llm&base_model=zai-org%2FGLM-4.7-Flash Dashboard - DeepInfra Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models. dashboard deepinfra https://deepinfra.com/mistralai/Mistral-Small-3.2-24B-Instruct-2506/versions mistralai/Mistral-Small-3.2-24B-Instruct-2506 - Versions - DeepInfra Mistral-Small-3.2-24B-Instruct is a drop-in upgrade over the 3.1 release, with markedly better instruction following, roughly half the infinite-generation... mistral small mistralai instruct versions deepinfra https://deepinfra.com/Bria/expand/voice Bria/expand - Voice - DeepInfra Bria Expand expands images beyond their borders in high quality. Resizing the image by generating new pixels to expand to the desired aspect ratio. Trained... bria expand voice deepinfra https://deepinfra.com/oauth/authorize Authorize | DeepInfra Low pay-as-you-go pricing. No long-term contracts. Simple APIs. Scale to trillions of tokens. 100+ AI models. authorize deepinfra https://www.getmaxim.ai/bifrost/llm-cost-calculator/provider/deepinfra/model/olmocr-7b-0725-fp8 olmOCR-7B-0725-FP8 Cost Calculator - DeepInfra | Bifrost Calculate the cost of using olmOCR-7B-0725-FP8 from DeepInfra for Chat workloads. Input: $0.27 per 1M tokens, Output: $1.50 per 1M tokens cost calculator olmocr deepinfra bifrost https://deepinfra.com/nvidia/llama-nemotron-rerank-vl-1b-v2/voice nvidia/llama-nemotron-rerank-vl-1b-v2 - Voice - DeepInfra The llama-nemotron-rerank-vl-1b-v2 is a 1.7B parameter multimodal reranking model designed to evaluate and order the relevance of document images and text... nvidia llama nemotron rerank vl https://theconsensus.dev/company/deepinfra.html DeepInfra - The Consensus deepinfra consensus https://deepinfra.com/nvidia/NVIDIA-Nemotron-Nano-9B-v2/voice nvidia/NVIDIA-Nemotron-Nano-9B-v2 - Voice - DeepInfra NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning... nvidia nemotron nano voice deepinfra https://docs.deepinfra.com/chat/structured-outputs Structured Outputs - DeepInfra Get model responses in JSON format using response_format. structured outputs deepinfra https://docs.openclaw.ai/de/providers/deepinfra DeepInfra - OpenClaw deepinfra openclaw