Robuta

https://www.modular.com/inference/shared-endpoints Modular: Shared Endpoints, Our Cloud, Any GPU OpenAI-compatible shared endpoints with $/token pricing. Scale to zero, burst to meet demand. 2x vLLM performance. 500+ open models. No minimums, no reserved... shared endpointsour cloudmodulargpu