https://www.modular.com/inference/shared-endpoints
Modular: Shared Endpoints, Our Cloud, Any GPU
OpenAI-compatible shared endpoints with $/token pricing. Scale to zero, burst to meet demand. 2x vLLM performance. 500+ open models. No minimums, no reserved...
shared endpointsour cloudmodulargpu