https://modal.com/docs/examples/vllm_inference
Run OpenAI-compatible LLM inference with Gemma and vLLM | Modal Docs
In this example, we show how to run a vLLM server in OpenAI-compatible mode on Modal.
openai compatiblellm inferencemodal docs
https://modal.com/docs/examples/streaming_kyutai_stt
Stream transcriptions with Kyutai STT | Modal Docs
This example demonstrates the deployment of a streaming audio transcription service with Kyutai STT on Modal.
modal docsstreamtranscriptionsstt
https://modal.com/docs/examples/diffusers_lora_finetune
Fine-tune Flux on your pet using LoRA | Modal Docs
This example finetunes the Flux.1-dev model on images of a pet (by default, a puppy named Qwerty) using a technique called textual inversion from the...
fine tuneyour petmodal docs
https://modal.com/docs/examples/generate_music
Make music with ACE-Step 1.5 | Modal Docs
In this example, we show you how you can run ACE Studio’s ACE-Step 1.5 music generation model on Modal.
make musicace stepmodal docs
https://modal.com/docs/guide
Introduction | Modal Docs
Modal is a serverless AI infrastructure platform with sub-second cold starts and per-second pricing.
modal docsintroduction