Robuta

https://github.com/sgl-project/sglang
GitHub - sgl-project/sglang: SGLang is a high-performance serving framework for large language...
SGLang is a high-performance serving framework for large language models and multimodal models. - sgl-project/sglang

https://www.sglang.io/
SGLang - High-Performance Serving Framework for LLMs and VLMs
SGLang powers fast, scalable inference for large language and multimodal models. Open-source serving framework with state-of-the-art performance.

https://docs.sglang.io/docs/basic_usage/overview
Basic Usage - SGLang Documentation
Core APIs and common usage patterns for SGLang.

https://docs.sglang.io/docs/developer_guide/contribution_guide
Contribution Guide - SGLang Documentation

https://docs.sglang.io/docs/advanced_features/vlm_query
Query VLM with Offline Engine - SGLang Documentation

https://www.thoughtworks.com/en-th/radar/tools/sglang
SGLang | Technology Radar | Thoughtworks Thailand
SGLang is a high-performance serving framework that reduces the compute overhead of LLM inference through a co-design of its front-end programming language and...

https://docs.sglang.io/docs/references/multi_node_deployment/multi_node_index
Multi-Node Deployment - SGLang Documentation

https://docs.sglang.io/docs/basic_usage/offline_engine_api
Offline Engine API - SGLang Documentation

https://docs.sglang.io/docs/advanced_features/hisparse_guide
HiSparse: Hierarchical Sparse Attention - SGLang Documentation

https://docs.sglang.io/docs/basic_usage/popular_model_usage
Popular Model Usage (DeepSeek, GPT-OSS, GLM, Llama, MiniMax, Qwen, and more) - SGLang Documentation
Documentation for Popular Model Usage (DeepSeek, GPT-OSS, GLM, Llama, MiniMax, Qwen, and more)

https://aws.github.io/deep-learning-containers/releasenotes/sglang/
SGLang - Deep Learning Containers
Documentation for AWS Deep Learning Containers

https://arxiv.org/abs/2312.07104
[2312.07104] SGLang: Efficient Execution of Structured Language Model Programs
Abstract page for arXiv paper 2312.07104: SGLang: Efficient Execution of Structured Language Model Programs

https://docs.sglang.io/docs/advanced_features/epd_disaggregation
EPD Disaggregation - SGLang Documentation

https://docs.sglang.io/docs/advanced_features/hyperparameter_tuning
Hyperparameter Tuning - SGLang Documentation

https://docs.sglang.io/cookbook/autoregressive/Qwen/Qwen3.5
Qwen3.5 - SGLang Documentation

https://technicalmunch.com/sources-project-sglang-spins-out-as-radixark-with-400m-valuation-as-inference-market-explodes/
Sources: project SGLang spins out as RadixArk with $400M valuation as inference market explodes -...
Jan 21, 2026 - Some of the team responsible for maintaining SGLang, a popular open-source tool used by companies like xAI and Cursor to accelerate AI model training, has

https://docs.sglang.io/docs/supported-models
Supported models - SGLang Documentation
See which families of SGLang-compatible models are actively maintained.

https://docs.sglang.io/cookbook/autoregressive/InclusionAI/Ling-2.6
Ling-2.6 - SGLang Documentation

https://docs.sglang.io/cookbook/autoregressive/MiniMax/MiniMax-M2.7
MiniMax-M2.7 - SGLang Documentation

https://docs.sglang.io/docs/developer_guide/benchmark_and_profiling
Benchmark and Profiling - SGLang Documentation

https://docs.sglang.io/cookbook/autoregressive/GLM/GLM-4.7-Flash
GLM-4.7-Flash - SGLang Documentation