https://github.com/sgl-project/sglang
GitHub - sgl-project/sglang: SGLang is a high-performance serving framework for large language...
SGLang is a high-performance serving framework for large language models and multimodal models. - sgl-project/sglang
https://www.sglang.io/
SGLang - High-Performance Serving Framework for LLMs and VLMs
SGLang powers fast, scalable inference for large language and multimodal models. Open-source serving framework with state-of-the-art performance.
high performancesglangservingframeworkllms
https://docs.sglang.io/docs/basic_usage/overview
Basic Usage - SGLang Documentation
Core APIs and common usage patterns for SGLang.
basic usagesglangdocumentation
https://docs.sglang.io/docs/developer_guide/contribution_guide
Contribution Guide - SGLang Documentation
contribution guidesglangdocumentation
https://docs.sglang.io/docs/advanced_features/vlm_query
Query VLM with Offline Engine - SGLang Documentation
queryvlmofflineenginesglang
https://www.thoughtworks.com/en-th/radar/tools/sglang
SGLang | Technology Radar | Thoughtworks Thailand
SGLang is a high-performance serving framework that reduces the compute overhead of LLM inference through a co-design of its front-end programming language and...
technology radarsglangthoughtworksthailand
https://docs.sglang.io/docs/references/multi_node_deployment/multi_node_index
Multi-Node Deployment - SGLang Documentation
multi nodedeploymentsglangdocumentation
https://docs.sglang.io/docs/basic_usage/offline_engine_api
Offline Engine API - SGLang Documentation
offlineengineapisglangdocumentation
https://docs.sglang.io/docs/advanced_features/hisparse_guide
HiSparse: Hierarchical Sparse Attention - SGLang Documentation
sparse attentionhierarchicalsglangdocumentation
https://docs.sglang.io/docs/basic_usage/popular_model_usage
Popular Model Usage (DeepSeek, GPT-OSS, GLM, Llama, MiniMax, Qwen, and more) - SGLang Documentation
Documentation for Popular Model Usage (DeepSeek, GPT-OSS, GLM, Llama, MiniMax, Qwen, and more)
https://aws.github.io/deep-learning-containers/releasenotes/sglang/
SGLang - Deep Learning Containers
Documentation for AWS Deep Learning Containers
deep learningsglangcontainers
https://arxiv.org/abs/2312.07104
[2312.07104] SGLang: Efficient Execution of Structured Language Model Programs
Abstract page for arXiv paper 2312.07104: SGLang: Efficient Execution of Structured Language Model Programs
language modelsglangefficientexecutionstructured
https://docs.sglang.io/docs/advanced_features/epd_disaggregation
EPD Disaggregation - SGLang Documentation
epddisaggregationsglangdocumentation
https://docs.sglang.io/docs/advanced_features/hyperparameter_tuning
Hyperparameter Tuning - SGLang Documentation
hyperparameter tuningsglangdocumentation
https://docs.sglang.io/cookbook/autoregressive/Qwen/Qwen3.5
Qwen3.5 - SGLang Documentation
sglangdocumentation
https://technicalmunch.com/sources-project-sglang-spins-out-as-radixark-with-400m-valuation-as-inference-market-explodes/
Sources: project SGLang spins out as RadixArk with $400M valuation as inference market explodes -...
Jan 21, 2026 - Some of the team responsible for maintaining SGLang, a popular open-source tool used by companies like xAI and Cursor to accelerate AI model training, has
https://docs.sglang.io/docs/supported-models
Supported models - SGLang Documentation
See which families of SGLang-compatible models are actively maintained.
supported modelssglangdocumentation
https://docs.sglang.io/cookbook/autoregressive/InclusionAI/Ling-2.6
Ling-2.6 - SGLang Documentation
lingsglangdocumentation
https://docs.sglang.io/cookbook/autoregressive/MiniMax/MiniMax-M2.7
MiniMax-M2.7 - SGLang Documentation
minimaxsglangdocumentation
https://docs.sglang.io/docs/developer_guide/benchmark_and_profiling
Benchmark and Profiling - SGLang Documentation
benchmarkprofilingsglangdocumentation
https://docs.sglang.io/cookbook/autoregressive/GLM/GLM-4.7-Flash
GLM-4.7-Flash - SGLang Documentation
glmflashsglangdocumentation