https://mlcommons.org/2025/09/deepseek-inference-5-1/
Sep 9, 2025 - MLCommons MLPerf Inference v5.1 Benchmarking the Next Generation of Reasoning LLMs with Long Output Sequences
mlperf inferencedeepseekreasoning
https://mlcommons.org/2025/04/mlperf-inference-v5-0-results/
Jul 1, 2025 - MLCommons' latest MLPerf Inference v5.0 results show Gen AI now the center of attention for performance engineering.
mlperf inferencebenchmark resultsreleasesnew
https://mlcommons.org/2025/04/auto-inference-v5/
Sep 2, 2025 - MLCommons has developed a new automotive benchmark for MLPerf Inference v5.0 based on established industry methods such as PointPainting, DeepLabv3+, and the...
mlperf inferencenewautomotivebenchmark
https://mlcommons.org/working-groups/benchmarks/inference/
Aug 15, 2025 - The MLCommons MLPerf Inference working group creates a set of fair and representative inference benchmarks. The myriad combinations of ML hardware and software...
mlperf inference
https://www.networkworld.com/article/3952638/nvidias-blackwell-raises-the-bar-with-new-mlperf-inference-v5-0-results.html
Sep 9, 2025 - Its GB200 NVL72 system delivered up to 30 times higher throughput on the Llama 3.1 405B workload compared to firm’s H200 NVL8, Nvidia said.
mlperf inferenceblackwellraisesbarnew
https://mlcommons.org/2025/09/mlperf-inference-v5-1-results/
Sep 9, 2025 - MLCommons Releases New MLPerf Inference v5.1 Benchmark ResultsNew results highlight AI industry’s latest technical advances
mlperf inferencebenchmark resultsreleasesnew
https://mlcommons.org/2025/09/small-llm-inference-5-1/
Sep 9, 2025 - MLCommons introduces a new small language model benchmark based on established industry methods such as Llama3.1-8B, vLLM, and the CNN-DailyMail dataset.
mlperf inferencebenchmarkingsmallllms