Robuta

https://rocm.blogs.amd.com/artificial-intelligence/hipblaslt-tensilelite-tuning/README.html
Customizing Kernels with hipBLASLt TensileLite GEMM Tuning - Advanced User Guide - ROCm Blogs
Master hipBLASLt TensileLite Tuning. Learn to build custom kernels that deliver 150%-250% faster GEMM performance on AMD Instinct™ MI300X GPUs.

https://rocm.blogs.amd.com/artificial-intelligence/mlperf-inf_v6.0-repro/README.html
Reproducing the AMD MLPerf Inference v6.0 Submission Result - ROCm Blogs
Provide instructions to potential customers and partners to verify our MLPerf Inference v6.0 submission result.

https://rocm.blogs.amd.com/software-tools-optimization/eaisuite-autoscaling/README.html
Leveraging AMD AI Workbench to Scale LLM Inference for Optimal Resource Utilization - ROCm Blogs
Learn how to use the AMD AI Workbench GUI and AIM Engine CLI capabilities to enable and configure autoscaling for your AI workloads.

https://rocm.blogs.amd.com/artificial-intelligence/mlperf-inference-v6.0/README.html
AMD Instinct™ GPUs MLPerf Inference v6.0 Submission - ROCm Blogs
In this blog, we share the technical details of how we accomplish the results in our MLPerf Inference v6.0 submission.

https://rocm.blogs.amd.com/artificial-intelligence/hipblaslt_online_tuning/README.html
hipBLASLt Online GEMM Tuning - ROCm Blogs
Learn how to improve model performance with hipBLASLt online tuning merged into LLM framework.

https://rocm.blogs.amd.com/software-tools-optimization/flydsl-nightly-wheel/README.html
Getting Started with FlyDSL Nightly Wheels on ROCm - ROCm Blogs
A practical guide to installing and using FlyDSL nightly wheels on ROCm for fast, Python-native GPU kernel development.