https://rocm.blogs.amd.com/artificial-intelligence/hipblaslt-tensilelite-tuning/README.html
Customizing Kernels with hipBLASLt TensileLite GEMM Tuning - Advanced User Guide — ROCm Blogs
Master hipBLASLt TensileLite tuning. Learn to build custom kernels that deliver 150%-250% faster GEMM performance on AMD Instinct™ MI300X GPUs.
https://rocm.blogs.amd.com/artificial-intelligence/mlperf-inf_v6.0-repro/README.html
Reproducing the AMD MLPerf Inference v6.0 Submission Result — ROCm Blogs
Instructions for potential customers and partners to verify our MLPerf Inference v6.0 submission result.
https://rocm.blogs.amd.com/software-tools-optimization/eaisuite-autoscaling/README.html
Leveraging AMD AI Workbench to Scale LLM Inference for Optimal Resource Utilization — ROCm Blogs
Learn how to use the AMD AI Workbench GUI and AIM Engine CLI capabilities to enable and configure autoscaling for your AI workloads.
https://rocm.blogs.amd.com/artificial-intelligence/mlperf-inference-v6.0/README.html
AMD Instinct™ GPUs MLPerf Inference v6.0 Submission — ROCm Blogs
In this blog, we share the technical details of how we achieved the results in our MLPerf Inference v6.0 submission.
https://rocm.blogs.amd.com/artificial-intelligence/hipblaslt_online_tuning/README.html
hipBLASLt Online GEMM Tuning — ROCm Blogs
Learn how to improve model performance with hipBLASLt online tuning merged into an LLM framework.
https://rocm.blogs.amd.com/software-tools-optimization/flydsl-nightly-wheel/README.html
Getting Started with FlyDSL Nightly Wheels on ROCm — ROCm Blogs
A practical guide to installing and using FlyDSL nightly wheels on ROCm for fast, Python-native GPU kernel development.