https://huggingface.co/zai-org/GLM-4.6-FP8
zai-org/GLM-4.6-FP8 · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 · Hugging Face
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/Wan2_1-T2V-14B_fp8_e4m3fn.safetensors
Wan2_1-T2V-14B_fp8_e4m3fn.safetensors · Kijai/WanVideo_comfy at main
https://towardsdatascience.com/breaking-the-hardware-barrier-software-fp8-for-older-gpus/
Breaking the Hardware Barrier: Software FP8 for Older GPUs | Towards Data Science
Dec 29, 2025 - Deep learning workloads are increasingly memory-bound, with GPU cores sitting idle while waiting for data transfers. FP8 precision solves this on newer...
https://huggingface.co/XLabs-AI/flux-dev-fp8/tree/main
XLabs-AI/flux-dev-fp8 at main
https://hazyresearch.stanford.edu/blog/2024-11-27-tk-fp8
ThunderKittens: Bringing fp8 to theaters near you · Hazy Research
https://huggingface.co/Qwen/Qwen3.6-27B-FP8
Qwen/Qwen3.6-27B-FP8 · Hugging Face
https://huggingface.co/PrimeIntellect/INTELLECT-3-FP8
PrimeIntellect/INTELLECT-3-FP8 · Hugging Face
https://civitai.com/models/637170/flux1-compact-or-clip-and-vae-included?modelVersionId=714945
Flux.1 Compact | CLIP and VAE included - 🟦 Flux.1-Schnell fp8 | Flux.1 Checkpoint | Civitai
FLUX COMPACT I am excited to present a collection of compact Flux models, with Clips and VAE included in a single model, optimized to integrate sea...
https://huggingface.co/NousResearch/k2-merged-3.5T-fp8
NousResearch/k2-merged-3.5T-fp8 · Hugging Face
https://docs.vllm.ai/en/latest/api/vllm/model_executor/layers/quantization/compressed_tensors/compressed_tensors_moe/compressed_tensors_moe_w4a8_fp8/
compressed_tensors_moe_w4a8_fp8 - vLLM
https://huggingface.co/comfyanonymous/flux_text_encoders/blob/main/t5xxl_fp8_e4m3fn.safetensors
t5xxl_fp8_e4m3fn.safetensors · comfyanonymous/flux_text_encoders at main
https://huggingface.co/zai-org/GLM-4.6V-FP8?inference_api=true&inference_provider=zai-org
zai-org/GLM-4.6V-FP8 · Hugging Face
https://huggingface.co/Kijai/WanVideo_comfy/blame/main/Wan2_1-I2V-14B-720P_fp8_e4m3fn.safetensors
Wan2_1-I2V-14B-720P_fp8_e4m3fn.safetensors · Kijai/WanVideo_comfy at main
https://hackernoon.com/how-frontier-labs-use-fp8-to-train-faster-and-spend-less
How Frontier Labs Use FP8 to Train Faster and Spend Less | HackerNoon
Naively casting to FP8 destroys your numerics. Here's the per-tensor and blockwise quantization mechanics that make it actually work at pretraining scale.
https://huggingface.co/nota-ai/Solar-Open-100B-Nota-FP8
nota-ai/Solar-Open-100B-Nota-FP8 · Hugging Face
https://zencoder.ai/blog/the-reality-of-self-hosting-llms-performance-cost-and-control-with-glm-4.5-fp8-white-paper
The Reality of Self-Hosting LLMs: Performance, Cost, and Control with GLM-4.5-FP8 - White paper
Jan 28, 2026 - Discover which LLM is the best for coding to help you write cleaner code, fix bugs faster, and boost your productivity effortlessly.
https://huggingface.co/unsloth/Qwen3-VL-8B-Thinking-FP8
unsloth/Qwen3-VL-8B-Thinking-FP8 · Hugging Face
https://huggingface.co/unsloth/Qwen3-14B-FP8
unsloth/Qwen3-14B-FP8 · Hugging Face
https://huggingface.co/Qwen/Qwen3.6-35B-A3B-FP8
Qwen/Qwen3.6-35B-A3B-FP8 · Hugging Face
https://huggingface.co/zai-org/GLM-5.1-FP8?inference_api=true&inference_provider=zai-org
zai-org/GLM-5.1-FP8 · Hugging Face