https://github.com/LostRuins/koboldcpp
GitHub - LostRuins/koboldcpp: Run GGUF models easily with a KoboldAI UI. One File. Zero Install. ยท...
Run GGUF models easily with a KoboldAI UI. One File. Zero Install. - LostRuins/koboldcpp
https://mitjafelicijan.github.io/gguf-list/
GGUF model list
ggufmodellist
https://sekurak.pl/tag/gguf/
gguf - Sekurak
gguf
https://www.endorlabs.com/ai-model/bartowski-llama-3-2-3b-instruct-uncensored-gguf
bartowski/Llama-3.2-3B-Instruct-uncensored-GGUF | Endor Labs
bartowski/Llama-3.2-3B-Instruct-uncensored-GGUF AI model with 22255 downloads
llamainstructuncensoredggufendor
https://free2aitools.com/dataset/ermiaazarkhalili/carnice-9b-function-calling-xlam-unsloth-gguf
Carnice 9b Function Calling Xlam Unsloth Gguf - AI Dataset Insights
May 16, 2026 - Deep dive into Carnice 9b Function Calling Xlam Unsloth Gguf.
function callingai datasetxlamunslothgguf
https://docs.inferless.com/how-to-guides/deploy-a-Llama-3.1-8B-Instruct-GGUF-using-inferless
Deploy Llama-3.1-8B-Instruct GGUF using Inferless - Inferless
Llama-3.1-8B-Instruct GGUF is a quantized version of Meta's state-of-the-art Llama-3.1 series of large language models. This guide will take you through the...
deployllamainstructggufusing
https://free2aitools.com/dataset/mradermacher/vietlegal-harrier-0.6b-gguf
Vietlegal Harrier 0.6b Gguf - AI Dataset Insights
May 15, 2026 - Deep dive into Vietlegal Harrier 0.6b Gguf.
ai datasetharrierggufinsights
https://qwen-image-2512.com/blog/qwen-image-layered-gguf-comfyui-guide-en
How to Use Qwen-Image-Layered GGUF in ComfyUI: Complete Installation and Usage Guide
Complete guide to installing and using Qwen-Image-Layered GGUF in ComfyUI. Learn layer decomposition, GGUF quantization benefits, and step-by-step setup for...
how to use
https://www.runpod.io/blog/gguf-quantization-koboldcpp
How to Work with GGUF Quantizations in KoboldCPP | Runpod Blog
GGUF quantizations make large language models faster and more efficient. This guide walks you through using KoboldCPP to load, run, and manage quantized LLMs...
how to workggufkoboldcpprunpodblog
https://docs.vllm.ai/en/latest/api/vllm/model_executor/layers/quantization/gguf/
gguf - vLLM
ggufvllm
https://interfaze.ai/models/noctrexlightonocr-2-1b-gguf
LightOnOCR 2 1B GGUF - Interfaze
Apr 13, 2026 - LightOnOCR 2 1B GGUF by noctrex, a image-to-text model with OCR capabilities. Understand and compare OCR features, benchmarks, and capabilities.
lightonocrgguf
https://interfaze.ai/models/unslothqwen36-35b-a3b-gguf
Qwen3.6 35B A3B GGUF - Interfaze
Apr 17, 2026 - Qwen3.6 35B A3B GGUF by unsloth, a image-text-to-text model with multimodal capabilities. Understand and compare multimodal features, benchmarks, and...
gguf
https://forums.developer.nvidia.com/t/trt-llm-for-inference-with-nvfp4-safetensors-slower-than-lm-studio-gguf-on-the-spark/348636
TRT LLM for Inference with NVFP4 safetensors slower than LM studio GGUF on the Spark - DGX Spark /...
Oct 22, 2025 - I was executing the TRT LLM for Inference playbook using the nvidia/Llama-3.3-70B-Instruct-FP4 LLM and loaded meta/llama-3.3-70b Q4_K_M on LM Studio. TRT LLM...
https://www.endorlabs.com/ai-model/lewdiculous-mn-12b-lyra-v4-gguf-iq-imatrix
Lewdiculous/MN-12B-Lyra-v4-GGUF-IQ-Imatrix | Endor Labs
Lewdiculous/MN-12B-Lyra-v4-GGUF-IQ-Imatrix AI model with 8328 downloads
mnlyraggufiqimatrix
https://www.runninghub.ai/model/public/1929101229916372994
- RunningHub Stable Diffusion & Flux GGUF
Discover generative AI and LoRA models for personalized model training, creative styles, and AI content creation. Build and customize your own AI models with...
stable diffusionrunninghubfluxgguf
https://www.decodesfuture.com/articles/llama-cpp-gguf-quantization-guide-2026
Llama.cpp GGUF Quantization Guide: Optimize Local LLM Performance (2026)
Feb 27, 2026 - Learn how GGUF quantization works in Llama.cpp and optimize local LLM performance. Expert guide covering Q4, Q5, Q8 formats, RAM usage, and speed benchmarks.
gguf quantization guidelocal llmllamacppoptimize