Search for ComfyUI-GGUF in ComfyUI Manager and install it, or clone manually:
在 ComfyUI Manager 中搜索并安装 ComfyUI-GGUF,或手动克隆:
bash
cd ComfyUI/custom_nodes
git clone https://github.com/city96/ComfyUI-GGUF
Step 2 — Place the Model File / 放置模型文件
Place the downloaded .gguf file into:
将下载的 .gguf 文件放入:
text
ComfyUI/models/unet/
Step 3 — Load in Workflow / 在 Workflow 中加载
In the ComfyUI canvas:
Double-click to open the node search
Search for Unet Loader (GGUF) and replace your existing Load Diffusion Model node
Select the .gguf file from the dropdown
在 ComfyUI 画布中:
双击空白处打开节点搜索
搜索 Unet Loader (GGUF),替换原有的 Load Diffusion Model 节点
在节点下拉菜单中选择对应的 .gguf 文件
About GGUF Quantization / 关于 GGUF 量化
GGUF is a quantization format popularized by llama.cpp that supports offloading
model layers to system RAM, enabling large models to run even when VRAM is
insufficient. Q8_0 has negligible quality loss compared to FP16, while Q5_K_M
offers the best balance between file size and image quality.