13.2 GB16GB+ VRAMlora

gemma_3_12B_it_fp8_scaled.safetensors

Gemma 3 12B IT FP8 Scaled (Text Encoder)

FP8-scaled Gemma 3 12B IT text encoder. Alternative to FP4 with marginally higher precision. Place in models/text_encoders/.

Download gemma_3_12B_it_fp8_scaled.safetensors

Direct HuggingFace download. 13.2 GB · Free.

Install path: ComfyUI/models/loras/ + gemma_3_12B_it_fp8_scaled.safetensors

No 16GB GPU? Try gemma_3_12B_it_fp8_scaled.safetensors online — free generation included

Skip the 13.2 GB download and ComfyUI setup. Generate a 6-second video using this exact model in your browser, ~30 seconds.

Try this model online — free →

Will this run on my GPU?

Minimum: 16GB VRAM.

GPUVRAMVerdict
RTX 3060 12GB12GBInsufficient VRAM
RTX 4060 Ti / 4070 (16GB)16GBTight fit
RTX 4070 Ti SUPER / 4080 (16GB)16GBTight fit
RTX 3090 (24GB)24GBComfortable
RTX 4090 (24GB)24GBComfortable
RTX 5090 / A6000 (32GB+)32GBComfortable

Recommendation: Alternative for 16-24 GB cards. Slightly higher quality than FP4 at 3.7 GB more.

How to use gemma_3_12B_it_fp8_scaled.safetensors

  1. Download the file from HuggingFace.
  2. Place it in ComfyUI/models/loras/ inside your ComfyUI directory.
  3. Restart ComfyUI (or refresh the model list from the menu).
  4. Load a compatible workflow — see below.

Don't want to run this locally? Try gemma_3_12B_it_fp8_scaled.safetensors online with a free generation — no GPU, no install, ~30 seconds per clip.

Common issues

ComfyUI doesn't see the file after I downloaded it

Make sure the file is in ComfyUI/models/loras/ (not a subfolder). Restart ComfyUI fully — the menu refresh sometimes misses new files. Filename must match exactly: gemma_3_12B_it_fp8_scaled.safetensors.

CUDA out of memory error when loading the model

gemma_3_12B_it_fp8_scaled.safetensors needs ~16GB VRAM minimum. If you're hitting OOM: • Enable Sequential Offloading in ComfyUI settings • Lower the resolution (768×512 instead of 1280×704) — both dimensions must be divisible by 32 • Reduce frame count (65 frames instead of 161) — must be 8n+1 • Use a smaller variant — see Related models below.

How do I apply this LoRA in ComfyUI?

Load it in a 'LoraLoader' node and connect it after your model loader. Pair this LoRA with the dev base model (not the distilled one) for the right behavior. LoRA strength 1.0 is the trained value — start there.

Free newsletter

Get notified when Gemma 3 12B IT FP8 Scaled (Text Encoder) updates

Occasional updates on what's new in LTX 2.3 — new FP8 quants, LoRAs, IC-LoRA releases — with our hands-on verdict on whether they're worth re-downloading. No fixed cadence.

No spam. Sent occasionally when there's real news. Unsubscribe in one click.

Related models