13.2 GB · 16GB+ VRAM · lora

gemma_3_12B_it_fp8_scaled.safetensors

Gemma 3 12B IT FP8 Scaled (Text Encoder)

FP8-scaled Gemma 3 12B IT text encoder. Alternative to FP4 with marginally higher precision. Place in models/text_encoders/.

Download gemma_3_12B_it_fp8_scaled.safetensors

Direct HuggingFace download. 13.2 GB · Free.

Install path: ComfyUI/models/text_encoders/gemma_3_12B_it_fp8_scaled.safetensors

Will this run on my GPU?

Minimum: 16GB VRAM.

GPU                                VRAM    Verdict
RTX 3060 12GB                      12GB    Insufficient VRAM
RTX 4060 Ti / 4070 (16GB)          16GB    Tight fit
RTX 4070 Ti SUPER / 4080 (16GB)    16GB    Tight fit
RTX 3090 (24GB)                    24GB    Comfortable
RTX 4090 (24GB)                    24GB    Comfortable
RTX 5090 / A6000 (32GB+)           32GB    Comfortable

Recommendation: An alternative to the FP4 variant for 16-24 GB cards. Slightly higher quality than FP4, at the cost of a file 3.7 GB larger.
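The verdicts on this page can be expressed as a small lookup, shown here as a rough sketch. The function name vram_verdict is hypothetical, and the thresholds are an assumption inferred from the cards listed: the page only gives verdicts for 12, 16, 24 and 32 GB cards, so how a card between 16 and 24 GB would fare is a guess.

```python
def vram_verdict(vram_gb: float) -> str:
    """Map a GPU's VRAM (in GB) to the verdict wording used on this page.

    Assumed thresholds: below 16 GB is insufficient, 16 GB up to (but not
    including) 24 GB is a tight fit, and 24 GB or more is comfortable.
    """
    if vram_gb < 16:
        return "Insufficient VRAM"
    if vram_gb < 24:
        return "Tight fit"
    return "Comfortable"


if __name__ == "__main__":
    for gb in (12, 16, 24, 32):
        print(f"{gb} GB: {vram_verdict(gb)}")
```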

How to use gemma_3_12B_it_fp8_scaled.safetensors

1. Download the file from HuggingFace.
2. Place it in models/text_encoders/ inside your ComfyUI directory.
3. Restart ComfyUI (or refresh the model list from the menu).
4. Load a compatible workflow (see below).

Common issues

ComfyUI doesn't see the file after I downloaded it

Make sure the file is in ComfyUI/models/text_encoders/ (not a subfolder). Restart ComfyUI fully — the menu refresh sometimes misses new files. Filename must match exactly: gemma_3_12B_it_fp8_scaled.safetensors.
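The checks above (right folder, no subfolder, exact filename) can be automated with a short script. This is a minimal sketch, not part of ComfyUI: the helper name check_text_encoder is hypothetical, and it assumes the standard ComfyUI/models/text_encoders/ layout described on this page.

```python
from pathlib import Path

EXPECTED_NAME = "gemma_3_12B_it_fp8_scaled.safetensors"


def check_text_encoder(comfy_root, filename=EXPECTED_NAME):
    """Return (ok, message) describing whether the file is where ComfyUI expects it."""
    target_dir = Path(comfy_root) / "models" / "text_encoders"
    if not target_dir.is_dir():
        return False, f"Missing folder: {target_dir}"

    # Compare against the directory listing so the match is exact,
    # even on case-insensitive filesystems.
    top_level = [p.name for p in target_dir.iterdir() if p.is_file()]
    if filename in top_level:
        return True, f"Found {filename} in {target_dir}"

    # Same name but different letter case: the filename does not match exactly.
    for name in top_level:
        if name.lower() == filename.lower():
            return False, f"Filename mismatch: found '{name}', expected '{filename}'"

    # The file exists, but inside a subfolder of text_encoders/.
    for path in target_dir.rglob("*"):
        if path.is_file() and path.name.lower() == filename.lower():
            return False, f"File is in a subfolder: {path.parent}"

    return False, f"{filename} not found under {target_dir}"


if __name__ == "__main__":
    # "ComfyUI" is a placeholder; point this at your own install directory.
    ok, message = check_text_encoder("ComfyUI")
    print("OK" if ok else "PROBLEM", "-", message)
```

If the script reports a problem, fix the location or filename and then restart ComfyUI fully.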

CUDA out of memory error when loading the model

gemma_3_12B_it_fp8_scaled.safetensors needs ~16GB VRAM minimum. If you're hitting OOM:
• Enable Sequential Offloading in ComfyUI settings.
• Lower the resolution (768×512 instead of 1280×704). Both dimensions must be divisible by 32.
• Reduce the frame count (65 frames instead of 161). The count must be of the form 8n+1.
• Use a smaller variant (see Related models below).
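The two constraints mentioned above (dimensions divisible by 32, frame count of the form 8n+1) are easy to get wrong when picking smaller settings by hand. The sketch below rounds arbitrary values down to the nearest valid ones. The function names are hypothetical and this is not ComfyUI code; whether a single frame (n = 0) is accepted is not stated here, so treat very small results with caution.

```python
def snap_resolution(width: int, height: int, multiple: int = 32):
    """Round each dimension down to the nearest multiple of 32 (never below 32)."""
    def snap(value: int) -> int:
        return max(multiple, (value // multiple) * multiple)
    return snap(width), snap(height)


def snap_frame_count(frames: int) -> int:
    """Round a frame count down to the nearest value of the form 8n+1."""
    if frames < 1:
        raise ValueError("frame count must be at least 1")
    return ((frames - 1) // 8) * 8 + 1


if __name__ == "__main__":
    print(snap_resolution(1280, 720))  # (1280, 704)
    print(snap_resolution(768, 512))   # (768, 512), already valid
    print(snap_frame_count(100))       # 97
    print(snap_frame_count(65))        # 65, already valid
```

Rounding down rather than to the nearest value keeps memory use at or below what you asked for, which is the safer direction when working around an out-of-memory error.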

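How do I apply this LoRA in ComfyUI?

This file is a text encoder, not a LoRA, so it does not go in a 'LoraLoader' node. Select it in the text encoder loader node of your workflow instead. If you are adding a separate LoRA, load it in a 'LoraLoader' node connected after your model loader, pair it with the dev base model (not the distilled one), and start at strength 1.0, the trained value.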

Related models

fp8 · 25.2 GB · 16GB

ltx-2.3-22b-distilled-1.1_transformer_only_fp8_scaled.safetensors

lora · 2.74 GB · 16GB

ltx-2.3-22b-distilled-1.1_lora-dynamic_fro09_avg_rank_111_bf16.safetensors

lora · 662 MB · 16GB

ltx-2.3-22b-distilled-lora-1.1_fro90_ceil72_condsafe.safetensors

lora · 617 MB · 16GB

LTX-2.3-OmniNFT-RL-Lora_bf16.safetensors

fp8 · ~25 GB · 16GB

ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors

fp8 · 25 GB · 16GB

ltx-2.3-22b-distilled_transformer_only_fp8_input_scaled_v3.safetensors