Question 1

Runs but is no faster than BF16 on my RTX 3090

Accepted Answer

MXFP8 dequantizes to BF16 for the matmul itself — the speedup comes from halved memory bandwidth, not faster compute. On a 24 GB card with the full pipeline in memory, you're already mostly compute-bound, not memory-bound.

Fix: This is expected. The win on RTX 30-series is fitting the model in VRAM at all — a BF16 transformer would be ~44 GB and OOM on 24 GB without sequential offloading. Use the FP8 scaled file on RTX 40xx+ for actual compute speedup.

Question 2

ComfyUI 'Mismatched shapes' error when stacking with a LoRA

Accepted Answer

Some older LoRA loaders don't understand MXFP8 weight layout and try to apply LoRA deltas in the wrong dtype.

Fix: Update ComfyUI to a recent version (post-2026-04) and ensure you're using KJNodes if your workflow needs Kijai-specific loaders. Or apply the LoRA against the BF16 transformer instead and quantize after, if your trainer supports it.

Question 3

Black or noisy first frame, rest of video looks fine

Accepted Answer

Workflow loaded the MXFP8 file with a node configured for fp8_scaled or BF16 — internal scale tables aren't being applied to the first denoising step.

Fix: Use ComfyUI's standard CheckpointLoaderSimple or the Kijai LTXVideoModelLoader from KJNodes. Avoid custom loaders that hard-assume a specific dtype.

Question 4

ComfyUI doesn't see the file after I downloaded it

Accepted Answer

Make sure the file is in ComfyUI/models/checkpoints/ (not a subfolder). Restart ComfyUI fully — the menu refresh sometimes misses new files. Filename must match exactly: ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors.

Question 5

I get a CUDA error mentioning fp8 / scaled / matmul

Accepted Answer

FP8 scaled matmuls require an RTX 40-series GPU or newer (Ada Lovelace architecture). RTX 30-series and older cannot run FP8 weights at native precision. Use the BF16 variant instead, or the MXFP8 block-32 alternative.

Question 6

CUDA out of memory error when loading the model

Accepted Answer

ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors needs ~16GB VRAM minimum. If you're hitting OOM:
• Enable Sequential Offloading in ComfyUI settings
• Lower the resolution (768×512 instead of 1280×704) — both dimensions must be divisible by 32
• Reduce frame count (65 frames instead of 161) — must be 8n+1
• Use a smaller variant — see Related models below.

Question 7

Why does ComfyUI say it can't find ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors?

Accepted Answer

The workflow JSON references this file under a subdirectory prefix. Observed variants in published workflows include: ltx23\ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors, diffusion_models/ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors. Either create the matching subdirectory inside ComfyUI/models/checkpoints/ and place the file there, or edit the workflow JSON and remove the prefix so it references just ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors.

ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors

Download ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors

No 16GB GPU? Try ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors online — free generation included

Technical details

When to choose ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors

Will this run on my GPU?

How to use ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors

ComfyUI says it can't find ltx-2.3-22b-distilled-1.1_transformer_only_mxfp8_block32.safetensors?

Common issues

Get notified when LTX 2.3 Distilled 1.1 MXFP8 (Kijai) updates

Related models