Tutorials

LTX-2 Prompting Guide: Mastering Motion and Camera Control

Practical field notes on crafting effective prompts for LTX-2, focusing on motion verbs, camera movements, and constraints to reduce artifacts and improve video stability.

By ltx workflow

Editor's Note: This guide shares hands-on experience with LTX-2 prompting techniques, offering practical tips for controlling motion, camera movements, and reducing common artifacts like jitter and frame inconsistencies.

How LTX-2 "reads" prompts

After extensive testing with LTX-2, several key patterns emerged:

  • LTX-2 prioritizes structure first, then style. Clear subject, action, and constraints help lock in composition better. Style words (cinematic, dreamy) decorate what's already defined.
  • It favors explicit motion verbs: "Rotate," "pan," "dolly," "track," "zoom," "tilt." Concrete moves like "slow dolly-in" produce more stable results than vague language.
  • One sentence per idea helps. Long, complex prompts can produce drifting and odd transitions. Short clauses lead to cleaner motion planning.
  • Order matters. Better results come from this sequence: Subject → Action → Camera → Lighting → Lens → Constraints → Negatives.

Think of the model like a careful cinematographer: it can do a lot, but it works best with a clear shot list.

Motion verbs & constraints

Clear motion verbs can significantly reduce wobble and artifacts. Testing shows that explicit verbs improve stability.

Motion verbs that work well:

  • Dolly-in / dolly-out (smooth depth changes)
  • Slow pan left/right (gentle lateral movement)
  • Track forward/back (follow movement, less jitter than handheld)
  • Subtle tilt up/down (pairs well with reveals)
  • Static lock-off (when you want crisp detail)

Helpful constraints:

  • Speed: "slow," "gentle," or "0.5x speed"
  • Duration hints: "5-second shot," "8-second loop"
  • Composition locks: "center-framed," "profile view," "wide establishing shot," "macro close-up"

Example that works well: "Center-framed subject, slow dolly-in, 5-second shot, gentle camera, keep background stable."

Combining too many movements (fast pan + fast zoom + handheld) tends to increase artifacts. One clean move beats three messy ones.

Camera prompts (pan/zoom/dolly/handheld)

Camera terminology makes a significant difference:

  • Pan: "slow pan right across the skyline" produces straighter horizons than "move across the skyline."
  • Zoom: Digital zoom cues can hunt focus. "Dolly-in" often gives better results when you want that zoom feel, unless you explicitly need a zoom aesthetic.
  • Dolly/Track: Best choice for cinematic reveals. "Slow dolly-in toward the subject" creates emotional intimacy and maintains geometric consistency.

Sources

#ltx-2#prompting#camera-control#motion#video-generation