Generate Midjourney videos cleanly

Midjourney's video feature is image-to-video, not text-to-video. You feed it a starting frame (usually an MJ-generated image), optionally a text prompt, and it produces a 5-second clip. Then you can extend it up to 21 seconds total in 4-second chunks.

It is expensive compared to images and cheap compared to dedicated video models like Veo or Kling. Quality is mid-tier, better than the Higgsfield Free or Wan 2.5 baseline, worse than Veo 3.1. The killer feature is the identity continuity: a video extended from an MJ image preserves the original aesthetic in ways no other model matches today.

When to use Midjourney video, when not to

Use MJ video when:

The user already has a MJ image they love and wants to animate it
The user wants a short editorial clip (5–10 s) with strong stylistic continuity from a still
The user is on Standard+ plan and HD (720p) output is fine
Cost matters more than ultimate motion quality

Do NOT use MJ video when:

The user needs photoreal complex motion (use Veo 3.1 or Kling 3.0)
The user wants > 21 seconds in one piece (MJ caps at 21 s; chain a longer model instead)
The user wants text-to-video with no starting image (MJ is image-to-video only)
The user is on Basic plan and wants HD. Basic is SD-only

How to generate a video

There are two paths: midjourney.com (web) and Discord. The agent's output should target the user's platform.

Param	Purpose	Values
`--motion low`	Subtle camera + character movement (default)	flag
`--motion high`	Big camera moves, large character motion (more glitch risk)	flag
`--raw`	Reduce MJ's creative flair, follow prompt more literally	flag
`--loop`	Reuse the start frame as the end frame (creates a loop)	flag
`--end <URL>`	Use a different image as the end frame	URL
`--bs N`	Batch size, how many video variations to generate	1, 2, or 4 (default 4)
`--video`	Required in Discord when using a custom image URL	flag

Resolution	Batch 4 (default)	Batch 2	Batch 1
SD (480p)	8 GPU-min	4 GPU-min	2 GPU-min
HD (720p)	26 GPU-min	13 GPU-min	7 GPU-min

Starting AR	Video AR	SD pixels	HD pixels
1:1	1:1	624×624	960×960
4:3	~4:3	720×544	1104×832
2:3	2:3	512×768	784×1168
16:9	~16:9	832×464	1280×720
1:2	1:2	448×880	672×1360

Generate Midjourney videos cleanly

Generate Midjourney videos cleanly

When to use Midjourney video, when not to

How to generate a video

On midjourney.com (recommended)

In Discord

The video-specific parameters

GPU costs, read this before generating

Plan gating, also memorize

Extending videos

Looping and end frames

Motion settings, when to pick which

Aspect ratio is inherited

What to deliver to the user

Frequent failures and how to recover

Generate Midjourney videos cleanly

Generate Midjourney videos cleanly

When to use Midjourney video, when not to

How to generate a video

On midjourney.com (recommended)

In Discord

The video-specific parameters

GPU costs, read this before generating

Plan gating, also memorize

Extending videos

Looping and end frames

Motion settings, when to pick which

Aspect ratio is inherited

What to deliver to the user

Downloading + sharing

Frequent failures and how to recover