How much does Grok Imagine Video 1.5 cost?

Grok Imagine Video 1.5 pricing is per second by resolution — 720p costs more per second than 480p — and the bill scales linearly with the duration parameter. See the current rate in the pricing table on this page.

Does Grok Imagine Video 1.5 need an input image?

Yes. Grok Imagine Video 1.5 is image-to-video: exactly one reference image is required, and a text prompt is optional to direct motion, camera, and atmosphere. Every image URL must be a public HTTP(S) link; data URIs and base64 are rejected at the gateway.

Does Grok Imagine Video 1.5 generate audio?

Yes — Grok Imagine Video 1.5 generates synchronized audio together with the video in one call, including dialogue, sound effects, and ambient sound. No separate audio step is required.

What resolutions and durations does Grok Imagine Video 1.5 support?

Grok Imagine Video 1.5 supports 480p and 720p output, with clip durations from 1 to 15 seconds (default 8). Aspect ratio defaults to auto, which follows the source image; you can also set 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, or 2:3.

How is Grok Imagine Video 1.5 different from Grok Imagine 1.0 Video?

Grok Imagine Video 1.5 focuses on image-to-video with native synchronized audio and more realistic motion, while Grok Imagine 1.0 Video covers both text-to-video and image-to-video but outputs silent clips. Both share the same submit-and-poll API pattern on reAPI.

Where are the full Grok Imagine Video 1.5 docs?

The full parameter list, error codes, and runnable cURL examples for Grok Imagine Video 1.5 live at /docs/grok-imagine-video-1-5.

Grok Imagine Video 1.5 — Image-to-Video with Native Audio

Grok Imagine Video 1.5 turns a single reference image into a realistic short clip with synchronized audio. xAI's image-to-video model delivers lifelike motion, strong prompt adherence, and consistent identity at 480p or 720p, 1 to 15 seconds.

Input

Prompt*

Reference Image 1 as identity lock. Create a gritty, ultrarealistic live-action 16:9 film montage, 9 shots, 0.2 seconds each, shot on Arri Alexa with 35mm anamorphic lenses. Use the exact same real photographed woman in every shot (mid-20s, focused eyes, short asymmetrical hair, identical face, body and silhouette throughout). Real skin texture, natural imperfections, realistic hair movement. No CGI, no AI look, no clones, no doubles, no morphing, no face changes. Practical stunts and practical effects only. Each shot is a different movie genre with different locations, costumes, lighting and framing, but always the same actress. Action highway chase in a sports car: "Let's go!" Documentary thriller pushing through a rainy protest crowd: "Keep filming." Romantic thriller close-up in blue emergency light: "Trust me." Spy film ballroom dance stealing a keycard: "Got it!" Sports drama sprinting onto a stadium field: "We're going all the way!" Horror film backing through a flashlight-lit farmhouse hallway: "Creepy." Biopic music performance singing into a microphone: "Baby, I'm yours!" Love film under fireworks reassuring a male guitarist: "Happy Fourth!" Adventure film on a city observation deck, wind in her hair: "I'm ready for my next role!" One continuous fast heroic montage score, hard cuts only, every shot looking like a different real movie while clearly featuring the exact same actress.

optional · up to 4096 chars

Reference image*

required · 1 image · public URL only

Aspect ratio

default auto

Resolution

default 480p

Duration (seconds)8

1 to 15 · default 8

Audio

default on

Result

Try one of these prompts

What you can build with this model

Real-world workflows and production use cases you can build and ship with this model.

Animate a still with Grok Imagine Video 1.5

Pass one reference image and an optional motion prompt. Grok Imagine Video 1.5 returns a 1 to 15 second clip with lifelike movement and synchronized audio — identity, face, and silhouette stay consistent from the first frame to the last.

Generate a clip

Short-form social video with native sound

Turn product shots, portraits, and posters into TikTok, Reels, and Shorts content. Grok Imagine Video 1.5 generates the motion and the audio together in one request, so there is no separate voice, music, or sound-effects pass to wire up.

Marketing and ad creatives at scale

Automate promo clips and campaign assets from a single source frame. Grok Imagine Video 1.5 keeps the subject on-model across shots and renders realistic lighting and camera moves, so batch output stays usable straight out of the call.

Pricing

Credit-based — 1 credit = $0.001 USD. Pay only for completed generations.

Category	Unit	Price
Official
480p	1 second	$0.02 20 credits
720p	1 second	$0.04 40 credits
Grok Imagine Video 1.5
480p	1 second	$0.084 84 credits
720p	1 second	$0.144 144 credits

Why reAPI

Image-to-video that holds identity

Grok Imagine Video 1.5 animates from one reference image while keeping face, body, and silhouette consistent across the clip. Real skin texture and natural motion, no morphing or clones — built for believable live-action results.

Native synchronized audio

Grok Imagine Video 1.5 generates video and audio together in a single workflow — dialogue, sound effects, ambient room tone, and music — so you skip the separate audio-generation and post-production steps entirely.

Simple per-second pricing

Grok Imagine Video 1.5 bills per second by resolution, with no subscription. The cost scales linearly with the duration parameter, and failed jobs refund automatically — you only pay for clips that succeed.

Grok Imagine Video 1.5 vs Seedance 2.0

Both turn images into short clips on reAPI. Grok Imagine Video 1.5 focuses on image-to-video realism with native synchronized audio; Seedance 2.0 spans more input modes and a higher resolution tier. Here is how the two compare on publicly documented behavior.

Capability

Grok Imagine Video 1.5 on reAPI

Seedance 2.0

Primary mode

Image-to-video from one reference image, with strong identity and motion adherence.

Text, image, first/last frame, and multi-modal reference inputs on one endpoint.

Native audio

Generates synchronized audio together with the video in a single call.

Optional generated audio track via a request flag.

Resolution range

480p and 720p, selectable per request.

480p, 720p, and 1080p.

Duration

1 to 15 seconds per clip.

4 to 15 seconds per clip.

Pricing model

Pay-as-you-go per second by resolution; no subscription required.

Pay-as-you-go per second by resolution and reference mode.

Integration

One REST endpoint, submit-and-poll, OpenAI-style API key — the same pattern as every reAPI model.

Same submit-and-poll pattern and API key on reAPI.

Comparison reflects publicly documented behavior at the time of writing. Model behavior and pricing can change; check the pricing card above and the API docs for current values.

Ship Grok Imagine Video 1.5 in three steps

step 01
Create an API key
Sign up and grab a key from the dashboard. Free credits cover your first Grok Imagine Video 1.5 calls — no card required.
Open
step 02
Submit a video task
POST to /api/v1/videos/generations with model = grok-imagine-video-1.5-beta and one image_urls entry. Grok Imagine Video 1.5 returns a task ID immediately.
Open
step 03
Poll the result
GET /api/v1/tasks/:id until status is completed, then download the Grok Imagine Video 1.5 output clip and ship it.
Open

Frequently asked questions

Common questions about this model.

Grok Imagine Video 1.5 is xAI's async image-to-video model on reAPI. It turns a single reference image into a realistic short clip with native synchronized audio, billed per second of output.

Related models

Explore more models in the same category.

View all models

xAI

Grok Imagine 1.0 Video

From $0.008 per second

Video

Vidu

Vidu Q3

From $0.037 per second

Video

Recommended

ByteDance

Seedance 2.0 Mini

From $0.029 per second

Video

Alibaba

Wan 2.7 Video

From $0.073 per second

Video

View all models

start building

Ready to ship?

Try it in the playground or grab an API key to integrate now.

Try Grok Imagine Video 1.5 View API docs

What you can build with this model

Real-world workflows and production use cases you can build and ship with this model.

Animate a still with Grok Imagine Video 1.5

Generate a clip