Pixverse v6 API — Five Video Modes in One Endpoint
The Pixverse v6 API turns a prompt, an image, a pair of keyframes, or up to seven references into a 1 to 15 second clip at 360p–1080p. One endpoint covers text-to-video, image-to-video, first/last-frame transitions, multi-reference fusion, and video extend — with optional generated audio.
≤ 5000 chars · required in every mode
Default 540p
1 to 15 · default 5 · 5 or 8 only for first/last-frame
Default 16:9
0 to 2147483647 · default 0
≤ 2048 chars
Default off
Default off
normal
Default off
What you can build
Real workflows powered by this model.

Fuse multiple references into one shot with the Pixverse v6 API
Send 1 to 7 reference images and a prompt. The Pixverse v6 API fuses characters, outfits, and objects from each reference into a single coherent clip — keep a face from one image, a costume from another, and a prop from a third without a separate compositing pass.
Generate a clip
Smooth transitions between two frames with the Pixverse v6 API
Pass a first frame and a last frame and the Pixverse v6 API interpolates the motion between them. Stitch keyframes straight into a finished 5 or 8 second shot — useful for transformations, reveals, and product before/after spots.

Extend a clip into a continuous scene with the Pixverse v6 API
Pass the task ID of a finished Pixverse v6 video and a new prompt; the API continues the scene from where it ended. Chain extends to build a longer sequence one beat at a time, without re-rendering the whole shot.
Pricing
Credit-based — 1 credit = $0.001 USD. Pay only for completed generations.
| Category | Unit | Price |
|---|---|---|
| 360p | ||
| No audio | 1 second | $0.0184 19 credits |
| With audio | 1 second | $0.0276 28 credits |
| 540p | ||
| No audio | 1 second | $0.0276 28 credits |
| With audio | 1 second | $0.0368 37 credits |
| 720p | ||
| No audio | 1 second | $0.0368 37 credits |
| With audio | 1 second | $0.046 46 credits |
| 1080p | ||
| No audio | 1 second | $0.0736 74 credits |
| With audio | 1 second | $0.092 92 credits |
Why reAPI
Five modes, one endpoint
The Pixverse v6 API picks its mode from the fields you send. A prompt alone gives text-to-video; one image gives image-to-video; a first/last frame pair interpolates a transition; reference images trigger multi-reference fusion; a task ID extends an existing clip — all on the same /api/v1/videos/generations call, with the same authentication and the same task polling pattern.
Pricing by resolution × seconds
The Pixverse v6 API charges per second by resolution. Turning audio on raises the per-second rate; every other parameter is free. See current 360p / 540p / 720p / 1080p rates in the pricing table on this page. Failed jobs refund automatically.
Reference fusion and frame control
Beyond plain text-to-video, the Pixverse v6 API fuses up to seven reference images into one shot and interpolates between a first and last keyframe. That control lives in the request body — no extra endpoint, no model swap, no separate compositing step.
Pixverse v6 vs Runway Gen-3
Both turn text and images into short clips. Pixverse v6 leans into mode breadth — five input shapes on one endpoint, including multi-reference fusion and video extend — and per-second usage pricing. Here is how the two compare on publicly documented behavior.
Comparison reflects publicly documented behavior at the time of writing. Model behavior and pricing can change; check the pricing card above and the API docs for current values.
Ship the Pixverse v6 API in three steps
- step 01
Create an API key
Sign up and grab a key from the dashboard. Free credits cover your first Pixverse v6 API calls — no card required.
Open - step 02
Submit a video task
POST to /api/v1/videos/generations with model = pixverse-v6 and a prompt. The Pixverse v6 API returns a task ID immediately so your worker can move on.
Open - step 03
Poll the result
GET /api/v1/tasks/:id until status is completed. Download the Pixverse v6 API output and ship it.
Open
Frequently asked questions
Common questions about this model.
Related models
Explore more models in the same category.
Alibaba
Wan 2.7 Video
Text, image, reference video and editing in one Wan 2.7 Video API call — per-second pricing, 720P or 1080P.
xAI
Grok Imagine 1.0 Video
Text-to-video and image-to-video in a single Grok Imagine 1.0 Video API call.
Kuaishou
Kling Motion Control
Reference image plus reference video — controllable motion transfer in one Kling Motion Control API call.
curl -X POST https://reapi.ai/api/v1/videos/generations \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "pixverse-v6",
"prompt": "A neon-lit Tokyo alley at night, light rain, anamorphic flare",
"size": "21:9",
"resolution": "720p",
"duration": 8
}'Ready to ship?
Try it in the playground or grab an API key to integrate now.