rreAPI
  • Models
  • Agent
  • Pricing
  • Blog
  • Docs
PlaygroundUse casesPricingFAQAPI
Home/Models/Gemini Omninew

Gemini Omni API — Google's Any-Input Video Model

The Gemini Omni API turns a prompt, a single image, or three reference images into a 4 to 10 second clip at 720p, 1080p, or 4K. One endpoint covers text-to-video, image-to-video, and three-image fusion — Google's newest video model, billed per generation.

Input

≤ 2000 chars · required

Default 720p

16:9 or 9:16 · default 16:9

Default 6 · ignored in reference-to-video mode

Result

Try one of these prompts

What you can build with this model

Real-world workflows and production use cases you can build and ship with this model.

Animate a single still with the Gemini Omni API

Pass one reference image and a motion prompt. The Gemini Omni API returns a 4 to 10 second clip from the same endpoint as your text-to-video calls — no model swap, no extra integration. Send a 1080p or 4K request when you want the result production-ready.

Generate a clip

Fuse three references in one Gemini Omni API call

Send three reference images alongside a prompt and the Gemini Omni API combines scene, character, and product into a single motion shot. Skip the storyboard, the masking, and the multi-pass compositing — three-image fusion is the most differentiated mode on the Gemini Omni API and ships from the same /api/v1/videos/generations endpoint as text-to-video.

Text-to-video at 4K via the Gemini Omni API

Describe the scene, pick 4K, and the Gemini Omni API returns a clip at the highest fidelity tier — useful for hero shots, social ads, and landing-page video. Audio is omitted in the reapi surface, so the result drops cleanly into any downstream editor.

Pricing

Credit-based — 1 credit = $0.001 USD. Pay only for completed generations.

CategoryUnitPrice
720p
4 seconds1 generation
$0.495
495 credits
6 seconds1 generation
$0.66
660 credits
8 seconds1 generation
$0.825
825 credits
10 seconds1 generation
$0.99
990 credits
1080p
4 seconds1 generation
$0.495
495 credits
6 seconds1 generation
$0.66
660 credits
8 seconds1 generation
$0.825
825 credits
10 seconds1 generation
$0.99
990 credits
4K
4 seconds1 generation
$1.155
1155 credits
6 seconds1 generation
$1.32
1320 credits
8 seconds1 generation
$1.485
1485 credits
10 seconds1 generation
$1.65
1650 credits
Reference 720p
per generation1 generation
$1.32
1320 credits
Reference 1080p
per generation1 generation
$1.32
1320 credits
Reference 4K
per generation1 generation
$1.98
1980 credits

Why reAPI

One endpoint, three input modes

The Gemini Omni API picks its mode from the count of image_urls you send. Zero gives you text-to-video, one gives image-to-video, three gives three-image fusion — all on the same /api/v1/videos/generations call, with the same authentication and the same task polling pattern. Two images is not supported; the Gemini Omni API will reject that combination at the gateway with a clear 400.

Per-generation pricing, no surprises

The Gemini Omni API charges per generation, not per second. 720p and 1080p share the same rate; only 4K is uplifted. See current per-tier rates in the pricing table on this page. Failed Gemini Omni API jobs refund automatically — your worker never pays for a result you didn't get.

Access without a Google Cloud account

Skip the Google Cloud onboarding, billing setup, and service-account dance. Sign up for reapi, grab a key, and you can call the Gemini Omni API in under a minute. Same model, same outputs — fewer hoops to ship.

Ship the Gemini Omni API in three steps

  1. 01
    step 01

    Create an API key

    Sign up and grab a key from the dashboard. Free credits cover your first Gemini Omni API calls — no card required.

    Open
  2. 02
    step 02

    Submit a video task

    POST to /api/v1/videos/generations with model = gemini-omni. The Gemini Omni API returns a task ID immediately so your worker can move on.

    Open
  3. 03
    step 03

    Poll the result

    GET /api/v1/tasks/:id until status is completed. Download the Gemini Omni API output and ship it.

    Open

Frequently asked questions

Common questions about this model.

Gemini Omni is Google DeepMind's any-to-any multimodal model family announced at Google I/O 26. The Gemini Omni API in reapi is the video-generation surface of that family — submit a prompt and optionally up to three reference images, and the Gemini Omni API returns a 4 to 10 second clip at 720p, 1080p, or 4K. One endpoint covers text-to-video, image-to-video, and three-image fusion.

Related models

Explore more models in the same category.

View all models
Video

Google

VEO 3.1

Veo 3.1 in five channels — audio, 4K, and 15-second remix in one API.

From $0.092 per generation
VideoRecommended

ByteDance

Seedance 2.0

Text/image/audio-to-video — 4 variants, per-second pricing.

From $0.037 per second
VideoRecommended

ByteDance

Seedance 2.5

Next-gen text/image/audio-to-video from ByteDance — coming soon.

Coming soon
Video

Alibaba Cloud Bailian

Happy Horse 1.0

Text, image, reference video, and video edit — one Happy Horse 1.0 API call.

From $0.146 per second
View all models
start building

Ready to ship?

Try it in the playground or grab an API key to integrate now.

Try Gemini OmniView API docs
rreAPI

reAPI is the AI API aggregator with sub-second failover, zero request logging, and one OpenAI-compatible endpoint for every top model.

GitHubX (Twitter)
Built withLogo of reAPIreAPI
Featured on There's An AI For ThatFeatured on Findly.toolsFazier badgeDang.ai
ai tools code.market
Featured on Twelve Tools
Image
  • GPT Image 2
  • Gemini 3 Pro Image
  • Gemini 3.1 Flash Image
  • Gemini 2.5 Flash Image
  • Seedream 5.0 Lite
  • Imagen 4.0
  • Wan 2.7 Image
Video
  • Seedance 2.0
  • Happy Horse 1.0
  • Vidu Q3
  • Pixverse v6
  • Grok Imagine 1.0
  • VEO 3.1
  • Gemini Omni
  • Wan 2.7 Video
  • Kling Motion Control
LLM
  • Claude Opus 4.8
  • Claude Opus 4.7
  • Claude Sonnet 4.6
  • DeepSeek V4
  • GPT-5.4
  • GPT-5.5
Audio
  • Mureka V9
  • Vocal Remover
  • Music Extractor
  • Voice Cleaner
  • Multistem Splitter
  • Voice Changer
Text
  • AI Humanizer
  • AI Text Detector
Tools
  • Enhance Video 1.0
·······
© 2026 reAPI. All Rights Reserved.[email protected]
AboutContactChangelogCookie PolicyPrivacy PolicyTerms of Service