AI Models

Choose from the latest AI video and image generation models. All available on Flashloop for iOS, Android, and web.

Video Models

Veo 3
VideoGoogle DeepMindGoogle DeepMind

Veo 3

Generate 8-second videos with native dialogue, sound effects, and ambient audio in one pass.

Up to 8 seconds·720p, 1080p
Kling 3.0
VideoKuaishouKuaishou

Kling 3.0

Create up to 15-second multi-shot videos with character consistency, 4K 60fps support, and bilingual audio.

Up to 15 seconds·Standard, Pro (1080p)
VideoKuaishouKuaishou

Kling 3.0 Turbo

A faster, more affordable Kling 3.0 — text-to-video and image-to-video at 720p or 1080p, up to 15 seconds.

Up to 15 seconds·720p, 1080p
Kling 2.6
VideoKuaishouKuaishou

Kling 2.6

Turn images into 10-second 1080p videos with 48fps motion control and native speaking, singing, or rapping audio.

Up to 10 seconds·1080p
Sora 2
VideoOpenAIOpenAI

Sora 2

Generate up to 30-second videos with strong physics, long scene continuity, and precise style control.

Up to 30 seconds·720p, 1080p
Seedance 2.0
VideoByteDanceByteDance

Seedance 2.0

ByteDance's video model with a Fast/High switch, native audio, up to 7 reference images, and director-level camera control.

Up to 15 seconds·480p, 720p, 1080p, 4K
VideoByteDanceByteDance

Seedance 2.0 Mini

ByteDance's faster, lower-cost Seedance — native audio, multi-reference, and 480p/720p output.

Up to 15 seconds·480p, 720p
Seedance 1.5 Pro
VideoByteDanceByteDance

Seedance 1.5 Pro

Generate 12-second videos with native audio, multilingual speech, and fast turnaround from 45 seconds to 3 minutes.

Up to 12 seconds·480p, 720p, 1080p
Wan 2.6
VideoAlibabaAlibaba

Wan 2.6

Generate 15-second videos with audio, strong character consistency, and a free tier on Flashloop.

Up to 15 seconds·720p
Grok Imagine
VideoxAIxAI

Grok Imagine

Generate 30-second videos with native audio, 4 instant variations, and fast renders in about 17 seconds.

Up to 30 seconds·480p, 720p
Gemini Omni
VideoGoogle DeepMindGoogle DeepMind

Gemini Omni

Multimodal video generation — text, up to 7 reference images, or a video clip. Native 4K output and up to 10-second clips.

Up to 10 seconds·720p, 1080p, 4K

Image Models