Kling 2.6
The first Kling model with simultaneous audio-visual generation — 1080p at 48fps with synchronized voices and sound effects.
Max Duration
10 seconds
Resolution
1080p
Aspect Ratios
16:9, 9:16, 1:1
Audio
Native audio generation
What makes Kling 2.6 unique
Simultaneous audio-visual generation — eliminates the silent video + manual dubbing workflow
1080p at 48fps — smoother motion than most competitors
Supports speaking, dialogue, narration, singing, and rapping voice types
Motion control support for directing character actions and expressions
Strong image-to-video capabilities with accurate subject preservation
3D variational autoencoder with synchronized spatiotemporal compression
Best for
How Kling 2.6 works
Kling 2.6 uses a diffusion-based Transformer with a proprietary 3D variational autoencoder for synchronized spatiotemporal compression. It was the first Kling model to generate audio and video simultaneously, supporting sound effects, human voices, musical scores, and dialogue intrinsically matched to the visual output.
How to use Kling 2.6 on Flashloop
Select Kling 2.6
Open Flashloop and choose Kling 2.6 from the model selector in the video creator.
Enter your prompt
Describe what you want to see in your video — be as detailed as you like.
Generate & download
Hit generate and your video will be ready in seconds. Download or share directly.
Ready to create with Kling 2.6?
Start generating videos with Kling 2.6 on Flashloop — available on iOS, Android, and web.
Start Creating with Kling 2.6