Video ModelKuaishou

Kling 2.6

The first Kling model with simultaneous audio-visual generation — 1080p at 48fps with synchronized voices and sound effects.

Max Duration

10 seconds

Resolution

1080p

Aspect Ratios

16:9, 9:16, 1:1

Audio

Native audio generation

Create with Kling 2.6 View Pricing

What makes Kling 2.6 unique

Simultaneous audio-visual generation — eliminates the silent video + manual dubbing workflow

1080p at 48fps — smoother motion than most competitors

Supports speaking, dialogue, narration, singing, and rapping voice types

Motion control support for directing character actions and expressions

Strong image-to-video capabilities with accurate subject preservation

3D variational autoencoder with synchronized spatiotemporal compression

Best for

Social media clips with audioImage-to-video animationMotion control workflowsBatch content productionQuick video drafts

How Kling 2.6 works

Kling 2.6 uses a diffusion-based Transformer with a proprietary 3D variational autoencoder for synchronized spatiotemporal compression. It was the first Kling model to generate audio and video simultaneously, supporting sound effects, human voices, musical scores, and dialogue intrinsically matched to the visual output.

How to use Kling 2.6 on Flashloop

Select Kling 2.6

Open Flashloop and choose Kling 2.6 from the model selector in the video creator.

Enter your prompt

Describe what you want to see in your video — be as detailed as you like.

Generate & download

Hit generate and your video will be ready in seconds. Download or share directly.

Ready to create with Kling 2.6?

Start generating videos with Kling 2.6 on Flashloop — available on iOS, Android, and web.

Start Creating with Kling 2.6