Kling O1
Kling O1 is a generative AI platform that transforms text, images, or videos into high-quality video content, combining video generation and video editing into a unified workflow. It supports multiple input modalities (text-to-video, image-to-video, and video editing) and offers a suite of models, including the latest “Video O1 / Kling O1”, that allow users to generate, remix, or edit clips using prompts in natural language. The new model enables tasks such as removing objects across an entire clip (without manual masking or frame-by-frame editing), restyling, and seamlessly integrating different media types (text, image, video) for flexible creative production. Kling AI emphasizes fluid motion, realistic lighting, cinematic quality visuals, and accurate prompt adherence, so actions, camera movement, and scene transitions follow user instructions closely.
Learn more
Ray2
Ray2 is a large-scale video generative model capable of creating realistic visuals with natural, coherent motion. It has a strong understanding of text instructions and can take images and video as input. Ray2 exhibits advanced capabilities as a result of being trained on Luma’s new multi-modal architecture scaled to 10x compute of Ray1. Ray2 marks the beginning of a new generation of video models capable of producing fast coherent motion, ultra-realistic details, and logical event sequences. This increases the success rate of usable generations and makes videos generated by Ray2 substantially more production-ready. Text-to-video generation is available in Ray2 now, with image-to-video, video-to-video, and editing capabilities coming soon. Ray2 brings a whole new level of motion fidelity. Smooth, cinematic, and jaw-dropping, transform your vision into reality. Tell your story with stunning, cinematic visuals. Ray2 lets you craft breathtaking scenes with precise camera movements.
Learn more
Kling 3.0 Omni
Kling 3.0 Omni model is a generative video system designed to create imaginative videos from text prompts, images, or reference materials using advanced multimodal AI technology. It allows users to generate continuous video clips with flexible durations ranging from approximately 3 to 15 seconds, enabling short cinematic scenes that respond closely to prompt instructions. It supports prompt-based video generation as well as reference-based workflows, where users provide images or other visual elements to guide the subject, style, or composition of the generated scene. It improves prompt adherence and subject consistency, allowing characters, objects, and environments to remain stable throughout the generated clip while maintaining realistic motion and visual coherence. The Omni model also enhances reference-based generation so that characters or elements introduced through images remain recognizable across frames.
Learn more
Kling 2.5
Kling 2.5 is an AI video generation model designed to create high-quality visuals from text or image inputs. It focuses on producing detailed, cinematic video output with smooth motion and strong visual coherence. Kling 2.5 generates silent visuals, allowing creators to add voiceovers, sound effects, and music separately for full creative control. The model supports both text-to-video and image-to-video workflows for flexible content creation. Kling 2.5 excels at scene composition, camera movement, and visual storytelling. It enables creators to bring ideas to life quickly without complex editing tools. Kling 2.5 serves as a powerful foundation for visually rich AI-generated video content.
Learn more