Showing 46 open source projects for "frame generation"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    Pixelle-Video

    Pixelle-Video

    AI Fully Automated Short Video Engine

    ...It focuses on enabling automated video creation workflows where visual content can be synthesized, edited, or enhanced through AI models. The project integrates different components of video processing, such as frame generation, transformation, and sequencing, into a unified pipeline. It is built to support experimentation with generative video models, making it useful for research and creative applications. The system emphasizes modularity, allowing developers to plug in different models or processing steps depending on the use case. It can be used for tasks such as content generation, video editing, or visual storytelling. ...
    Downloads: 50,277 This Week
    Last Update:
    See Project
  • 2
    lsfg-vk

    lsfg-vk

    Lossless Scaling Frame Generation on Linux

    The lsfg-vk project is a Vulkan layer developed primarily by PancakeTAS that hooks into Vulkan-based applications to enhance rendering by generating additional frames using the Lossless Scaling frame generation algorithm originally associated with the Lossless Scaling project. Instead of relying on driver-specific or hardware-accelerated upscaling, this layer intercepts Vulkan API calls and injects frame interpolation on the fly, effectively producing smoother motion in supported games and applications by creating synthetic intermediate frames. ...
    Downloads: 53 This Week
    Last Update:
    See Project
  • 3
    OptiScaler

    OptiScaler

    OptiScaler bridges upscaling/frame gen across GPUs

    ...The tool effectively acts as a compatibility layer between the game engine and multiple upscaling frameworks, enabling cross-GPU access to features that might otherwise be restricted to specific hardware ecosystems. In addition to replacing upscalers, OptiScaler can enable frame generation features in titles that do not officially support them, improving frame rates and perceived smoothness during gameplay.
    Downloads: 281 This Week
    Last Update:
    See Project
  • 4
    Oasis

    Oasis

    Inference script for Oasis 500M

    ...Instead of rendering a pre-built game world, the system produces the next visual state via a diffusion-transformer approach, effectively “imagining” the world response to your actions in real time. The project focuses on enabling action-conditional frame generation so developers can experiment with interactive, model-generated environments rather than static video generation alone. Because it’s an inference-focused repository, it’s especially useful as a practical reference for running the model, wiring inputs, and producing the autoregressive sequence of gameplay frames. It also serves as a research sandbox for people exploring how far interactive generative models can go with smaller, more accessible checkpoints compared to massive internal systems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    LTX-2.3

    LTX-2.3

    Official Python inference and LoRA trainer package

    LTX-2.3 is an open-source multimodal artificial intelligence foundation model developed by Lightricks for generating synchronized video and audio from prompts or other inputs. Unlike most earlier video generation systems that only produced silent clips, LTX-2 combines video and audio generation in a unified architecture capable of producing coherent audiovisual scenes. The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while...
    Downloads: 112 This Week
    Last Update:
    See Project
  • 6
    Agent Sprite Forge

    Agent Sprite Forge

    Agent Skill for generating 2D sprite sheets and map, transparent PNG

    ...The system supports multi-frame sprite generation, animation sequencing, and transparent background rendering for easier integration into game engines. Its architecture is designed around automation and repeatability, enabling developers to generate large batches of visual assets through structured prompt workflows. Overall, agent-sprite-forge acts as an AI-assisted creative tool for accelerating 2D game art production and experimentation.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    ComfyUI-WanVideoWrapper

    ComfyUI-WanVideoWrapper

    ComfyUI wrapper nodes for WanVideo and related models

    ...The project supports complex node-based pipelines where users can control sampling, conditioning, and frame continuity across generated sequences. It also enables extended video generation by linking outputs between iterations, allowing for longer and more coherent animations. Additionally, the wrapper often includes optimizations for performance, such as low VRAM configurations and multi-stage sampling strategies.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    HunyuanVideo-I2V

    HunyuanVideo-I2V

    A Customizable Image-to-Video Model based on HunyuanVideo

    HunyuanVideo-I2V is a customizable image-to-video generation framework from Tencent Hunyuan, built on their HunyuanVideo foundation. It extends video generation so that given a static reference image plus an optional prompt, it generates a video sequence that preserves the reference image’s identity (especially in the first frame) and allows stylized effects via LoRA adapters. The repository includes pretrained weights, inference and sampling scripts, training code for LoRA effects, and support for parallel inference via xDiT. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Positron

    Positron

    Positron, a next-generation data science IDE

    Positron is a next-generation integrated development environment (IDE) created by Posit PBC (formerly RStudio Inc) specifically tailored for data science workflows in Python, R, and multi-language ecosystems. It aims to unify exploratory data analysis, production code, and data-app authoring in a single environment so that data scientists move from “question → insight → application” without switching tools.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    HunyuanVideo-Foley

    HunyuanVideo-Foley

    Multimodal Diffusion with Representation Alignment

    ...Hybrid architecture combining multimodal transformer blocks and unimodal refinement blocks. Temporal alignment via frame-level synchronization modules (e.g. Synchformer).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Kotlin Dataframe

    Kotlin Dataframe

    Structured data processing in Kotlin

    Data frame is an abstraction for working with structured data. Essentially it’s a 2-dimensional table with labeled columns of potentially different types. You can think of it like a spreadsheet or SQL table, or a dictionary of series objects. The handiness of this abstraction is not in the table itself but in a set of operations defined on it. The Kotlin Dataframe library is an idiomatic Kotlin DSL defining such operations. The process of working with data frame is often called data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    LiveAvatar

    LiveAvatar

    Streaming Real-time Audio-Driven Avatar Generation

    LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system optimizations, such as block-wise autoregressive processing and fast sampling strategies, to deliver real-time frame rates (e.g., ~45 FPS on appropriate GPU clusters) while handling non-stop generation without quality degradation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    HyperFrames

    HyperFrames

    Write HTML. Render video. Built for agents

    HyperFrames is a framework developed by HeyGen that focuses on generating and managing structured, dynamic content experiences powered by AI. It introduces the concept of “frames,” which represent modular units of content that can be dynamically composed and rendered based on context or user interaction. The system is designed to enable highly interactive and personalized experiences, particularly in applications such as video generation, storytelling, and user interfaces. It supports...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14
    FFmpegAndroid

    FFmpegAndroid

    FFmpeg implements video cropping, watermarking, transcoding

    FFmpegAndroid is a comprehensive Android-focused multimedia development project that demonstrates how to integrate and use FFmpeg for advanced audio and video processing tasks. It provides a wide range of implementations including video editing, transcoding, watermarking, and GIF generation, all optimized for mobile environments. The project also covers real-time streaming capabilities such as local and live RTMP pushing using H264 encoding, making it suitable for building live broadcasting...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    StoryGen Atelier

    StoryGen Atelier

    AI-assisted storyboard and video generation tool

    StoryGen Atelier is an advanced creative tool that blends AI with visual storytelling, making it possible to generate fully structured storyboards and stitched videos from text prompts without requiring manual art or animation skills. Users begin with natural language descriptions of their story or scene, and the system uses state-of-the-art large models to generate both the script and corresponding frames. Once individual frames are created, a second AI model generates transition clips that...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Cowart

    Cowart

    Local infinite canvas plugin for Codex

    Cowart is a local infinite-canvas plugin designed specifically for Codex-powered creative workflows. Built on top of tldraw, it provides a visual workspace where users can brainstorm ideas, annotate images, generate new artwork, and iterate on existing visuals through AI-assisted editing. The canvas runs as a local web service and stores project data directly inside the user's workspace rather than in the plugin repository, making it suitable for project-centric workflows. Cowart enables...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 17
    Qwen-2.5-VL

    Qwen-2.5-VL

    Qwen2.5-VL is the multimodal large language model series

    Qwen2.5 is a series of large language models developed by the Qwen team at Alibaba Cloud, designed to enhance natural language understanding and generation across multiple languages. The models are available in various sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B parameters, catering to diverse computational requirements. Trained on a comprehensive dataset of up to 18 trillion tokens, Qwen2.5 models exhibit significant improvements in instruction following, long-text generation...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18
    HunyuanOCR

    HunyuanOCR

    OCR expert VLM powered by Hunyuan's native multimodal architecture

    ...It’s designed to unify the entire OCR pipeline, detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation, into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a wide variety of OCR tasks, outperforming many traditional OCR systems and even other multimodal models on benchmark suites. HunyuanOCR handles complex documents: multi-column layouts, tables, mathematical formulas, mixed languages, handwritten or stylized fonts, receipts, tickets, and even video-frame subtitles. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    i-Educar

    i-Educar

    Launching the most free educational software in Brazil

    Accessible from anywhere and with single student registration available for the entire education network. Time-saving for everyone. Get current quantitative, financial and statistical data on all processes, at the time and place you want. Evaluation system and reports adapted to the different realities of the country, with numerical, conceptual or descriptive evaluation notes. Management of allocations, removals, substitutions, absences and delays, offering an integrated view of all...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    ...Along these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter codebook dependencies for a given time step, while a large, 7B parameter Temporal Transformer models the temporal dependencies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    HunyuanVideo-I2V

    HunyuanVideo-I2V

    A Customizable Image-to-Video Model based on HunyuanVideo

    HunyuanVideo-I2V is a customizable image-to-video generation framework developed by Tencent, extending the capabilities of HunyuanVideo. It allows for high-quality video creation from still images, using PyTorch and providing pre-trained model weights, inference code, and customizable training options. The system includes a LoRA training code for adding special effects and enhancing video realism, aiming to offer versatile and scalable solutions for generating videos from static image inputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    PixClip

    PixClip

    Image & video editor with clarity, sharpness & fast FFmpeg export

    PixClip is a free, open-source desktop image and video editor built with Python and PySide6 (Qt6). It focuses on professional-grade clarity and sharpness: a bilateral-filter Clarity control boosts local contrast without halo artifacts, and an Edge Sharpness tool uses noise-thresholded unsharp masking so grain isn't amplified. Adjustments run in 32-bit floating point and LAB color space to avoid banding and keep natural hues. For video, PixClip applies clarity, sharpness, color...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    CuteGIF

    CuteGIF

    Convert video to GIF. Simple and fast

    CuteGIF is a lightweight graphical tool designed to create and edit animated GIFs from video files or image sequences with a focus on simplicity and accessibility. It provides an intuitive interface where users can trim clips, adjust frame rates, and export GIFs without needing command-line tools. The application emphasizes quick workflows, allowing users to generate shareable animations in just a few steps. It supports customization of output quality, resolution, and playback speed to balance file size and visual fidelity. CuteGIF is built for casual users and content creators who need fast GIF generation for social media or messaging. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    comFramework

    comFramework

    Framework for CAN communication interfaces including code generator

    comFramework presents a flexible, widely customizable CAN communication interface for embedded applications; it covers the CAN stack down to the hardware driver. Signal based application code is linked to the frame based HW abstraction layer. The interface imposes easy to fulfill requirements on the integration environment and will fit in most embedded platforms. A powerful code generator auto-codes those parts of the interface, which depend on your specific CAN network databases;...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    blackvideo-mini-player

    blackvideo-mini-player

    A standalone lightweight auxiliary CLI video player for BlackVideo.

    Lightweight cross-platform video player (Ada + SDL2 + FFmpeg). Support player for the BlackVideo. Works standalone via CLI or right-click on any video file. Usage Method 1 — Command Line Step 1. Unzip blackvideo-mini-player-v2.3.0.win.zip Step 2. Open the build\ folder, then type cmd directly in the address bar and press Enter — this opens a terminal already in that folder. Alternatively: open Command Prompt anywhere and use cd with the copied path: cd...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo