25 projects for "clips" with 2 filters applied:

  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    LTX-2.3

    LTX-2.3

    Official Python inference and LoRA trainer package

    ...This unified approach allows creators to generate complete multimedia sequences where motion, timing, and sound are aligned automatically. LTX-2 is designed for both research and production workflows and can generate high-resolution video clips with precise control over structure, motion, and camera behavior.
    Downloads: 103 This Week
    Last Update:
    See Project
  • 2
    VGGT

    VGGT

    [CVPR 2025 Best Paper Award] VGGT

    VGGT is a transformer-based framework aimed at unifying classic visual geometry tasks—such as depth estimation, camera pose recovery, point tracking, and correspondence—under a single model. Rather than training separate networks per task, it shares an encoder and leverages geometric heads/decoders to infer structure and motion from images or short clips. The design emphasizes consistent geometric reasoning: outputs from one head (e.g., correspondences or tracks) reinforce others (e.g., pose or depth), making the system more robust to challenging viewpoints and textures. The repo provides inference pipelines to estimate geometry from monocular inputs, stereo pairs, or brief sequences, together with evaluation harnesses for common geometry benchmarks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    CLIPS Rule Based Programming Language
    CLIPS is a forward-chaining rule-based programming language written in C that also provides procedural and object-oriented programming facilities.
    Leader badge
    Downloads: 1,456 This Week
    Last Update:
    See Project
  • 4
    OpenShorts

    OpenShorts

    Free & open source AI video platform

    ...It combines multiple tools into a single pipeline, including clip generation, AI-driven video creation, and YouTube optimization features. The system can transform long videos or uploaded files into short clips by detecting engaging moments, reframing content, and adding subtitles and visual effects. It also supports generating marketing videos using AI actors, voiceovers, and scripted narratives without requiring cameras or production resources. The platform integrates publishing capabilities, allowing users to distribute content directly to TikTok, Instagram, and YouTube. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    VoxCPM

    VoxCPM

    TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

    VoxCPM is a tokenizer-free text-to-speech system that models speech in a continuous space, aiming for extremely realistic, context-aware synthesis and true-to-life zero-shot voice cloning. Instead of converting speech into discrete tokens, it uses an end-to-end diffusion-autoregressive architecture built on the MiniCPM-4 backbone, combining hierarchical language modeling, finite scalar quantization (FSQ), and local Diffusion Transformers. This design helps decouple semantic and acoustic...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 6
    Short Video Factory

    Short Video Factory

    AI tool for automatic batch short video creation and editing

    Short Video Factory is an open source desktop application designed to simplify the creation of short-form videos using AI-driven automation. It enables users to generate product marketing clips and general content videos by combining simple prompt-based input with pre-prepared media assets. Short Video Factory integrates multiple stages of video production, including script generation, voice synthesis, video editing, and subtitle effects, into a single streamlined workflow. By leveraging AI technologies, it significantly reduces the manual effort required to produce high-quality short videos at scale. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    StoryGen Atelier

    StoryGen Atelier

    AI-assisted storyboard and video generation tool

    ...Users begin with natural language descriptions of their story or scene, and the system uses state-of-the-art large models to generate both the script and corresponding frames. Once individual frames are created, a second AI model generates transition clips that smoothly link the frames into a coherent short video sequence, and the tool then assembles everything into a finished video using standard video processing tools. This pipeline transforms abstract creative ideas into shareable visual narratives, offering a powerful workflow for writers, filmmakers, content creators, and educators alike.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AI YouTube Shorts Generator

    AI YouTube Shorts Generator

    A python tool that uses GPT-4, FFmpeg, and OpenCV

    AI-YouTube-Shorts-Generator is a Python-based tool that automates the creation of short-form vertical video clips (“shorts”) from longer source videos — ideal for adapting content for platforms like YouTube Shorts, Instagram Reels, or TikTok. It analyzes input video (whether a local file or a YouTube URL), transcribes audio (with optional GPU-accelerated speech-to-text), uses an AI model to identify the most compelling or engaging segments, and then crops/resizes the video and applies subtitle overlays, producing a polished short video without manual editing. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    vJEPA-2

    vJEPA-2

    PyTorch code and models for VJEPA2 self-supervised learning from video

    ...This objective encourages the model to learn semantics, motion, and long-range structure without the shortcuts that pixel-level losses can invite. The architecture is designed to scale: spatiotemporal ViT backbones, flexible masking schedules, and efficient sampling let it train on long clips while remaining stable. Trained representations transfer well to downstream tasks such as action recognition, temporal localization, and video retrieval, often with simple linear probes or light fine-tuning. The repository typically includes end-to-end recipes—data pipelines, augmentation policies, training scripts, and evaluation harnesses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Vidi2

    Vidi2

    Large Multimodal Models for Video Understanding and Editing

    Vidi is a family of large multimodal models developed for deep video understanding and editing tasks, integrating vision, audio, and language to allow sophisticated querying and manipulation of video content. It’s designed to process long-form, real-world videos and answer complex queries such as “when in this clip does X happen?” or “where in the frame is object Y during that moment?” — offering temporal retrieval, spatio-temporal grounding (i.e. locating objects over time + space), and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    MARS5

    MARS5

    MARS5 speech model (TTS) from CAMB.AI

    MARS5-TTS is CAMB.AI’s open-source English speech model designed for high-quality text-to-speech and voice emulation. It uses a two-stage architecture that combines an autoregressive (AR) model with a non-autoregressive (NAR) model, giving it both expressiveness and speed. The model is built to handle prosodically challenging content such as sports commentary, anime dialogue, and other high-energy or highly varied speech patterns with realistic rhythm and intonation. To control speaker...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Sora.FM

    Sora.FM

    Sora AI Video Generator by Sora.FM

    ...The project belongs to the growing class of “AI video generator / AI-assisted content creation” tools: it may use model-based generation, template-based editing, or combine video assets with generative models to automate parts of video creation or editing. For creators wanting to explore AI-based content generation — for example automated video clips, short-form media, or other generated video content — sorafm offers a starting point. As with many open-source generators in this space, the tradeoff lies in balancing ease-of-use and the limitations of generative output, but the fact that it’s publicly available means users can experiment, iterate, or fork to adapt pipelines: maybe customizing model prompts, video templates, or post-processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MiniMax-M1

    MiniMax-M1

    Open-weight, large-scale hybrid-attention reasoning model

    MiniMax-M1 is presented as the world’s first open-weight, large-scale hybrid-attention reasoning model, designed to push the frontier of long-context, tool-using, and deeply “thinking” language models. It is built on the MiniMax-Text-01 foundation and keeps the same massive parameter budget, but reworks the attention and training setup for better reasoning and test-time compute scaling. Architecturally, it combines Mixture-of-Experts layers with lightning attention, enabling the model to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    Riffusion

    Riffusion

    Real-time music generation using stable diffusion techniques AI

    Riffusion (hobby) is a Python-based open source library designed for real-time music and audio generation using stable diffusion techniques. Riffusion (hobby) works by generating and manipulating spectrogram images, which are then converted into playable audio clips, effectively bridging image-based diffusion models with sound synthesis. It implements a diffusion pipeline that supports prompt interpolation, allowing smooth transitions between different musical styles or prompts over time. Riffusion (hobby) serves as the core implementation for audio and image processing, providing essential building blocks for generating music from text prompts. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    clipsmm is a C++ interface to the CLIPS libray, a C library for developing expert systems.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    CAPE stands for Clips And Perl with Extensions. As its name suggests, CAPE is a combined programming environment which allows programs to be written in Clips, a forward-chaining rule-based system which was originally developed by NASA, and Perl.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Interface and example projects for CLIPS on NT using the CLIPS DLL. Includes Web/CGI interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PyCLIPS Python Module
    Python module to interface the CLIPS expert system shell library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Lisa is a production rule system for Common Lisp, whose purpose is to provide a foundation for the development of "intelligent" applications. Lisa employs a modern CLOS implementation of Rete and is based on CLIPS and Jess.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Port of CLIPS to C#
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    This project is an off-shoot of the EGADSS project. The nature of the changes made to the PD CLIPS code means a separate project would be best. This is an FOSS version of the PD CLIPS Engine to include such features as backwards chaining and more....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Multi-threaded version of the Popular CLIPS expert system shell. CLIPS is public-domain, this version is Open Source.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A C++ port (a redesign implementation actually) of the CLIPS expert system . This will be done along the lines of the Java port Jess, but full CLIPS backwards compatability will be maintained (even to the API level where possible).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    XTTS-v2

    XTTS-v2

    Multilingual voice cloning model with 6-second voice samples

    ...The model supports 17 languages and can perform cross-language voice cloning, meaning a voice recorded in one language can be used to synthesize speech in another while preserving speaker identity. XTTS-v2 improves on the original XTTS architecture with better speaker conditioning, support for multiple reference clips, improved prosody, enhanced audio quality, and greater inference stability. The model generates speech at a 24 kHz sampling rate and supports emotion and style transfer through voice cloning. It can be used entirely offline, supports both inference and fine-tuning, and is widely adopted for AI assistants, content creation, dubbing, accessibility tools, and multilingual voice applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo