Showing 47 open source projects for "model-builder"

  • 1
    VMZ (Video Model Zoo)

    VMZ: Model Zoo for Video Modeling

    ...It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal integration strategies that influenced modern architectures like SlowFast and X3D.
    Downloads: 0 This Week
  • 2
    SeedVR2 Upscaler ComfyUI

    Official SeedVR2 Video Upscaler for ComfyUI

    ComfyUI-SeedVR2 Video Upscaler is an open-source integration node for the ComfyUI workflow environment that brings the advanced SeedVR2 video upscaling and restoration model directly into visual AI pipelines. This project packages the SeedVR2 architecture as a custom node for ComfyUI, letting users upscale low-resolution video or imagery inside a node-based interface without needing to write code manually. The underlying SeedVR2 model is known for delivering high-quality video enhancement with strong temporal consistency and improved detail preservation by using diffusion-based techniques that are trained specifically on video sequences. ...
    Downloads: 31 This Week
  • 3
    Rsmpeg

    A Rust crate that exposes FFmpeg's power as much as possible

    ...It allows developers to interact directly with FFmpeg libraries for tasks such as encoding, decoding, and stream manipulation. The library is designed to balance performance with safety by leveraging Rust’s ownership model and memory management features. It supports handling of containers, codecs, and frames, enabling fine-grained control over media workflows. rsmpeg is suitable for building high-performance multimedia applications that require precise control over processing pipelines. It also integrates well with other Rust libraries, enabling complex system development. ...
    Downloads: 0 This Week
  • 4
    MeshLab

    The open source mesh processing system

    MeshLab is an open-source, portable, and extensible system for processing and editing large unstructured 3D triangular meshes. It aims to help with processing the typical not-so-small unstructured models that arise in 3D scanning, providing a set of tools for editing, cleaning, healing, inspecting, rendering, and converting such meshes. MeshLab is mostly based on the open source C++ mesh processing library VCGlib, developed at the Visual Computing Lab of ISTI - CNR. VCG can...
    Downloads: 30 This Week
  • 5
    ZLMediaKit

    WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket

    ...The project is designed for protocol conversion, live streaming, low-latency playback, recording, forwarding, and large-scale client access. Its asynchronous, multithreaded network model makes it suitable for commercial streaming deployments with many concurrent connections. ZLMediaKit can be deployed directly as a complete media server or embedded through its standard C API as an SDK for other languages and systems. It also provides REST APIs and webhooks so businesses can integrate authentication, event handling, and custom streaming logic.
    Downloads: 10 This Week
  • 6
    Segmentation Models

    Segmentation models with pretrained backbones. PyTorch

    ...Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). This is not necessary if you train the whole model rather than only the decoder. PyTorch Image Models (a.k.a. timm) offers many pretrained models and an interface that lets them be used as encoders in smp; however, not all models are supported. The input channels parameter lets you create models that process tensors with an arbitrary number of channels.
    Downloads: 0 This Week
  • 7
    Real-ESRGAN GUI

    Cross-platform GUI for image upscaler Real-ESRGAN

    ...According to actual measurements, the arm64 single-architecture build performs better than the universal2 dual-architecture build on Macs with Apple chips, so Apple chip users are advised to package an arm64 single-architecture application themselves. Real-ESRGAN can only enlarge the input image by a fixed 2-4x magnification (depending on the selected model). Other scale factors are obtained by calling Real-ESRGAN multiple times and then downsampling with a conventional scaling algorithm. For GIFs, each frame is split out and its duration recorded; the frames are upscaled one by one and then merged. Drag an image file or directory anywhere in the window to set its path as the input automatically.
    Downloads: 94 This Week
  • 8
    Transcoder

    Hardware-accelerated video transcoding using Android MediaCodec APIs

    Transcoder by DeepMedia is an AI-powered video-to-video speech translation engine that enables fully automated multilingual dubbing. Unlike traditional speech translation systems that rely on multi-stage pipelines, Transcoder directly translates one speaker’s video into another language while preserving facial expressions, lip-sync, and vocal identity. Designed for real-time use and production-grade pipelines, Transcoder combines advanced deep learning models with GPU acceleration to deliver...
    Downloads: 1 This Week
  • 9
    Stremio Core

    Types, addon system, UI models, core logic

    Stremio Core is the Rust engine that powers Stremio’s apps by centralizing all reusable logic behind discovery, catalogs, metadata, streams, add-ons, and user/library state. It exposes a clean set of modules—types, addon_transport, and state_types—so apps can talk to add-ons, model UI state, and react to events without duplicating code. The architecture is inspired by Elm: immutable state, message-driven updates, and explicit side-effects (“effects”) keep behavior predictable and testable. An environment abstraction lets hosts provide implementations for things like storage, networking, and timers, making the core portable to desktop, mobile, and web. ...
    Downloads: 1 This Week
  • 10
    python-socketio

    Python Socket.IO server and client

    python-socketio is a robust Python library that implements the Socket.IO protocol, enabling real-time, bidirectional communication between web clients and servers. It works with multiple asynchronous frameworks such as asyncio, eventlet, and gevent, so developers can choose the concurrency model that best fits their application needs while still using a consistent API. The library provides both server and client implementations, allowing Python applications to serve and communicate with browser clients or other Socket.IO clients in real time. It supports key Socket.IO features like event handling, message broadcasting, binary data transmission, rooms, and namespaces, giving developers the building blocks to create chat applications, live dashboards, multiplayer games, and collaborative tools.
    Downloads: 0 This Week
  • 11
    Live API Web Console

    A react-based starter app for using the Live API over websockets

    ...Configuration lives in a simple .env file and the project boots with standard web tooling, letting you experiment quickly with models, system prompts, and tool declarations. It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. Under the hood there’s an event-emitting WebSocket client, an audio in/out processing layer, and a minimal scaffolded view so you can focus on your app logic rather than wiring.
    Downloads: 0 This Week
  • 12
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 4 This Week
  • 13

    Ultimate Media Downloader

    An Open source media downloader for downloading videos and audios

    ...Whether you're downloading a single YouTube video, extracting audio from Spotify playlists, archiving TikTok content, or batch-processing entire music libraries, UMD handles it all with elegance and efficiency. Its key features: 1. Unified interface: one command, 1000+ platforms. No tool shopping, no mental model switching. 2. Production-ready, zero-friction installation: most users go from hearing about the tool to downloading content in under 5 minutes. 3. Active maintenance: Codeberg hosting (after GitHub suspension) demonstrates commitment to long-term availability.
    Downloads: 0 This Week
  • 14
    Golang HLS Streamer

    A server that exposes a directory for video streaming

    ...It provides tools for creating and managing HLS streams, including segmenting video into smaller chunks and generating M3U8 playlists. The project is intended for developers building streaming servers or media delivery systems. It focuses on performance and simplicity, leveraging Go’s concurrency model to handle streaming tasks efficiently. gohls can be integrated into backend services to enable adaptive streaming workflows. It supports real-time processing and can work alongside tools like FFmpeg for encoding. Overall, it provides a lightweight solution for implementing HLS streaming pipelines.
    Downloads: 0 This Week
  • 15
    AI Models

    A repository of trained models

    All models (at least currently) are supported by chaiNNer, an upscaling GUI that allows for both very simple and very complex tasks to be completed in a nice manner where you "chain" nodes together. Highly recommended for images. If you're looking to upscale videos using the models then use enhancr simply due to the fact that it supports TensorRT, which will allow you to upscale videos at incredible speeds! The GUI is one of the best looking applications out there and is personally my go to...
    Downloads: 3 This Week
  • 16
    Video Pre-Training

    Learning to Act by Watching Unlabeled Online Videos

    The Video PreTraining (VPT) repository provides code and model artifacts for a project where agents learn to act by watching human gameplay videos—specifically, gameplay of Minecraft—using behavioral cloning. The idea is to learn general priors of control from large-scale, unlabeled video data, and then optionally fine-tune those priors for more goal-directed behavior via environment interaction.
    Downloads: 0 This Week
  • 17
    G2SConverter

    Convert models from GoldSource engine to Source engine with AI

    ...An example of a processed texture is shown in the following image (parameters used: scaling-factor = 4 and deblur iterations = 4). Besides upscaling and deblurring, the utility also generates normal maps for each texture. This is implemented using the DeepBump model by HugoTiny. Examples of normal maps are shown in the following images.
    Downloads: 0 This Week
  • 18
    VSGAN

    VapourSynth Single Image Super-Resolution Generative Adversarial

    Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry! The Network will be applied in quadrants of the image to reduce up-front VRAM usage. You can use any RGB video input, including float32 (e.g., RGBS) inputs. Using VapourSynth you can pass a Video directly to VSGAN, without any frame extraction needed. Any edit you make in the VapourSynth script with or without VSGAN can be re-used for any other video. ...
    Downloads: 0 This Week
  • 19
    Easy Upscale

    A simple image upscaler application using EDSR, ESPCN, FSRCNN, etc.

    ...The main theme is queues: circular queues are used to pool and store the list of images to be upscaled. The GUI is built by hand with the tkinter library. The upscaling itself uses the OpenCV library with a model obtained from open source. Checked using vermin: the minimum required Python version is 3.6, and Python 2 is incompatible.
    Downloads: 0 This Week
  • 20
    Robust Video Matting (RVM)

    Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX

    We introduce a robust, real-time, high-resolution human video matting method that achieves new state-of-the-art performance. Our method is much lighter than previous approaches and can process 4K at 76 FPS and HD at 104 FPS on an Nvidia GTX 1080Ti GPU. Unlike most existing methods that perform video matting frame-by-frame as independent images, our method uses a recurrent architecture to exploit temporal information in videos and achieves significant improvements in temporal coherence and...
    Downloads: 10 This Week
  • 21
    TimeSformer

    The official pytorch implementation of our paper

    TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch provides configurations, pretrained models, and training scripts that make it straightforward to evaluate or fine-tune on video datasets. ...
    Downloads: 0 This Week
  • 22
    AviSynth AiUpscale v1.2.0

    AviSynth+ implementation of Super-Resolution Convolutional Neural

    ...The AiUpscale models used for all datasets were the "Photo" models, except for the Manga109 dataset for which the "LineArt" models were used. In the same way, the Waifu2x cunet model was used for the Manga109 dataset, and the upconv_7 model for the rest.
    Downloads: 0 This Week
  • 23
    Render32

    Command-line video compositing and audio mixing tools

    Render is a program for creating composite BMP image sequences. These images are composited as specified in a text configuration file. Mixer is a program for mixing film soundtracks. It accepts input files in WAV format and outputs a mixed soundtrack in WAV format. Each input channel can contain one or more audio files that are edited and mixed using a cue sheet. The maximum number of channels is a compile-time parameter.
    Downloads: 0 This Week
  • 24
    Consistent Depth

    We estimate dense, flicker-free, geometrically consistent depth

    ...The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a specific input video, ensuring stable and realistic depth maps even in less-constrained regions. This approach achieves improved geometric consistency and visual stability compared to prior monocular reconstruction methods. The project can process challenging hand-held video footage, including those with moderate dynamic motion, making it practical for real-world usage.
    Downloads: 3 This Week
  • 25
    Talkie

    Talking 3D character for your stream

    Full source code and 3D model for a talking 3D character; tweak as you need. Useful for streaming/videos when you don't want to show your face. Uses your mic input to move the mouth. You can provide your own 3D model with a jaw bone. This is not intended for end users. No executable is provided. You will need to have a 3D head modeled, with armature attached and a labeled jaw bone (unless you just want to use the default green sphere).
    Downloads: 0 This Week