Showing 61 open source projects for "model-builder"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    VMZ (Video Model Zoo)

    VMZ (Video Model Zoo)

    VMZ: Model Zoo for Video Modeling

    ...It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal integration strategies that influenced modern architectures like SlowFast and X3D.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    SeedVR2 Upscaler ComfyUI

    SeedVR2 Upscaler ComfyUI

    Official SeedVR2 Video Upscaler for ComfyUI

    ComfyUI-SeedVR2 Video Upscaler is an open-source integration node for the ComfyUI workflow environment that brings the advanced SeedVR2 video upscaling and restoration model directly into visual AI pipelines. This project packages the SeedVR2 architecture as a custom node for ComfyUI, letting users upscale low-resolution video or imagery inside a node-based interface without needing to write code manually. The underlying SeedVR2 model is known for delivering high-quality video enhancement with strong temporal consistency and improved detail preservation by using diffusion-based techniques that are trained specifically on video sequences. ...
    Downloads: 31 This Week
    Last Update:
    See Project
  • 3
    MediaPipe

    MediaPipe

    Cross-platform, customizable ML solutions for live and streaming media

    ...It employs machine learning (ML) to infer the 3D surface geometry, requiring only a single camera input without the need for a dedicated depth sensor. Utilizing lightweight model architectures together with GPU acceleration throughout the pipeline, the solution delivers real-time performance-critical for live experiences. Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control. For example, it can form the basis for yoga, dance, and fitness applications. ...
    Downloads: 71 This Week
    Last Update:
    See Project
  • 4
    Video-subtitle-extractor

    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitle (hardsub) from videos

    ...Support GPU acceleration, after GPU acceleration, you can get higher accuracy and faster extraction speed. (CLI version) No need for users to manually set the subtitle area, the project automatically detects the subtitle area through the text detection model. Filter the text in the non-subtitle area and remove the watermark (station logo) text.
    Downloads: 52 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 5
    MeshLab

    MeshLab

    The open source mesh processing system

    MeshLab is an open-source, portable, and extensible system for the processing and editing of unstructured large 3D triangular meshes. It is aimed to help the processing of the typical not-so-small unstructured models arising in 3D scanning, providing a set of tools for editing, cleaning, healing, inspecting, rendering and converting this kind of meshes. MeshLab is mostly based on the open source C++ mesh processing library VCGlib developed at the Visual Computing Lab of ISTI - CNR. VCG can...
    Downloads: 30 This Week
    Last Update:
    See Project
  • 6
    Rsmpeg

    Rsmpeg

    A Rust crate that exposes FFmpeg's power as much as possible

    ...It allows developers to interact directly with FFmpeg libraries for tasks such as encoding, decoding, and stream manipulation. The library is designed to balance performance with safety by leveraging Rust’s ownership model and memory management features. It supports handling of containers, codecs, and frames, enabling fine-grained control over media workflows. rsmpeg is suitable for building high-performance multimedia applications that require precise control over processing pipelines. It also integrates well with other Rust libraries, enabling complex system development. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ZLMediaKit

    ZLMediaKit

    WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket

    ...The project is designed for protocol conversion, live streaming, low-latency playback, recording, forwarding, and large-scale client access. Its asynchronous, multithreaded network model makes it suitable for commercial streaming deployments with many concurrent connections. ZLMediaKit can be deployed directly as a complete media server or embedded through its standard C API as an SDK for other languages and systems. It also provides REST APIs and webhooks so businesses can integrate authentication, event handling, and custom streaming logic.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    nunif

    nunif

    Misc; latest version of waifu2x; 2D video to stereo 3D video

    nunif is a deep learning–based image processing framework focused on image upscaling, restoration, denoising, and enhancement tasks using neural network models. The project provides a collection of AI-powered utilities designed primarily for anime-style artwork, illustrations, and high-quality image restoration workflows. It includes command-line tools and graphical interfaces for applying trained neural models to improve image resolution and visual clarity while minimizing artifacts. nunif...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    Real-ESRGAN GUI

    Real-ESRGAN GUI

    Cross-platform GUI for image upscaler Real-ESRGAN

    ...According to actual measurements, arm64the single-architecture performance is better than universal2the dual- architecture Mac on the Apple chip, so Apple chip users are advised to pack arm64single-architecture applications by themselves. Real-ESRGAN can only enlarge the input image with a fixed 2-4x magnification (related to the selected model). This functionality is achieved by downsampling using a conventional scaling algorithm after multiple calls to Real-ESRGAN. Split each frame of the GIF and record the duration, zoom in one by one and then merge. Drag an image file or directory to any position in the window, and its path can be automatically set as the input.
    Downloads: 94 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    Segmentation Models

    Segmentation Models

    Segmentation models with pretrained backbones. PyTorch

    ...Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). It is not necessary in case you train the whole model, not only the decoder. Pytorch Image Models (a.k.a. timm) has a lot of pretrained models and interface which allows using these models as encoders in smp, however, not all models are supported. Input channels parameter allows you to create models, which process tensors with an arbitrary number of channels.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Transcoder

    Transcoder

    Hardware-accelerated video transcoding using Android MediaCodec APIs

    Transcoder by DeepMedia is an AI-powered video-to-video speech translation engine that enables fully automated multilingual dubbing. Unlike traditional speech translation systems that rely on multi-stage pipelines, Transcoder directly translates one speaker’s video into another language while preserving facial expressions, lip-sync, and vocal identity. Designed for real-time use and production-grade pipelines, Transcoder combines advanced deep learning models with GPU acceleration to deliver...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Stremio Core

    Stremio Core

    Types, addon system, UI models, core logic

    Stremio Core is the Rust engine that powers Stremio’s apps by centralizing all reusable logic behind discovery, catalogs, metadata, streams, add-ons, and user/library state. It exposes a clean set of modules—types, addon_transport, and state_types—so apps can talk to add-ons, model UI state, and react to events without duplicating code. The architecture is inspired by Elm: immutable state, message-driven updates, and explicit side-effects (“effects”) keep behavior predictable and testable. An environment abstraction lets hosts provide implementations for things like storage, networking, and timers, making the core portable to desktop, mobile, and web. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    python-socketio

    python-socketio

    Python Socket.IO server and client

    python-socketio is a robust Python library that implements the Socket.IO protocol, enabling real-time, bidirectional communication between web clients and servers. It works with multiple asynchronous frameworks such as asyncio, eventlet, and gevent, so developers can choose the concurrency model that best fits their application needs while still using a consistent API. The library provides both server and client implementations, allowing Python applications to serve and communicate with browser clients or other Socket.IO clients in real time. It supports key Socket.IO features like event handling, message broadcasting, binary data transmission, rooms, and namespaces, giving developers the building blocks to create chat applications, live dashboards, multiplayer games, and collaborative tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Live API Web Console

    Live API Web Console

    A react-based starter app for using the Live API over websockets

    ...Configuration lives in a simple .env file and the project boots with standard web tooling, letting you experiment quickly with models, system prompts, and tool declarations. It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. Under the hood there’s an event-emitting WebSocket client, an audio in/out processing layer, and a minimal scaffolded view so you can focus on your app logic rather than wiring.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Warlock-Studio

    Warlock-Studio

    AI Suite for upscaling, interpolating & restoring images/videos

    v6.0. Warlock-Studio is a Windows application that uses Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE Artificial Intelligence models to upscale, restore faces, interpolate frames and reduce noise in images and videos. the application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system...
    Downloads: 34 This Week
    Last Update:
    See Project
  • 16
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17

    Ultimate Media Downloader

    An Open source media downloader for downloading videos and audios

    ...Whether you're downloading a single YouTube video, extracting audio from Spotify playlists, archiving TikTok content, or batch-processing entire music libraries, UMD handles it all with elegance and efficiency. IT CONSISTS OF : 1. Unified Interface: One command, 1000+ platforms. No tool shopping, no mental model switching. 2. Production-Ready, Zero Friction Installation: Most users go from hearing about the tool to downloading content in under 5 minutes. 3. Active Maintenance: Codeberg hosting (after GitHub suspension) demonstrates commitment to long-term availability
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    auto-subtitle

    auto-subtitle

    Automatically generate and overlay subtitles for any video

    auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying accuracy and performance, allowing users to balance speed and quality depending on their needs. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    SuperImage

    SuperImage

    Sharpen your low-resolution pictures with the power of AI upscaling

    ...SuperImage is a neural network-based image upscaling application for Android built with the MNN deep learning framework and Real-ESRGAN. The input image is processed in tiles on the device GPU, using a pre-trained Real-ESRGAN model. The tiles are then merged into the final high-resolution image. This application requires Vulkan or OpenCL support and Android 7 or above.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 20
    Golang HLS Streamer

    Golang HLS Streamer

    A server that exposes a directory for video streaming

    ...It provides tools for creating and managing HLS streams, including segmenting video into smaller chunks and generating M3U8 playlists. The project is intended for developers building streaming servers or media delivery systems. It focuses on performance and simplicity, leveraging Go’s concurrency model to handle streaming tasks efficiently. gohls can be integrated into backend services to enable adaptive streaming workflows. It supports real-time processing and can work alongside tools like FFmpeg for encoding. Overall, it provides a lightweight solution for implementing HLS streaming pipelines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    AI Models

    AI Models

    A repository of trained models

    All models (at least currently) are supported by chaiNNer, an upscaling GUI that allows for both very simple and very complex tasks to be completed in a nice manner where you "chain" nodes together. Highly recommended for images. If you're looking to upscale videos using the models then use enhancr simply due to the fact that it supports TensorRT, which will allow you to upscale videos at incredible speeds! The GUI is one of the best looking applications out there and is personally my go to...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Automatic YouTube subtitle generation

    Automatic YouTube subtitle generation

    Using OpenAI's Whisper to automatically generate YouTube subtitles

    ...The tool processes media locally, extracting audio and applying speech recognition to produce accurate text outputs. It supports multiple languages and can handle different Whisper model sizes, balancing performance and accuracy. yt-whisperc is designed for automation, enabling batch processing of multiple videos for transcription workflows. It also provides options for exporting subtitles in common formats such as SRT. Overall, it simplifies the process of converting video content into searchable and accessible text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Video Pre-Training

    Video Pre-Training

    Learning to Act by Watching Unlabeled Online Videos

    The Video PreTraining (VPT) repository provides code and model artifacts for a project where agents learn to act by watching human gameplay videos—specifically, gameplay of Minecraft—using behavioral cloning. The idea is to learn general priors of control from large-scale, unlabeled video data, and then optionally fine-tune those priors for more goal-directed behavior via environment interaction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    G2SConverter

    G2SConverter

    Convert models from GoldSource engine to Source engine with AI

    ...An example of a processed texture is shown in the following image (parameters used: scaling-factor = 4 and deblur iterations = 4) besides upscaling and debluring the utility also generates normal maps for each texture. This is implemented using the DeepBump by HugoTiny model. Examples of normal maps are shown in the following images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    VSGAN

    VSGAN

    VapourSynth Single Image Super-Resolution Generative Adversarial

    Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry! The Network will be applied in quadrants of the image to reduce up-front VRAM usage. You can use any RGB video input, including float32 (e.g., RGBS) inputs. Using VapourSynth you can pass a Video directly to VSGAN, without any frame extraction needed. Any edit you make in the VapourSynth script with or without VSGAN can be re-used for any other video. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo