model-builder free download

Showing 61 open source projects for "model-builder"

View related business solutions

Video Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
1

VMZ (Video Model Zoo)

VMZ: Model Zoo for Video Modeling

...It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal integration strategies that influenced modern architectures like SlowFast and X3D.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
2

SeedVR2 Upscaler ComfyUI

Official SeedVR2 Video Upscaler for ComfyUI

ComfyUI-SeedVR2 Video Upscaler is an open-source integration node for the ComfyUI workflow environment that brings the advanced SeedVR2 video upscaling and restoration model directly into visual AI pipelines. This project packages the SeedVR2 architecture as a custom node for ComfyUI, letting users upscale low-resolution video or imagery inside a node-based interface without needing to write code manually. The underlying SeedVR2 model is known for delivering high-quality video enhancement with strong temporal consistency and improved detail preservation by using diffusion-based techniques that are trained specifically on video sequences. ...

Downloads: 31 This Week

Last Update: 2026-01-07
See Project
3

MediaPipe

Cross-platform, customizable ML solutions for live and streaming media

...It employs machine learning (ML) to infer the 3D surface geometry, requiring only a single camera input without the need for a dedicated depth sensor. Utilizing lightweight model architectures together with GPU acceleration throughout the pipeline, the solution delivers real-time performance-critical for live experiences. Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control. For example, it can form the basis for yoga, dance, and fitness applications. ...

Downloads: 71 This Week

Last Update: 2026-04-23
See Project
4

Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitle (hardsub) from videos

...Support GPU acceleration, after GPU acceleration, you can get higher accuracy and faster extraction speed. (CLI version) No need for users to manually set the subtitle area, the project automatically detects the subtitle area through the text detection model. Filter the text in the non-subtitle area and remove the watermark (station logo) text.

1 Review

Downloads: 52 This Week

Last Update: 2026-04-05
See Project
Stop Storing Third-Party Tokens in Your Database
Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.

Try Auth0 for Free
5

MeshLab

The open source mesh processing system

MeshLab is an open-source, portable, and extensible system for the processing and editing of unstructured large 3D triangular meshes. It is aimed to help the processing of the typical not-so-small unstructured models arising in 3D scanning, providing a set of tools for editing, cleaning, healing, inspecting, rendering and converting this kind of meshes. MeshLab is mostly based on the open source C++ mesh processing library VCGlib developed at the Visual Computing Lab of ISTI - CNR. VCG can...

Downloads: 30 This Week

Last Update: 2025-07-22
See Project
6

Rsmpeg

A Rust crate that exposes FFmpeg's power as much as possible

...It allows developers to interact directly with FFmpeg libraries for tasks such as encoding, decoding, and stream manipulation. The library is designed to balance performance with safety by leveraging Rust’s ownership model and memory management features. It supports handling of containers, codecs, and frames, enabling fine-grained control over media workflows. rsmpeg is suitable for building high-performance multimedia applications that require precise control over processing pipelines. It also integrates well with other Rust libraries, enabling complex system development. ...

Downloads: 0 This Week

Last Update: 2026-04-27
See Project
7

ZLMediaKit

WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket

...The project is designed for protocol conversion, live streaming, low-latency playback, recording, forwarding, and large-scale client access. Its asynchronous, multithreaded network model makes it suitable for commercial streaming deployments with many concurrent connections. ZLMediaKit can be deployed directly as a complete media server or embedded through its standard C API as an SDK for other languages and systems. It also provides REST APIs and webhooks so businesses can integrate authentication, event handling, and custom streaming logic.

Downloads: 10 This Week

Last Update: 2026-06-08
See Project
8

nunif

Misc; latest version of waifu2x; 2D video to stereo 3D video

nunif is a deep learning–based image processing framework focused on image upscaling, restoration, denoising, and enhancement tasks using neural network models. The project provides a collection of AI-powered utilities designed primarily for anime-style artwork, illustrations, and high-quality image restoration workflows. It includes command-line tools and graphical interfaces for applying trained neural models to improve image resolution and visual clarity while minimizing artifacts. nunif...

Downloads: 4 This Week

Last Update: 2026-05-06
See Project
9

Real-ESRGAN GUI

Cross-platform GUI for image upscaler Real-ESRGAN

...According to actual measurements, arm64the single-architecture performance is better than universal2the dual- architecture Mac on the Apple chip, so Apple chip users are advised to pack arm64single-architecture applications by themselves. Real-ESRGAN can only enlarge the input image with a fixed 2-4x magnification (related to the selected model). This functionality is achieved by downsampling using a conventional scaling algorithm after multiple calls to Real-ESRGAN. Split each frame of the GIF and record the duration, zoom in one by one and then merge. Drag an image file or directory to any position in the window, and its path can be automatically set as the input.

Downloads: 94 This Week

Last Update: 2024-06-02
See Project
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
10

Segmentation Models

Segmentation models with pretrained backbones. PyTorch

...Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). It is not necessary in case you train the whole model, not only the decoder. Pytorch Image Models (a.k.a. timm) has a lot of pretrained models and interface which allows using these models as encoders in smp, however, not all models are supported. Input channels parameter allows you to create models, which process tensors with an arbitrary number of channels.

Downloads: 0 This Week

Last Update: 2025-04-17
See Project
11

Transcoder

Hardware-accelerated video transcoding using Android MediaCodec APIs

Transcoder by DeepMedia is an AI-powered video-to-video speech translation engine that enables fully automated multilingual dubbing. Unlike traditional speech translation systems that rely on multi-stage pipelines, Transcoder directly translates one speaker’s video into another language while preserving facial expressions, lip-sync, and vocal identity. Designed for real-time use and production-grade pipelines, Transcoder combines advanced deep learning models with GPU acceleration to deliver...

Downloads: 1 This Week

Last Update: 2025-03-25
See Project
12

Stremio Core

Types, addon system, UI models, core logic

Stremio Core is the Rust engine that powers Stremio’s apps by centralizing all reusable logic behind discovery, catalogs, metadata, streams, add-ons, and user/library state. It exposes a clean set of modules—types, addon_transport, and state_types—so apps can talk to add-ons, model UI state, and react to events without duplicating code. The architecture is inspired by Elm: immutable state, message-driven updates, and explicit side-effects (“effects”) keep behavior predictable and testable. An environment abstraction lets hosts provide implementations for things like storage, networking, and timers, making the core portable to desktop, mobile, and web. ...

Downloads: 1 This Week

Last Update: 1 day ago
See Project
13

python-socketio

Python Socket.IO server and client

python-socketio is a robust Python library that implements the Socket.IO protocol, enabling real-time, bidirectional communication between web clients and servers. It works with multiple asynchronous frameworks such as asyncio, eventlet, and gevent, so developers can choose the concurrency model that best fits their application needs while still using a consistent API. The library provides both server and client implementations, allowing Python applications to serve and communicate with browser clients or other Socket.IO clients in real time. It supports key Socket.IO features like event handling, message broadcasting, binary data transmission, rooms, and namespaces, giving developers the building blocks to create chat applications, live dashboards, multiplayer games, and collaborative tools.

Downloads: 0 This Week

Last Update: 12 hours ago
See Project
14

Live API Web Console

A react-based starter app for using the Live API over websockets

...Configuration lives in a simple .env file and the project boots with standard web tooling, letting you experiment quickly with models, system prompts, and tool declarations. It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. Under the hood there’s an event-emitting WebSocket client, an audio in/out processing layer, and a minimal scaffolded view so you can focus on your app logic rather than wiring.

Downloads: 0 This Week

Last Update: 2025-10-14
See Project
15

Warlock-Studio

AI Suite for upscaling, interpolating & restoring images/videos

v6.0. Warlock-Studio is a Windows application that uses Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE Artificial Intelligence models to upscale, restore faces, interpolate frames and reduce noise in images and videos. the application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system...

Downloads: 34 This Week

Last Update: 2026-02-16
See Project
16

MLT Multimedia Framework

A multimedia authoring and processing framework and a video playout server for television broadcasting.

17 Reviews

Downloads: 4 This Week

Last Update: 2026-04-22
See Project
17

Ultimate Media Downloader

An Open source media downloader for downloading videos and audios

...Whether you're downloading a single YouTube video, extracting audio from Spotify playlists, archiving TikTok content, or batch-processing entire music libraries, UMD handles it all with elegance and efficiency. IT CONSISTS OF : 1. Unified Interface: One command, 1000+ platforms. No tool shopping, no mental model switching. 2. Production-Ready, Zero Friction Installation: Most users go from hearing about the tool to downloading content in under 5 minutes. 3. Active Maintenance: Codeberg hosting (after GitHub suspension) demonstrates commitment to long-term availability

Downloads: 0 This Week

Last Update: 7 days ago
See Project
18

auto-subtitle

Automatically generate and overlay subtitles for any video

auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying accuracy and performance, allowing users to balance speed and quality depending on their needs. ...

Downloads: 2 This Week

Last Update: 2026-04-24
See Project
19

SuperImage

Sharpen your low-resolution pictures with the power of AI upscaling

...SuperImage is a neural network-based image upscaling application for Android built with the MNN deep learning framework and Real-ESRGAN. The input image is processed in tiles on the device GPU, using a pre-trained Real-ESRGAN model. The tiles are then merged into the final high-resolution image. This application requires Vulkan or OpenCL support and Android 7 or above.

Downloads: 21 This Week

Last Update: 2023-04-13
See Project
20

Golang HLS Streamer

A server that exposes a directory for video streaming

...It provides tools for creating and managing HLS streams, including segmenting video into smaller chunks and generating M3U8 playlists. The project is intended for developers building streaming servers or media delivery systems. It focuses on performance and simplicity, leveraging Go’s concurrency model to handle streaming tasks efficiently. gohls can be integrated into backend services to enable adaptive streaming workflows. It supports real-time processing and can work alongside tools like FFmpeg for encoding. Overall, it provides a lightweight solution for implementing HLS streaming pipelines.

Downloads: 0 This Week

Last Update: 2026-04-27
See Project
21

AI Models

A repository of trained models

All models (at least currently) are supported by chaiNNer, an upscaling GUI that allows for both very simple and very complex tasks to be completed in a nice manner where you "chain" nodes together. Highly recommended for images. If you're looking to upscale videos using the models then use enhancr simply due to the fact that it supports TensorRT, which will allow you to upscale videos at incredible speeds! The GUI is one of the best looking applications out there and is personally my go to...

Downloads: 3 This Week

Last Update: 2023-03-29
See Project
22

Automatic YouTube subtitle generation

Using OpenAI's Whisper to automatically generate YouTube subtitles

...The tool processes media locally, extracting audio and applying speech recognition to produce accurate text outputs. It supports multiple languages and can handle different Whisper model sizes, balancing performance and accuracy. yt-whisperc is designed for automation, enabling batch processing of multiple videos for transcription workflows. It also provides options for exporting subtitles in common formats such as SRT. Overall, it simplifies the process of converting video content into searchable and accessible text.

Downloads: 0 This Week

Last Update: 2026-04-24
See Project
23

Video Pre-Training

Learning to Act by Watching Unlabeled Online Videos

The Video PreTraining (VPT) repository provides code and model artifacts for a project where agents learn to act by watching human gameplay videos—specifically, gameplay of Minecraft—using behavioral cloning. The idea is to learn general priors of control from large-scale, unlabeled video data, and then optionally fine-tune those priors for more goal-directed behavior via environment interaction.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
24

G2SConverter

Convert models from GoldSource engine to Source engine with AI

...An example of a processed texture is shown in the following image (parameters used: scaling-factor = 4 and deblur iterations = 4) besides upscaling and debluring the utility also generates normal maps for each texture. This is implemented using the DeepBump by HugoTiny model. Examples of normal maps are shown in the following images.

Downloads: 0 This Week

Last Update: 2023-03-30
See Project
25

VSGAN

VapourSynth Single Image Super-Resolution Generative Adversarial

Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry! The Network will be applied in quadrants of the image to reduce up-front VRAM usage. You can use any RGB video input, including float32 (e.g., RGBS) inputs. Using VapourSynth you can pass a Video directly to VSGAN, without any frame extraction needed. Any edit you make in the VapourSynth script with or without VSGAN can be re-used for any other video. ...

Downloads: 0 This Week

Last Update: 2023-03-29
See Project