input-output model free download

Showing 23 open source projects for "input-output model"


Filters: Multimedia, Python

1

PersonaLive

Expressive Portrait Image Animation for Live Streaming

PersonaLive is an open-source diffusion-based portrait animation framework focused on generating expressive, long-duration animated sequences in real time, primarily for live streaming or interactive applications. It leverages deep generative models that condition on a static reference image and a driving input (such as motion or expression cues) to produce a seamless animated portrait sequence that can run indefinitely without segmentation artifacts. The framework prioritizes low-latency...

Downloads: 0 This Week

Last Update: 2026-05-15
2

Moshi

A speech-text foundation model for real time dialogue

...Moshi models two streams of audio: one corresponds to Moshi, and the other to the user. At inference, the user's stream is taken from the audio input, and Moshi's stream is sampled from the model's output. Alongside these two audio streams, Moshi predicts text tokens corresponding to its own speech (its inner monologue), which greatly improves the quality of its generation. A small Depth Transformer models inter-codebook dependencies for a given time step, while a large, 7B-parameter Temporal Transformer models the temporal dependencies.

Downloads: 2 This Week

Last Update: 2024-11-05
3

PersonaPlex

PersonaPlex code

...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.

Downloads: 0 This Week

Last Update: 2026-03-02
4

TRELLIS.2

Native and Compact Structured Latents for 3D Generation

TRELLIS.2 is a cutting-edge open-source model and codebase for high-fidelity 3D asset generation from 2D images, developed to push forward the state of the art in image-to-3D generation. At its core is a novel sparse voxel structure called O-Voxel that jointly encodes both geometry and surface appearance, enabling reconstruction and generation of complex 3D shapes with arbitrary topology, open surfaces, and physically based rendering (PBR) textures. The system leverages a large...

Downloads: 40 This Week

Last Update: 2026-06-05
5

CadQuery

A python parametric CAD scripting framework based on OCCT

...Build models with scripts that are as close as possible to how you’d describe the object to a human, using a standard, already established programming language. Create parametric models that can be very easily customized by end users. Output high-quality CAD formats like STEP and AMF in addition to traditional STL. Provide a non-proprietary, plain text model format that can be edited and executed with only a web browser. The scripts use a standard programming language, Python, and thus can benefit from the associated infrastructure. This includes many standard libraries and IDEs. ...

Downloads: 56 This Week

Last Update: 2026-06-20
6

Mesh R-CNN

code for Mesh R-CNN, ICCV 2019

...The system combines 2D detection from Mask R-CNN with 3D reasoning modules that output full mesh reconstructions aligned with the input image. It has been evaluated on datasets such as Pix3D, where it demonstrates state-of-the-art performance in reconstructing real-world object geometry.

Downloads: 1 This Week

Last Update: 3 days ago
7

screenshot-to-code

Drop in a screenshot and convert it to clean code

screenshot-to-code converts UI screenshots or design images into working front-end code, accelerating the path from concept to prototype. It uses modern vision-capable or code-generating models to infer layout structure, typography, and components, then outputs clean HTML/CSS (often Tailwind) or framework code. A web interface lets you upload images, tune options, and preview generated results, while a backend service orchestrates the model calls and post-processing. The tool focuses on...

Downloads: 4 This Week

Last Update: 2026-07-10
8

Segmentation Models

Segmentation models with pretrained backbones. PyTorch

...Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). This is not necessary if you train the whole model rather than only the decoder. PyTorch Image Models (a.k.a. timm) offers many pretrained models and an interface that allows using them as encoders in smp; however, not all models are supported. The input channels parameter lets you create models that process tensors with an arbitrary number of channels.

Downloads: 0 This Week

Last Update: 2025-04-17
9

Real-ESRGAN GUI

Cross-platform GUI for image upscaler Real-ESRGAN

...According to actual measurements, the arm64 single-architecture build performs better than the universal2 dual-architecture build on Apple-silicon Macs, so Apple-silicon users are advised to package the arm64 single-architecture application themselves. Real-ESRGAN can only enlarge the input image by a fixed 2-4x magnification (depending on the selected model). Other magnifications are achieved by calling Real-ESRGAN multiple times and then downsampling with a conventional scaling algorithm. For GIFs, each frame is split out and its duration recorded; the frames are upscaled one by one and then merged. Drag an image file or directory anywhere in the window, and its path is automatically set as the input.

Downloads: 141 This Week

Last Update: 2024-06-02
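
The magnification approach described for Real-ESRGAN GUI (repeated model passes, then a conventional downscale) can be sketched as follows. This is only an illustration under stated assumptions: `plan_upscale` is a hypothetical helper, not part of the GUI or of Real-ESRGAN, and it only computes the plan rather than processing any image.

```python
from typing import Tuple


def plan_upscale(target_scale: float, model_scale: int) -> Tuple[int, float]:
    """Plan how to reach target_scale with a fixed-factor upscaler.

    Hypothetical helper: returns (passes, resize_factor), where `passes` is
    the number of model calls and `resize_factor` (<= 1.0) is the conventional
    downscale applied afterwards.
    """
    if target_scale <= 0:
        raise ValueError("target_scale must be positive")
    if model_scale < 2:
        raise ValueError("model_scale must be at least 2")

    passes = 1
    reached = model_scale  # magnification after the first model pass
    while reached < target_scale:
        reached *= model_scale  # each extra pass multiplies the magnification
        passes += 1

    # Shrink the overshoot back down to the requested magnification.
    return passes, target_scale / reached


if __name__ == "__main__":
    passes, factor = plan_upscale(target_scale=3, model_scale=2)
    print(f"{passes} passes, then resize by {factor}")
```

For a 3x target with a 2x model, the plan is two passes (4x in total) followed by a resize to three quarters of that size.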
10

Easy Background Remover

Free offline background remover for Windows - one click, no watermark

Easy Background Remover is a free, offline, AI-powered background remover for Windows 10 and 11. Drop in a photo and the background disappears in one click, leaving a clean transparent PNG. Your images never leave your computer - there is no upload, no sign-up, no account, no watermark and no limits, ever. The app runs the open source u2net neural network locally on your CPU through onnxruntime, so it works on any modern Windows PC and does not need a GPU. Pick a single photo or drop a whole...

Downloads: 18 This Week

Last Update: 2026-06-09
11

MLT Multimedia Framework

A multimedia authoring and processing framework and a video playout server for television broadcasting.

17 Reviews

Downloads: 7 This Week

Last Update: 2026-06-25
12

Warlock-Studio

AI Suite for upscaling, interpolating & restoring images/videos

v6.0. Warlock-Studio is a Windows application that uses the Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE artificial intelligence models to upscale, restore faces, interpolate frames and reduce noise in images and videos. The application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system...

Downloads: 23 This Week

Last Update: 2026-02-16
13

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...

Downloads: 62 This Week

Last Update: 2026-03-23
14

Stable Diffusion Rembg

Removes backgrounds from pictures. Extension for webui

This project is an extension for the Stable Diffusion Web UI that removes backgrounds from images directly inside the interface. It wraps popular background-removal models so creators can take a generated or uploaded image and isolate the subject with a single click. The workflow is designed to be non-destructive: you can preview, tweak thresholds, and export either a transparent PNG or a masked layer for further editing. Because it runs within the Web UI, you can chain it with other...

Downloads: 0 This Week

Last Update: 2025-10-23
15

ProPainter

Improving Propagation and Transformer for Video Inpainting

ProPainter is a video inpainting project based on the ICCV 2023 paper “Improving Propagation and Transformer for Video Inpainting.” It is designed to remove objects, complete missing regions, and fill masked areas in videos while preserving temporal consistency. The project accepts video input or split frames along with frame-wise masks that define the areas to reconstruct. It provides pretrained models, example inputs, inference scripts, and an interactive demo workflow for object removal....

Downloads: 81 This Week

Last Update: 2026-07-02
16

auto-subtitle

Automatically generate and overlay subtitles for any video

auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying accuracy and performance, allowing users to balance speed and quality depending on their needs. ...

Downloads: 2 This Week

Last Update: 2026-04-24
17

VSGAN

VapourSynth Single Image Super-Resolution Generative Adversarial

Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry! The Network will be applied in quadrants of the image to reduce up-front VRAM usage. You can use any RGB video input, including float32 (e.g., RGBS) inputs. ...

Downloads: 2 This Week

Last Update: 2023-03-29
18

ExiFlow

A set of tools (command line and GUI) to provide a complete digital photo workflow for Unixes. EXIF headers are used as the central information repository, so users may change their software at any time without losing any data.

1 Review

Downloads: 2 This Week

Last Update: 2022-04-13
19

TRACER

Extreme Attention Guided Salient Object Tracing Network

Extreme Attention Guided Salient Object Tracing Network (AAAI 2022) implementation in PyTorch. A fast inference mode now outputs the salient object result together with its mask. You can get a clearer salient object by tuning the threshold. Initialization of TRACER with a version of pre-trained TE-x will be released.

Downloads: 0 This Week

Last Update: 2023-04-05
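
The threshold tuning mentioned in the TRACER entry can be illustrated with a minimal sketch. It assumes the saliency map is a grid of scores in [0, 1]; `binarize` and the sample values are hypothetical and do not come from the TRACER code base.

```python
from typing import List


def binarize(saliency: List[List[float]], threshold: float) -> List[List[int]]:
    """Turn a saliency map into a 0/1 mask (hypothetical helper).

    Pixels scoring at or above `threshold` are kept as foreground.
    """
    return [[1 if score >= threshold else 0 for score in row] for row in saliency]


# Illustrative scores only; a real map would come from the network's output.
saliency_map = [
    [0.1, 0.4, 0.9],
    [0.2, 0.6, 0.8],
]

loose_mask = binarize(saliency_map, threshold=0.3)   # keeps uncertain pixels
strict_mask = binarize(saliency_map, threshold=0.7)  # keeps only confident pixels

if __name__ == "__main__":
    print(loose_mask)
    print(strict_mask)
```

Raising the threshold trims low-confidence pixels from the mask, while lowering it keeps more of the object's soft boundary.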
20

Consistent Depth

We estimate dense, flicker-free, geometrically consistent depth

...The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a specific input video, ensuring stable and realistic depth maps even in less-constrained regions. This approach achieves improved geometric consistency and visual stability compared to prior monocular reconstruction methods. The project can process challenging hand-held video footage, including footage with moderate dynamic motion, making it practical for real-world usage.

Downloads: 1 This Week

Last Update: 3 days ago
21

Super-résolution via CNN

Super resolution using a CNN, based on the work of the DGtal team

...First of all, an Nvidia graphics card (neither AMD nor Intel integrated) is highly recommended to parallelize the CNN. You will then need to install CUDA. Without CUDA, training is dozens of times slower. This program will generate "model_epoch_n.pth" files corresponding to the model at epoch n, in a folder named saved_model_u t_bs bs_tbs tbs_lr lr, where the first field gives the scale factor, bs the size of the training batch, tbs the size of the test batch, and lr the learning rate. Low-res images should be located in a "dataset/input" folder, and high-res targets in a "dataset/target" folder, with each image having the same name in both folders.

Downloads: 1 This Week

Last Update: 2023-03-30
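
The dataset layout described for Super-résolution via CNN (low-res images in "dataset/input", high-res targets with the same file names in "dataset/target") can be checked with a short sketch like the one below. `pair_images` is a hypothetical helper written for illustration; it is not taken from the project.

```python
from pathlib import Path
from typing import List, Tuple


def pair_images(root: str = "dataset") -> List[Tuple[Path, Path]]:
    """Pair each low-res image with the same-named high-res target.

    Hypothetical helper: assumes `root` contains "input" and "target"
    folders, as the project description states.
    """
    input_dir = Path(root) / "input"
    target_dir = Path(root) / "target"

    pairs = []
    for low_res in sorted(p for p in input_dir.iterdir() if p.is_file()):
        high_res = target_dir / low_res.name  # same file name in both folders
        if not high_res.is_file():
            raise FileNotFoundError(f"no target image for {low_res.name}")
        pairs.append((low_res, high_res))
    return pairs


if __name__ == "__main__":
    for low_res, high_res in pair_images("dataset"):
        print(low_res, "->", high_res)
```

Failing early on a missing target makes a naming mismatch visible before a long training run starts.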
22

Image Super-Resolution (ISR)

Super-scale your images and run experiments with Residual Dense

The goal of this project is to upscale and improve the quality of low-resolution images. This project contains Keras implementations of different Residual Dense Networks for Single Image Super-Resolution (ISR) as well as scripts to train these networks using content and adversarial loss components. Docker scripts and Google Colab notebooks are available to carry out training and prediction. Also, we provide scripts to facilitate training on the cloud with AWS and Nvidia-docker with only a few...

Downloads: 3 This Week

Last Update: 2022-03-31
23

Open Asset Import Library

Importer library to import assets from different common 3D file formats such as Collada, Blend, Obj, X, 3DS, LWO, MD5, MD2, MD3, MDL, MS3D and many others. The data is stored in its own in-memory data format, which can be easily processed. www.open3mod.com/ is a 3D model viewer and exporter based on Assimp that is also open source.

24 Reviews

Downloads: 37 This Week

Last Update: 2014-06-21