Page 7 | input free download

Showing 506 open source projects for "input"

View related business solutions

Python Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Build AI Apps with Gemini 3 on Vertex AI
Access Google’s most capable multimodal models. Train, test, and deploy AI with 200+ foundation models on one platform.

Vertex AI gives developers access to Gemini 3—Google’s most advanced reasoning and coding model—plus 200+ foundation models including Claude, Llama, and Gemma. Build generative AI apps with Vertex AI Studio, customize with fine-tuning, and deploy to production with enterprise-grade MLOps. New customers get $300 in free credits.

Try Vertex AI Free
1

CycleGAN and pix2pix in PyTorch

Image-to-Image Translation in PyTorch

CycleGAN and pix2pix in PyTorch repository is a PyTorch implementation of two influential image-to-image translation frameworks: CycleGAN (for unpaired translation) and pix2pix (for paired translation). This repo gives developers and researchers a convenient, modern (PyTorch-based) platform to train and test these methods — supporting both paired datasets (input to output) and unpaired datasets (domain-to-domain) with minimal changes. The code supports standard training and inference pipelines, and as of recent updates, compatibility with the latest Python and PyTorch versions (e.g. Python 3.11, PyTorch 2.4) as well as support for distributed/multi-GPU training for scalable workflows. Because of its flexibility, users can apply it to many tasks: e.g. style transfer between domains (e.g. season changes, art-to-photo, etc.), mapping sketches/edges to real images, image colorization, day-to-night, photo enhancement, and more.

Downloads: 0 This Week

Last Update: 2025-12-09
See Project
2

Map-Anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

...Instead of stitching together many task-specific models, it uses a single architecture that supports a wide range of 3D tasks—multi-image structure-from-motion, multi-view stereo, monocular metric depth, registration, depth completion, and more. The model flexibly accepts different input combinations (images, intrinsics, poses, sparse or dense depth) and produces a rich set of outputs including per-pixel 3D points, camera intrinsics, camera poses, ray directions, confidence maps, and validity masks. Its inference path is fully feed-forward with optional mixed-precision and memory-efficient modes, making it practical to scale to long image sequences while keeping latency predictable.

Downloads: 0 This Week

Last Update: 2026-01-18
See Project
3

Automated Interpretability

Code for Language models can explain neurons in language models paper

...It includes a “neuron explainer” component that, given a target neuron or latent feature, proposes natural language explanations or heuristics (e.g. “this neuron activates when the input has property X”) and then simulates activation behavior across example inputs to test whether the explanation holds. The project also contains a “neuron viewer” web component for browsing neurons, explanations, and activation patterns, making it more interactive and exploratory.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
4

Kubespider

A global resource download orchestration system

We are a community of individuals who share a passion for life and have come together based on shared interests and needs. In our free time, we collaborated to develop Kubespider. Kubespider is developed to utilize an idle server in a local area network as a NAS, enabling automatic downloads of TV series, triggering downloads from a local laptop, and adapting to various websites such as YouTube and BiliBili, as well as different types of resources such as TV series, movies, music and more....

Downloads: 0 This Week

Last Update: 2024-06-27
See Project
Cut Data Warehouse Costs up to 54% with BigQuery
Migrate from Snowflake, Databricks, or Redshift with free migration tools. Exabyte scale without the Exabyte price.

BigQuery delivers up to 54% lower TCO than cloud alternatives. Migrate from legacy or competing warehouses using free BigQuery Migration Service with automated SQL translation. Get serverless scale with no infrastructure to manage, compressed storage, and flexible pricing—pay per query or commit for deeper discounts. New customers get $300 in free credit.

Try BigQuery Free
5

Segmentation Models

Segmentation models with pretrained backbones. PyTorch

...Pytorch Image Models (a.k.a. timm) has a lot of pretrained models and interface which allows using these models as encoders in smp, however, not all models are supported. Input channels parameter allows you to create models, which process tensors with an arbitrary number of channels.

Downloads: 0 This Week

Last Update: 2025-04-17
See Project
6

Pytorch-toolbelt

PyTorch extensions for fast R&D prototyping and Kaggle farming

A pytorch-toolbelt is a Python library with a set of bells and whistles for PyTorch for fast R&D prototyping and Kaggle farming. Easy model building using flexible encoder-decoder architecture. Modules: CoordConv, SCSE, Hypercolumn, Depthwise separable convolution and more. GPU-friendly test-time augmentation TTA for segmentation and classification. GPU-friendly inference on huge (5000x5000) images. Every-day common routines (fix/restore random seed, filesystem utils, metrics). Losses:...

Downloads: 0 This Week

Last Update: 2024-11-21
See Project
7

UNO

A Universal Customization Method for Single and Multi Conditioning

UNO is a project by ByteDance introduced in 2025, titled “A Universal Customization Method for Both Single and Multi-Subject Conditioning.” It suggests a framework for image (or more general generative) modeling where the model can be conditioned either on a single subject or multiple subjects — which may correspond to generating or customizing images featuring specific people, styles, or objects, possibly with fine-grained control over subject identity or composition. Because the project is...

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
8

Matrix

Multi-Agent daTa geneRation Infra and eXperimentation framework

Matrix is a distributed, large-scale engine for multi-agent synthetic data generation and experiments: it provides the infrastructure to run thousands of “agentic” workflows concurrently (e.g. multiple LLMs interacting, reasoning, generating content, data-processing pipelines) by leveraging distributed computing (like Ray + cluster management). The idea is to treat data generation as a “data-to-data” transformation: each input item defines a task, and the runtime orchestrates asynchronous, peer-to-peer agent workflows, avoiding global synchronization bottlenecks. That design makes Matrix particularly well-suited for large-batch inference, model benchmarking, data curation, augmentation, or generation — whether for language, code, dialogue, or multimodal tasks. ...

Downloads: 0 This Week

Last Update: 2025-12-30
See Project
9

shuyuan

Reading book source

...For learners, researchers, or avid readers, Shuyuan offers a way to bridge from plain text files or eBooks into a manageable, interactive resource — one where notes, references, and reading progress can be tracked. It likely supports different input formats (text, HTML, PDF), and may integrate optional translation or text normalization tools.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
10

Raster Vision

Open source framework for deep learning satellite and aerial imagery

...Raster Vision allows engineers to quickly and repeatably configure pipelines that go through core components of a machine learning workflow: analyzing training data, creating training chips, training models, creating predictions, evaluating models, and bundling the model files and configuration for easy deployment. The input to a Raster Vision pipeline is a set of images and training data, optionally with Areas of Interest (AOIs) that describe where the images are labeled. The output of a Raster Vision pipeline is a model bundle that allows you to easily utilize models in various deployment scenarios.

Downloads: 0 This Week

Last Update: 2024-08-30
See Project
11

Protenix

A trainable PyTorch reproduction of AlphaFold 3

Protenix is an open-source, trainable PyTorch reimplementation of AlphaFold 3, developed by ByteDance with the goal of democratizing high-accuracy protein structure prediction for computational biology and drug-discovery research. Protenix provides a complete pipeline for turning protein sequences (with optional MSA / sequence alignment) or structural inputs (e.g. PDB/CIF) into full 3D atomic-level structure predictions. It supports both “full” models and lightweight variants such as...

Downloads: 0 This Week

Last Update: 2026-02-09
See Project
12

Step-Audio 2

Multi-modal large language model designed for audio understanding

Step-Audio2 is an advanced, end-to-end multimodal large language model designed for high-fidelity audio understanding and natural speech conversation: unlike many pipelines that separate speech recognition, processing, and synthesis, Step-Audio2 processes raw audio, reasons about semantic and paralinguistic content (like emotion, speaker characteristics, non-verbal cues), and can generate contextually appropriate responses — including potentially generating or transforming audio output. It...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
13

Qwen2.5-Omni

Capable of understanding text, audio, vision, video

Qwen2.5-Omni is an end-to-end multimodal flagship model in the Qwen series by Alibaba Cloud, designed to process multiple modalities (text, images, audio, video) and generate responses both as text and natural speech in streaming real-time. It supports “Thinker-Talker” architecture, and introduces innovations for aligning modalities over time (for example synchronizing video/audio), robust speech generation, and low-VRAM/quantized versions to make usage more accessible. It holds...

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
14

Graphtage

A semantic diff utility and library for tree-like files such as JSON

...Its name is a portmanteau of “graph” and “graftage”, the latter being the horticultural practice of joining two trees together such that they grow as one. Graphtage performs an analysis on an intermediate representation of the trees that is divorced from the filetypes of the input files. This means, for example, that you can diff a JSON file against a YAML file. Also, the output format can be different from the input format(s). By default, Graphtage will format the output diff in the same file format as the first input file. But one could, for example, diff two JSON files and format the output in YAML. There are several command-line arguments to specify these transformations, such as --format; please check the --help output for more information.

Downloads: 1 This Week

Last Update: 2024-05-20
See Project
15

M64Py

A frontend for Mupen64Plus

M64Py is a Qt6 front-end (GUI) for Mupen64Plus, a cross-platform plugin-based Nintendo 64 emulator. Front-end is written in Python and it provides a user-friendly interface over the Mupen64Plus shared library.

4 Reviews

Downloads: 160 This Week

Last Update: 2024-10-20
See Project
16

SuperAGI

A dev-first open source autonomous AI agent framework

...Join a community of developers constantly contributing to make SuperAGI better. Access your agents through a graphical user interface. Interact with agents by giving them input, permissions, etc. Agents typically learn and improve their performance over time with feedback loops. Run multiple agents simultaneously to improve efficiency and productivity. Connect to multiple Vector DBs to enhance your agent’s performance. Each agent is unique, use different models of your choice. Get insights into your agent’s performance and optimize accordingly. ...

Downloads: 3 This Week

Last Update: 2024-01-12
See Project
17

DeepSeek MoE

Towards Ultimate Expert Specialization in Mixture-of-Experts Language

DeepSeek-MoE (“DeepSeek MoE”) is the DeepSeek open implementation of a Mixture-of-Experts (MoE) model architecture meant to increase parameter efficiency by activating only a subset of “expert” submodules per input. The repository introduces fine-grained expert segmentation and shared expert isolation to improve specialization while controlling compute cost. For example, their MoE variant with 16.4B parameters claims comparable or better performance to standard dense models like DeepSeek 7B or LLaMA2 7B using about 40% of the total compute. ...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
18

vocal-separate

An extremely simple tool for separating vocals and background music

vocal-separate is a simple but effective audio processing application that isolates vocals and instrumental tracks from music and video files using stem-based source separation models, enabling tasks such as karaoke creation, remixing, and music analysis. Built as a localized web-based tool, it runs entirely on the user’s machine without requiring an internet connection, emphasizing privacy and convenience for creative work. Users can drag and drop an audio or video file onto the interface...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
19

OpenPiano — Virtual Piano for Windows

Desktop piano playable with a PC keyboard, mouse, or MIDI device.

...Project links: Website: https://www.justagwas.com/projects/openpiano GitHub: https://github.com/Justagwas/openpiano Documentation: https://github.com/Justagwas/openpiano/wiki The application is fully open source. All input handling, playback, and MIDI recording/export are performed on the user's machine. VirusTotal scan result: https://www.virustotal.com/gui/file/8e038bb061d4778715006a9d4b0a6ed9b4f5b6565050d5808452316b42eae7d4

Downloads: 83 This Week

Last Update: 2 days ago
See Project
20

Sonnet

TensorFlow-based neural network library

...Sonnet’s programming model revolves around a single concept: modules. These modules can hold references to parameters, other modules and methods that apply some function on the user input. There are a number of predefined modules that already ship with Sonnet, making it quite powerful and yet simple at the same time. Users are also encouraged to build their own modules. Sonnet is designed to be extremely unopinionated about your use of modules. It is simple to understand, and offers clear and focused code.

Downloads: 0 This Week

Last Update: 2024-01-02
See Project
21

OpenFieldAI - AI Open Field Test Tracker

OpenFieldAI is an AI based Open Field Test Rodent Tracker

...The software generates Centroid graph, Heat map and Line path and a spreadsheet containing all calculated parameters like - Speed - Time in and out of ROI - Distance - Entries/Exits for single/multiple pre-recorded videos or live webcam video. The ROI is assigned automatically in multiple video input , and can be manually given in single input. - For Queries/ Reporting Bugs, contact: kabeermuzammil614@gmail.com - Available on WIndows OS - Software Authorship - Muzammil Kabier and Shamili Mariya Varghese ( Sole Authors )

Downloads: 15 This Week

Last Update: 2026-01-05
See Project
22

CAIRO for AERMOD

AERMOD, visualisation, input, modelling and compiling tool

...Elevation and meteorological data is third party. CAIRO for AERMOD (Compile AERMAP, AERMOD and AERPLOT Input and Run Output) aimed at streamlining the process of making complex input file, with unique syntax and running them. While obviating the need to manually write inputs and running the program through Shell

1 Review

Downloads: 0 This Week

Last Update: 2025-03-03
See Project
23

Sharada-Braille-Writer

Six key approach to producing print materials.

...Website : https://zendalona.com/sbw/ Github repository 1 : https://github.com/zendalona/sbw Github repository 1 : https://github.com/zendalona/libbraille-input Ubuntu PPA 1 : https://launchpad.net/~nalin-x-linux/+archive/ubuntu/sbw Ubuntu PPA 2 : https://launchpad.net/~nalin-x-linux/+archive/ubuntu/libbraille-input

Downloads: 1 This Week

Last Update: 2025-06-22
See Project
24

Stable Diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

...The Stable Diffusion project, developed by Stability AI, is a cutting-edge image synthesis model that utilizes latent diffusion techniques for high-resolution image generation. It offers an advanced method of generating images based on text input, making it highly flexible for various creative applications. The repository contains pretrained models, various checkpoints, and tools to facilitate image generation tasks, such as fine-tuning and modifying the models. Stability AI's approach to image synthesis has contributed to creating detailed, scalable images while maintaining efficiency.

2 Reviews

Downloads: 212 This Week

Last Update: 2025-02-28
See Project
25

voxshare_gui

*VoxShare* is a simple Python-based push-to-talk multicast voice chat

VoxShare is a simple Python-based push-to-talk multicast voice chat application with a sleek modern GUI built using CustomTkinter. Provided as python source code or compiled standalone windows application (no need to install anything).

Downloads: 33 This Week

Last Update: 2025-07-01
See Project