Showing 75 open source projects for "input-leap"

  • 1
    Mozc Devices

    Circuit diagrams and firmware source code for Gboard DIY keyboards

    ...These devices creatively reinterpret how users can interact with Japanese text input, blending humor, engineering, and physical computing. The repository serves as an archive of the schematics, firmware, and PCB designs for these inventive input mechanisms, with many projects including promotional videos and technical references.
    Downloads: 9 This Week
  • 2
    ChatterBot

    Machine learning, conversational dialog engine for creating chat bots

    ...An untrained instance of ChatterBot starts off with no knowledge of how to communicate. Each time a user enters a statement, the library saves the text that they entered and the text that the statement was in response to. As ChatterBot receives more input, the number of responses it can give increases.
    Downloads: 4 This Week
  • 3
    Real-ESRGAN GUI

    Cross-platform GUI for image upscaler Real-ESRGAN

    ...According to actual measurements, the arm64 single-architecture build performs better than the universal2 dual-architecture build on Apple silicon Macs, so Apple silicon users are advised to package the arm64 single-architecture application themselves. Real-ESRGAN can only enlarge the input image by a fixed 2-4x magnification (depending on the selected model). This functionality is achieved by downsampling with a conventional scaling algorithm after multiple calls to Real-ESRGAN. For GIFs, each frame is split out and its duration recorded; the frames are upscaled one by one and then merged. Drag an image file or directory anywhere in the window, and its path is automatically set as the input.
    Downloads: 85 This Week
  • 4
    Audiomentations

    A Python library for audio data augmentation

    ...Useful if your original sound is clean and you want to simulate an environment where background noise is present. A folder of (background noise) sounds to be mixed in must be specified. These sounds should ideally be at least as long as the input sounds to be transformed. Otherwise, the background sound will be repeated, which may sound unnatural. Note that the gain of the added noise is relative to the amount of signal in the input. This implies that if the input is completely silent, no noise will be added.
    Downloads: 2 This Week
  • 5
    Noto Emoji

    Noto Emoji fonts

    Noto Emoji ("Noto" stands for "No Tofu") is an open-source (Open Font License 1.1) emoji library that provides standard Unicode emoji support and tools for working with emoji.
    Downloads: 47 This Week
  • 6
    Bunkr Downloader

    Yet another bunkrr album downloader

    ...It is implemented in Python and focuses on providing a structured workflow for downloading, organizing, and synchronizing media collections. The tool allows users to input album URLs and automatically download all associated media while maintaining a local metadata database for tracking content. It includes features for managing albums as a persistent library, enabling users to add, remove, or sync albums over time without re-downloading existing files. bonkrr also categorizes downloaded content into different media types such as images, videos, and archives, improving organization and usability. ...
    Downloads: 144 This Week
  • 7
    SpeechRecognition

    Speech recognition module for Python

    ...The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
    Downloads: 7 This Week
  • 8
    PersonaLive

    Expressive Portrait Image Animation for Live Streaming

    PersonaLive is an open-source diffusion-based portrait animation framework focused on generating expressive, long-duration animated sequences in real time, primarily for live streaming or interactive applications. It leverages deep generative models that condition on a static reference image and a driving input (such as motion or expression cues) to produce a seamless animated portrait sequence that can run indefinitely without segmentation artifacts. The framework prioritizes low-latency and streamable output, making it suitable for real-time creative workflows, broadcast overlays, or interactive avatars on consumer-grade GPUs. PersonaLive’s architecture balances visual quality and efficiency by combining motion encoding, temporal modules, and hybrid implicit control signals to preserve identity and stable expression through long sequences.
    Downloads: 5 This Week
  • 9
    ffmpeg-normalize

    Audio Normalization for Python/ffmpeg

    ...It can process multiple files in batch mode, making it suitable for large media libraries or production workflows. ffmpeg-normalize also preserves metadata and supports a wide range of input and output formats. Its design emphasizes accuracy and compliance with broadcasting standards. Overall, it provides a reliable solution for achieving consistent audio quality in multimedia content.
    Downloads: 3 This Week
  • 10
    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, and YouTube videos, as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources, enabling customization and scale.
    Downloads: 3 This Week
  • 11
    ffmpy

    Pythonic interface for FFmpeg/FFprobe command line

    ffmpy is a Python wrapper that provides a simple and Pythonic interface for constructing and executing FFmpeg and FFprobe command-line operations. It abstracts command generation into structured Python objects, making it easier to define inputs, outputs, and parameters programmatically. The library uses Python’s subprocess module to run compiled commands, ensuring compatibility with standard FFmpeg installations. It is designed for developers who want to automate media processing tasks...
    Downloads: 1 This Week
  • 12
    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. ...
    Downloads: 2 This Week
  • 13
    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 2 This Week
  • 14
    ipyvizzu

    Build animated charts in Jupyter Notebook and similar environments

    Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax. ipyvizzu is an animated charting tool for Jupyter, Google Colab, Databricks, Kaggle, and Deepnote notebooks, among other platforms. It enables data scientists and analysts to use animation for storytelling with data in Python, and it is built on the open-source JavaScript/C++ charting library Vizzu. There is a new extension of ipyvizzu, ipyvizzu-story, with which the animated charts...
    Downloads: 2 This Week
  • 15
    ML Sharp

    Sharp Monocular View Synthesis in Less Than a Second

    ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to generate new views interactively. ...
    Downloads: 2 This Week
  • 16
    pybaselines

    Library of algorithms for baseline correction of experimental data

    pybaselines is a Python library that provides many different algorithms for performing baseline correction on data from experimental techniques such as Raman, FTIR, NMR, XRD, XRF, PIXE, etc. The aim of the project is to provide a semi-unified API that allows quick testing and comparison of multiple baseline correction algorithms to find the best one for a set of data. pybaselines has 50+ baseline correction algorithms. These include popular algorithms, such as AsLS, airPLS, ModPoly, and SNIP, as...
    Downloads: 1 This Week
  • 17
    Mesh R-CNN

    code for Mesh R-CNN, ICCV 2019

    ...Unlike voxel-based or point-based approaches, Mesh R-CNN uses a differentiable mesh representation, allowing it to efficiently refine surface geometry while maintaining high spatial detail. The system combines 2D detection from Mask R-CNN with 3D reasoning modules that output full mesh reconstructions aligned with the input image. It has been evaluated on datasets such as Pix3D, where it demonstrates state-of-the-art performance in reconstructing real-world object geometry.
    Downloads: 2 This Week
  • 18
    PyLivestream

    Pure Python FFmpeg-based live video / audio streaming to YouTube

    PyLivestream is a Python-based tool that enables real-time video streaming from various input sources to platforms such as YouTube and Twitch. It acts as a wrapper around FFmpeg, allowing users to stream video from cameras, files, or screen capture devices with minimal configuration. The tool supports cross-platform operation and integrates easily into Python workflows, making it suitable for automation and scripting. It provides options for controlling streaming parameters such as bitrate, resolution, and codecs. ...
    Downloads: 0 This Week
  • 19
    Moshi

    A speech-text foundation model for real time dialogue

    ...Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80 ms, the frame size), yet performs better than existing non-streaming codecs like SpeechTokenizer (50 Hz, 4 kbps) or SemantiCodec (50 Hz, 1.3 kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other to the user. At inference, the user's stream is taken from the audio input, and Moshi's stream is sampled from the model's output. Alongside these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter-codebook dependencies for a given time step, while a large, 7B-parameter Temporal Transformer models the temporal dependencies.
    Downloads: 0 This Week
  • 20
    Segmentation Models

    Segmentation models with pretrained backbones. PyTorch

    ...PyTorch Image Models (a.k.a. timm) has a lot of pretrained models and an interface that allows using these models as encoders in smp; however, not all models are supported. The input channels parameter allows you to create models that process tensors with an arbitrary number of channels.
    Downloads: 0 This Week
  • 21
    OpenPiano — Virtual Piano for Windows

    Desktop piano playable with a PC keyboard, mouse, or MIDI device.

    ...Project links: Website: https://www.justagwas.com/projects/openpiano GitHub: https://github.com/Justagwas/openpiano Documentation: https://github.com/Justagwas/openpiano/wiki The application is fully open source. All input handling, playback, and MIDI recording/export are performed on the user's machine. VirusTotal scan result: https://www.virustotal.com/gui/file/af5e09c7433d2786129a9541bf0cd1d3dd5ba5fa7ab88a92753975795f81fbce
    Downloads: 164 This Week
  • 22
    Internet DJ Console

    A feature packed DJ console and internet radio client for Linux users

    Conceived as an internet radio Shoutcast/Icecast client and DJ console, IDJC has two main media players, a background track player, effects buttons, a crossfader, webm, aac, ogg, and mp3 streaming, stream automation timers, aux input, and voice and VoIP integration. Supported media file formats include mp3, ogg, flac, wma, wav, m4a, m3u, xspf, pls, and cue sheets. It also provides IRC track and station announcements and uses the JACK Audio Connection Kit to provide a flexible audio chain. This list of features is by no means complete. More info at the project homepage.
    Downloads: 14 This Week
  • 23
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 13 This Week
  • 24
    Notes To Color

    Convert melodies to colors

    Input notes of a melody and see the program create a new color by mixing the colors corresponding to the notes, according to the theory that associates the 7 notes with the 7 colors of the rainbow.
    Downloads: 0 This Week
  • 25
    vahatraker

    a live MIDI sequencer for JACK

    ...Adhering to Unix philosophy, driving MIDI is the one thing it tries to do well, adhering to other doctrines - enabling expression, with added value of: - live editing - fast workflow - intuitive midi-in - unheard of time signatures - scenes a'la 'ton - fractal turtles - fits on a floppy - doesn't make sound Frankly speaking, vht was envisaged as a re-creation of seq24 in tracker form for author's "studio needs" and offers similar functionality (and limitations). It relies 100% on JACK audio connection kit for input/output/synch and uses jack_capture for rendering. The GUI has similar dependencies as gnome-calculator and tracker paradigm was chosen to allow for rhythmic gymnastics otherwise hard to convey. Low level stuff was done in C and wrapped in Python. Human interfacing part of contraption employs GTK through gobject introspection and was also contrived in the language we shall no longer spam abo
    Downloads: 1 This Week