Showing 30 open source projects for "image to sound"

View related business solutions
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    LTX-2.3

    LTX-2.3

    Official Python inference and LoRA trainer package

    LTX-2.3 is an open-source multimodal artificial intelligence foundation model developed by Lightricks for generating synchronized video and audio from prompts or other inputs. Unlike most earlier video generation systems that only produced silent clips, LTX-2 combines video and audio generation in a unified architecture capable of producing coherent audiovisual scenes. The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while...
    Downloads: 158 This Week
    Last Update:
    See Project
  • 2
    Image To Chord
    Convert images to chords.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    ImageBind

    ImageBind

    ImageBind One Embedding Space to Bind Them All

    ImageBind is a multimodal embedding framework that learns a shared representation space across six modalities—images, text, audio, depth, thermal, and IMU (inertial motion) data—without requiring explicit pairwise training for every modality combination. Instead of aligning each pair independently, ImageBind uses image data as the central binding modality, aligning all other modalities to it so they can interoperate zero-shot. This creates a unified embedding space where representations from any modality can be compared or retrieved against any other (e.g., matching sound to text or depth to image). The model is trained using large-scale contrastive learning, leveraging diverse datasets from natural images, videos, audio clips, and sensor data. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Bulk Image and Audio Merger

    Bulk Image and Audio Merger

    Easily pair images with audio file counterparts in bulk

    Automatically convert your audio files into YouTube-ready videos by pairing them with static images! Perfect for turning podcasts, audio episodes, or music tracks into video format in bulk. Simply match your audio files with corresponding images, and let the program handle the rest. Ideal for content creators who need to process multiple audio files without the hassle of manual video editing. Works on Windows 7 and above. (Source code is included with the program .zip file.)
    Downloads: 3 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    OpenaiBot

    OpenaiBot

    Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant

    If you don't have the instant messaging platform you need or you want to develop a new application, you are welcome to contribute to this repository. You can develop a new Controller by using Event.py. Compatibility with multiple LLMs and integration with GPT and third-party systems is handled by our llm-kira project on GitHub. It can accurately limit billing, with limits and ID binding. Supports asynchronous operations and can handle multiple requests simultaneously. Allows for private and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    linux-file-converter-addon

    linux-file-converter-addon

    Convert various image, audio and video formats from your context menu.

    Convert between various image, audio and video formats using the context menu. The addon is written in Python and available for Nautilus, Nemo, Thunar and Dolphin file viewers. It adds a new option to the context menu to create an easy way to convert between a huge amount of file types. The program offers many options to customize the appearance of its context menu. There are also a few extra formats which can be added by installing optional dependencies. The tool has a built-in auto-update...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    FileConverterX

    FileConverterX

    Convertidor de Archivos

    FileConverterX es una herramienta diseñada para convertir documentos entre distintos formatos, optimizando la gestión y manipulación de archivos. Permite a los usuarios transformar documentos de texto, imágenes y otros tipos de archivos en diferentes formatos de manera rápida y eficiente.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Auto Movie Assembler

    Auto Movie Assembler

    Automating making many trailer-like videos with a single click!

    This program can mass create multiple promotional movies at once using only these elements: - Pre-recorded .mp4 video clips. - Title card .png image file. - Ending card .png image file. - Sound effect 1 that plays during the Title card. - Sound effect 2 that plays during the Ending card. It will join the video clips in a alphabetical order, apply a Fade from Black transitions to all them individually, place a Title Card + sound effect after the first clip and an Ending Card with stylish Fade from White effect, also with its own sound effect. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Color to Waveform

    Color to Waveform

    Convert colors to synth presets

    The purpose of the program is to convert a color to a waveform you can use as a synthesizer oscillator inside a DAW such as FL Studio from Image Line. Many synths are provided with an option to load your own waveform, to replace the basic saw, square and sine waveforms commonly used to create synth sounds. The waveform generated by the program will correspond to the subliminal synesthetic sensation of the selected color. You can create your own synth presets to use in a track using color as a base.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Intention Repeater MAX

    Intention Repeater MAX

    Repeating your Intentions to aid in manifestation

    Please see the README.txt. The ServitorConnect 4443 and Python Daemon and Intention Repeater Android are better because repeating once-per-hour is better than millions of times per second (or even 3Hz). The archive bundle includes binaries and source code for: MAX and Simple Intention Repeaters CUDA version for Windows/Linux Memory Frequency Generator Multi-Format to WAV Repeater Android app Sourcecode File/Image Writers Nesting Files Creator ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 11
    garysfm

    garysfm

    An advanced file manager with qss themes and iso and folder previews

    garysfm which stands for Gary's File Manager is a file manager with some advanced features. Those features include bulk renaming and folder image previews. I has rather advanced search functions, tab browsing with persistence between launches. It remembers your folder sorting and view options in icon view. It also remembers your active tabs between sessions. It has progress dialog while doing large operations like copying large files, and folders with many files. python version works on...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    Amiga Memories

    Amiga Memories

    A walk along memory lane

    Amiga Memories is a project (started & released in 2013) that aims to make video programmes that can be published on the internet. The images and sound produced by Amiga Memories are 100% automatically generated. The generator itself is implemented in Squirrel, the 3D rendering is done on GameStart 3D. An Amiga Memories video is mostly based on a narrative. The purpose of the script is to define the spoken and written content. The spoken text will be read by a voice synthesizer (Text To Speech or TTS), the written text is simply drawn on the image as subtitles. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Riffusion

    Riffusion

    Real-time music generation using stable diffusion techniques AI

    Riffusion (hobby) is a Python-based open source library designed for real-time music and audio generation using stable diffusion techniques. Riffusion (hobby) works by generating and manipulating spectrogram images, which are then converted into playable audio clips, effectively bridging image-based diffusion models with sound synthesis. It implements a diffusion pipeline that supports prompt interpolation, allowing smooth transitions between different musical styles or prompts over time. Riffusion (hobby) serves as the core implementation for audio and image processing, providing essential building blocks for generating music from text prompts. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    GUIDOLib
    The GUIDOLib provides a powerful engine for the graphic rendering of music scores, based on the Guido Music Notation format. It supports Linux, Mac OS X, Windows, Android and iOS operating systems. A Java JNI interface is available as well as a Javascript version of the library. A Web API has also been designed, allowing to deploy the engine as a Web service.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    VocalEasel is a simple lead sheet editor and music rehearsal tool designed for the needs of Jazz vocalists, using LilyPond to produce printable lead sheets and MMA (Musical Midi Accompaniment) to produce MIDI accompaniments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    FF Multi Converter
    FF Multi Converter is a simple graphical application which enables you to convert audio, video, image and document files between all popular formats, using and combining other programs. It uses ffmpeg for audio/video files, unoconv for document files and ImageMagick for image file conversions.
    Downloads: 35 This Week
    Last Update:
    See Project
  • 17
    Wizards of Magic

    Wizards of Magic

    A fun game inspired by the classic Pong

    ...*Bugfix: When trying to change the language, an error message may appear. To fix that, run the game as administrator. ~ Attributions ~ Python 2.7.9 (https://www.python.org/) Sound effects: - http://bit.ly/1r4pLAl (modified) - http://bit.ly/24lu4ED Background: - http://bit.ly/286TkTT (modified) Image editors: - GraphicsGale - GIMP - Piskel Audio editor: - Audacity Third-party modules and libraries: - PIL 1.1.7 (http://bit.ly/1Xmherh) - Pyganim (http://bit.ly/22B74lQ) - Pygame 1.9.1 (http://bit.ly/286WOFG) - cx_Freeze 4.3.4 (http://bit.ly/1VAcBrJ)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The program MinDia can be used to create, modify and run multimedia slide shows with a diaprojector Rolleivision 35 Twin digital P (with RS232 interface).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Vegastrike evolved

    evolution for vegastrike SpaceSim/Combat/Trade game

    ...Let's Vegastrike be evolved , with new goals : _Dynamic Market system , better AI , mission and campain editor , etc ... _MMO : Create somewhere a dedicated host to help debug the actual working network code . Artists ( Sound , Image , Video , Mesh modelling , Animation ) and Devs are welcome ! Our ShipYard : https://sourceforge.net/p/vegastrikevo/wiki/Shipyard/ Checkout for alpha version : https://sourceforge.net/p/vegastrikevo/svn/HEAD/tree/branch/VS_evolved_v1.0/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    dacapo

    dacapo

    Music player, shows fullscreen coverarts, pictures, lyrics

    Lightweight music player, which shows fullscreen coverarts, pictures, metadata and even song-lyrics (unsynced and synced) stored in the audio-files . Runs on linux. Uses Gtk+ and GStreamer
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21

    GPSMovieGenerator

    Create videos from image and video data linked with GPS tracks

    This is a python script for creating videos from GPS track and media data. It renders a moving map with the respective track. On that map, media thumbnails are displayed. These pop up when the track reaches them. If desired, an overlay map and some track statistics may be added to the video. Python3.3 will be needed An example video can be found here: http://www.youtube.com/watch?v=O_IiXrOfFCw&hd=1 To hard to use this script or don't know about linux/python etc ? get in touch with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    diwacs

    diwacs

    Diwa is a project collaboration tool.

    ...It offers simple solutions for example exporting urls and files to another PC via drag and drop and the files are stored in "Project Folder" and there can be database entry for it as well. Diwa also supports remote controlling another PC running the DiWaCS software and creating events which save screenshots from all the computers and optionally a image capture is taken of the room and a sound recording is saved (if you have the hardware). The project is still in alpha stage so the documentation is not yet complete, but the code is not overly complex so it should be possible to deploy the system without further assistance if required. DiWaWa is the webapplication interface for the system and should go hand-in-hand with DiWaCS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    gPlayer

    gPlayer

    Music player with cover and lyrics display

    Due to name-conflicts, this project has moved to dacapo : https://sourceforge.net/projects/dacapo-player/ Lightweight music player, which shows fullscreen coverarts, pictures, metadata and even song-lyrics (unsynced and synced) stored in the audio-files . Runs on linux and windows. Uses Gtk+ and GStreamer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    EnDiskEx

    Bulk extractor for Ensoniq-formatted disk images

    EnDiskEx is a command-line tool that bulk extracts instruments, sequences, songs, and banks from Ensoniq-formatted disk images (RAW, GKH, EDE, and EDA) for the EPS/ASR family of samplers. The extracted files are saved as EFE / SMF / TXT files. EnDiskEx is designed to extract Ensoniq banks for re-creation within a different DAW. It will track down the instrument and song files from bank references even if they were saved on another disk. There also exists a disk mapping feature to...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    EnKoDeur-Mixeur
    EnKoDeur-Mixeur (EKD) is an open source software which makes videos, pictures and audio post-production. It can be also used to convert videos in many formats. It is written in python and use the PyQt4 bindings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB