Search Results for "open source png text"

Showing 47 open source projects for "open source png text"


Filters: Sound/Audio, Python

1

Speakr

Speakr is a personal, self-hosted web application

Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications.

Downloads: 0 This Week

Last Update: 2026-05-09
2

Text to Chord

Turn words into chords

Convert words and sentences to five-note chords you can use to inspire music creation. Have fun turning your name, your city name, your friends' names, your team's name, or your pet's name into wild and original harmonies that go beyond serialism and classic jazz.

Downloads: 5 This Week

Last Update: 2025-12-06
3

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, and YouTube videos, as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources, enabling customization and scale.

Downloads: 3 This Week

Last Update: 2024-11-16
4

PersonaPlex

PersonaPlex code

PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. ...

Downloads: 0 This Week

Last Update: 2026-03-02
5

Moshi

A speech-text foundation model for real time dialogue

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi compresses 24 kHz audio down to a 12.5 Hz representation with a bandwidth of 1.1 kbps in a fully streaming manner (with a latency of 80 ms, the frame size), yet performs better than existing non-streaming codecs like SpeechTokenizer (50 Hz, 4 kbps) or SemantiCodec (50 Hz, 1.3 kbps). Moshi models two streams of audio: one corresponds to Moshi, and...

Downloads: 0 This Week

Last Update: 2024-11-05
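
The Mimi figures quoted in the Moshi entry can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers given in the description (24 kHz input, 12.5 Hz frame rate, 1.1 kbps); the variable names are illustrative assumptions, and nothing here calls the Moshi or Mimi code itself.

```python
# Figures taken from the project description above.
SAMPLE_RATE_HZ = 24_000   # input audio sample rate
FRAME_RATE_HZ = 12.5      # frames per second in the codec representation
BITRATE_BPS = 1_100       # 1.1 kbps

# One frame spans 1 / 12.5 s, which matches the stated 80 ms latency.
frame_duration_ms = 1000 / FRAME_RATE_HZ

# Number of raw audio samples summarized by each frame.
samples_per_frame = SAMPLE_RATE_HZ / FRAME_RATE_HZ

# Bit budget available to encode each frame at the stated bitrate.
bits_per_frame = BITRATE_BPS / FRAME_RATE_HZ

print(frame_duration_ms)   # 80.0
print(samples_per_frame)   # 1920.0
print(bits_per_frame)      # 88.0
```

So the stated 80 ms latency is simply the duration of one 12.5 Hz frame, and each frame stands in for 1920 input samples using about 88 bits.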
6

AudioCraft

AudioCraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides...

Downloads: 9 This Week

Last Update: 2025-10-13
7

Midi_Master_Clock

A simple command-line MIDI master clock

MIDI Master Clock: Simple MIDI Song and Control Change Manager

MIDI Master Clock is a lightweight, command-line utility for effortless MIDI device control and synchronization. It lets you define songs or cues in a simple text file, each with its own BPM and custom MIDI Control Change (CC) messages.

What It Does:
- Reads Song Data: Configure song titles, BPM, and MIDI CCs (e.g., CC_ch0 10) in an easy-to-read data.txt file.
- Interactive Control: Select songs from a list, advance through...

Downloads: 1 This Week

Last Update: 2025-06-17
8

Text to Waveform

Create synth presets from words

Convert words to waveforms you can load into a synthesizer oscillator to create synth presets. Have fun turning your name, your friends' names, your city name, your pet's name, your team's name into synth presets you can use to produce a track.

Downloads: 0 This Week

Last Update: 2023-12-09
9

psgdump

Dump PSG/YM chiptune files to text and MIDI formats

PSGDump is a parser and converter for chiptune files. It supports the PSG and YM input file formats, focusing on AY/YM chiptunes from the ZX Spectrum and Atari ST. The tool produces text output of the notes played and creates a multi-track MIDI file.

Downloads: 0 This Week

Last Update: 2022-09-19
10

18k-youtube-download

❤️ 18k-youtube-download with python and kivy Dev.Wk-18k

18k-youtube-download: a simple project that combines a Kivy GUI with downloading music from YouTube using the youtube_dl package.

Downloads: 1 This Week

Last Update: 2022-02-25
11

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained with machine learning techniques based on Baidu's Deep Speech research paper, and it uses Google's TensorFlow to make the implementation easier.

Downloads: 4 This Week

Last Update: 2021-04-08
12

FastoCloud PRO

IPTV/NVR/CCTV/Video cloud https://fastocloud.com

IPTV/Video cloud

Features:
- Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian)
- GPU/CPU Encode/Decode/Post Processing
- Stream statistics
- CCTV
- Adaptive HLS streams
- Load balancing
- Temporary URLs
- HLS push
- EPG scanning
- Subtitles to text conversions
- AD insertion
- Logo overlay
- Video effects
- Relays
- Timeshifts
- Catchups
- Playlists
- Restream/Transcode from online streaming services like YouTube, Twitch ...

Downloads: 0 This Week

Last Update: 2020-06-20
13

Defox text to speech and downloader

Read written or imported text aloud offline, or download the speech online.

This software is designed to convert text to speech and download the converted speech.

Description:
• Installation setup in two languages (English, French)
• Two areas, called text reading and speech downloading
• Many languages supported in the download center

Note 1: I'm still a student and I'm not in the software design industry, so I may not have strong software development skills. I'm worried about that!

Note 2: When you double click on the software maybe it will get some seconds...

1 Review

Downloads: 0 This Week

Last Update: 2019-09-27
14

oscp

multiplatform, small and handy audio/video player with network remote

oscp by ariel/KotCzarny @ irc://irc.freenode.com/h3droid

tiny media player:
- libav (mp3, wav, ogg, aac, mpc, wma, flac, ape, avi, mkv, flv, etc.)
- wildmidi (mid)
- xmp (mod, med, xm, s3m, it, dbm, psm, omx, okt, digi, 669, mtm, acid, umx)
- gme (ay, gbs, gym, hes, kss, nsf, nsfe, sap, spc, vgm)
- sidplay2 (sid, psid, info)
- mdxplay (mdx)
- fc14dec (fc, fc13, fc14)
- sc68 (sc68, sndh)
- asapconv (sap, cmc, cm3, cmr, cms, dmc, dlt, mpt, mpd, rmt, tmc, tm8, tm2)
- ...

Downloads: 0 This Week

Last Update: 2019-06-08
15

Canorus

Music score editor

Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, MIDI playback of notes, chord markings, lyrics, and import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond.

8 Reviews

Downloads: 23 This Week

Last Update: 2020-01-04
16

AngelReader

An E-book, Audio-book, & Library Loader in One Application

AngelReader: A minimalist but powerful GUI application that has the capacity to load [1] E-books in plain text format with the least use of both software and hardware resources. It can also load [2] Audio-books with the basic functions of play, stop, pause, and resume with the same minimalist economy that doesn't hog computer resources. When used in integration with the AngelReader Library Selector, it can function as a mini library management system for books in electronic formats. It's in...

Downloads: 0 This Week

Last Update: 2018-10-16
17

puddletag

Simple, powerful audio tagger for GNU/Linux

puddletag is an audio tag editor (primarily created) for GNU/Linux, similar to the Windows program Mp3tag. Unlike most taggers for GNU/Linux, it uses a spreadsheet-like layout so that all the tags you want to edit by hand are visible and easily editable. The usual tag editor features are supported, like extracting tag information from filenames, renaming files based on their tags using patterns, and basic tag editing. Then there are Functions, which can do things like replace text,...

12 Reviews

Downloads: 6 This Week

Last Update: 2015-10-12
18

Analyzer for Files

A tool to look into file contents

Analyzer for Files (AoF) is a tool to look into file contents, analyze the structure with installed plug-ins, and, if successful, show the results in several split windows, including the converted data and a tree. It was designed as a workbench with a core and plug-in extensions. It can handle plain-text files and data, as well as complex binaries supported by the corresponding plug-ins. Developers can also deploy and release their own plug-ins according to the plug-in developing...

Downloads: 0 This Week

Last Update: 2015-05-17
19

Steel TTS

A cross-platform wrapper for common text-to-speech engines in Python

Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software:
- Microsoft Speech API 5 (SAPI5)
- eSpeak
- NS Speech Synthesis
- FreeTTS

Documentation: http://sourceforge.net/p/steeltts/wiki/
Bug Tracker: http://sourceforge.net/p/steeltts/tickets/

If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead...

Downloads: 0 This Week

Last Update: 2016-03-15
20

AarTon

AarTon is an automated text-to-speech application. It allows users to enter text in a web-based front end and renders the text via a multi-channel sound card.

Downloads: 0 This Week

Last Update: 2013-11-14
21

Speect

...Speect is free and open source software. As a collection it is distributed under an MIT license.

Downloads: 0 This Week

Last Update: 2013-05-30
22

PhonoRipper

A GUI tool for creating audio CDs from analog sources.

PhonoRipper is an easy to use, all-in-one GUI tool for creating audio CDs from analog sources like vinyl records. It uses Cdrdao for CD writing and Normalize for volume normalization. Features include a fully buffered WAV file recorder with mixer and VU meter (supporting both ALSA and OSS drivers), automatic detection of track borders, an intuitive drag'n'drop CD writing interface with CD-text support, CDDB queries of CD text data, and a WYSIWYG CD cover editor.

Downloads: 0 This Week

Last Update: 2012-11-17
23

EnKoDeur-Mixeur

EnKoDeur-Mixeur (EKD) is open source software for video, picture, and audio post-production. It can also be used to convert videos between many formats. It is written in Python and uses the PyQt4 bindings.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-30
24

Canorus - a music score editor

Canorus is the next-generation music score editor. It is the official successor of NoteEdit. It uses Qt4, has scripting capabilities, and is built on modern development standards such as Eclipse, design patterns, and unit tests.

1 Review

Downloads: 1 This Week

Last Update: 2014-06-09
25

Mahori

Traditional Siamese music transcription takes the form of a sequence of either Thai letters or numbers. This program reads the music transcription from text files and plays the transcribed music, which could help composers write new songs more easily.

Downloads: 0 This Week

Last Update: 2015-08-01

