learning free download

Showing 67 open source projects for "learning"

View related business solutions

Sound/Audio Windows Clear Filters & Widen Search

AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
1

Audiomentations

A Python library for audio data augmentation

A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio and multichannel audio. Can be integrated in training pipelines in e.g. Tensorflow/Keras or Pytorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products. Mix in another sound, e.g. a background noise. Useful if your original sound is clean and you want to simulate an environment where background noise is present. ...

Downloads: 0 This Week

Last Update: 2025-09-13
See Project
2

AudioCraft

Audiocraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides...

Downloads: 20 This Week

Last Update: 2025-10-13
See Project
3

Librosa

Python library for audio and music analysis

...Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.

Downloads: 2 This Week

Last Update: 2025-07-03
See Project
4

Speakr

Speakr is a personal, self-hosted web application

...It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. ...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
Fully Managed MySQL, PostgreSQL, and SQL Server
Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free
5

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output.

Downloads: 14 This Week

Last Update: 2026-03-23
See Project
6

EarQuiz Frequencies

Software for technical ear training on equalization

...This application is based on (and deeply inspired by) the world-renowned Golden Ears method of David Moulton, whose course is half dedicated to building this essential critical listening skill. The overall training process involves ongoing learning and testing yourself. In the Learn mode, you listen to the pink noise or music (or other external audio) excerpts with switched off and on 1-octave or 1/3-octave graphic EQ, boosting or cutting frequency bands within certain spectral ranges. Then in the Test mode you are given a sequence of 10 similar examples, where you try to guess, which frequencies are boosted or cut, and you get scored. ...

1 Review

Downloads: 4 This Week

Last Update: 2025-01-10
See Project
7

tksolfege ear training program

Music ear training exercises

Tksolfege is an ear training program for learning to recognize chords, intervals, perform rhythm dictation, solfege dictation and singing solfege sequences. As you will discover, it is not an easy program to install and setup on your computer. You will also require to install the tcl/tk interpreter, fluidsynth, and at least one soundfont file. On Windows 11, you may also need to configure the operating system to show the file extension and the hidden directories.

Downloads: 1 This Week

Last Update: 2026-02-04
See Project
8

LAME (Lame Aint an MP3 Encoder)

A high quality MP3 encoder

LAME is an educational tool to be used for learning about MP3 encoding. The goal of the LAME project is to improve the psycho acoustics, quality and speed of MP3 encoding. Note: we provide source code only!

Downloads: 20,797 This Week

Last Update: 2023-12-18
See Project
9

NAM-Runner

Batch file to install and run NAM (neural-amp-modeler) easily.

A Windows 10 batch file, that installs and runs the NAM model trainer (neural-amp-modeler) by Steven Atkinson right into the GUI application. Fully automated. Custom one-time installation of everything you need to train neural network models of guitar amps and more for the NAM VST plugin, no Conda required. Runs as a launcher afterwards. Portable installation. New pyTorch inclues CUDA runtime for fast Nvidia GPU support. No command line, python or conda knowledge needed! Just double click.

Downloads: 2 This Week

Last Update: 2023-06-14
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
10

audioFlux

A library for audio and music analysis, feature extraction.

audioflux is a deep learning tool library for audio and music analysis, feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training, and is used to study various tasks in the audio field such as Classification, Separation, Music Information Retrieval(MIR) and ASR etc.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
11

EnCodec

State-of-the-art deep learning based audio codec

Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...

Downloads: 0 This Week

Last Update: 2025-10-12
See Project
12

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure.

Downloads: 4 This Week

Last Update: 2022-09-03
See Project
13

DuranDuranbot

Teachable/trainable artificially intelligent music bot

A teachable/trainable artificially intelligent music bot fundamentally inspired by how the new wave band Duran Duran composes music. This program utilizes many algorithmic/AI techniques/processes, including machine learning; which allow you to teach/train it to compose music which you prefer... and the technique which is the foundation of the design of DuranDuranbot, which was directly inspired by how Duran Duran writes music........ Called, "bit by bit circular composition"....... and it's explanation can be found here - https://scsynth.org/t/bit-by-bit-circular-composition/1107 This program is written in the SuperCollider programming language - https://en.wikipedia.org/wiki/SuperCollider Contact - ken_brant@ymail.com

Downloads: 2 This Week

Last Update: 2022-12-25
See Project
14

Winds

A Beautiful Open Source RSS & Podcast App Powered by Getstream.io

...For Winds the follow suggestions and the list of articles from the feeds you follow is powered by Stream. Stream accounts are free for up to 3 million feed updates and handle personalization (machine learning) for up to 100 users.

Downloads: 2 This Week

Last Update: 2024-09-17
See Project
15

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Downloads: 2 This Week

Last Update: 2021-04-08
See Project
16

XZVoice

Free and open source text-to-speech software

...Technically, multi-level rhythmic pauses are taken into account to achieve the purpose of natural synthesizing rhythm, and comprehensively use acoustic parameters and linguistic parameters to establish multiple automatic prediction models based on deep learning. Using massive audio data to train the pronunciation model, the synthetic sound is real, full, cadenced, and expressive, and the MOS score has reached the professional level in the industry.

Downloads: 0 This Week

Last Update: 2022-10-04
See Project
17

TTS

Deep learning for text to speech

TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed, and quality. TTS comes with pre-trained models, tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, Tensorflow and TFLite. Tools to curate Text2Speech datasets underdataset_analysis. Demo server for model testing. Notebooks for extensive model...

Downloads: 7 This Week

Last Update: 2021-10-18
See Project
18

X32 Scene Parser

An X32 scene management tool

This parsing tool can be used to extract sections of a Behringer X32 or Midas M32 scene file in order to create specialized snippets.

Downloads: 41 This Week

Last Update: 2021-05-16
See Project
19

X-Air Scene Parser

An X-Air scene management tool

The X-Air/M-Air does not include snippets (like the X32), This parsing tool is a port of the X32 Scene Parser that can be used to create modified scene files which can function like snippets.

Downloads: 9 This Week

Last Update: 2021-05-16
See Project
20

Piano Booster

Boost your Piano playing skills

A MIDI file player that teaches you how to play the piano. PianoBooster is a fun way of playing along with a musical accompaniment and at the same time learning the basics of reading musical notation. see: https://www.pianobooster.org/

9 Reviews

Downloads: 113 This Week

Last Update: 2020-12-13
See Project
21

OpenOffice.org Export As DAISY

odt2daisy is an OpenOffice.org Writer extension, enabling to export in DAISY XML, Full DAISY (xml+audio) and Audiobook format. DAISY is an NISO Z39.86 standard for blind, visual impaired, print-disabled, and learning-disabled people.

3 Reviews

Downloads: 3 This Week

Last Update: 2020-12-07
See Project
22

FastoCloud PRO

IPTV/NVR/CCTV/Video cloud https://fastocloud.com

IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch Mozaic Many Outputs Physical Inputs Streaming Protocols File Formats Presets Vods/Series server-side support Pay per view channels Channels on demand HTTP Live Streaming (HLS) server-side support Public API, client server communication via JSON RPC Protocol gzip compression Deep learning video analysis Supported deep learning frameworks: Tensorflow NCSDK Caffe ML Hardware:

Downloads: 0 This Week

Last Update: 2020-06-20
See Project
23

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...

2 Reviews

Downloads: 1 This Week

Last Update: 2020-03-15
See Project
24

midipiano_chung

midipiano chung is a free open source standalone virtual midi acoustic piano synthesizer /expander with samples based sounds and originals dsps effects, connected to the choozen virtual midi input (up to 3 if any) and midiout(thru) ports of your computer.It is easily extensible by adding or modifying sound files (mp3,wav) in the /sounds/ folder.Works well as output for midi_chung player and midirec_chung recorder or an external usb midi master keyboard. It is written in compiled freebasic and uses fbsound(freebasic). Can run on a small netbook . Autochord , themeonly, learn (automatic chord learning from played data) functions added to recorder/player .( auto adds/replace chords to any music melody ). can record and export to midifiles. included brainpiano_chung a version with neural network brain autochord trial. (it really adds chords) included brainpiano2_chung version with custom neural network music generator . amazing brainpiano3_chung added quantize trial added to recorder

Downloads: 3 This Week

Last Update: 2019-10-28
See Project
25

AngelReader

An E-book, Audio-book, & Library Loader in One Application

...It's in fact three apps integrated into one intuitively portable, cross platform, robust software. The app can be useful for either classroom or office environment; for personal or commercial use; for general learning or access to practical information.

Downloads: 0 This Week

Last Update: 2018-10-16
See Project