audio development free download

21 projects for "audio development" with 2 filters applied:

Artificial Intelligence ChromeOS Clear Filters & Widen Search

Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
Go from Code to Production URL in Seconds
Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.

Try it free
1

AudioCraft

Audiocraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides...

Downloads: 8 This Week

Last Update: 2025-10-13
See Project
2

II ElevenLabs UI

Component library and custom registry built on top of shadcn/ui

ElevenLabs UI is an open-source component library designed to accelerate the development of multimodal AI applications, particularly those involving voice agents and audio-based interactions. Built on top of modern frontend tooling such as React, Tailwind CSS, and shadcn/ui, it provides a collection of pre-built, customizable components that developers can easily integrate into their applications. The library includes specialized UI elements such as audio players, waveform visualizers, conversational interfaces, and interactive voice components, all tailored for building agent-driven experiences. ...

Downloads: 0 This Week

Last Update: 2026-03-19
See Project
3

Groq TypeScript / Node.s

The official Node.js / Typescript library for the Groq API

Groq TypeScript / Node.s (also often referred to as “groq-sdk” on npm) is the official Node.js / TypeScript client library for Groq’s REST API, enabling JavaScript/TypeScript developers to integrate LLM and AI-powered services into web backends, serverless functions, or frontend apps. It exports strongly-typed interfaces for models, chat completions, file uploads (e.g. for audio transcription), and other endpoints, allowing for better type safety and developer experience when using Groq from...

Downloads: 0 This Week

Last Update: 2026-05-11
See Project
4

Groq Python

The official Python Library for the Groq API

...For those building AI-based tooling, automation scripts, or ML-backed backends, groq-python abstracts away HTTP request plumbing and exposes a clean API, accelerating development and reducing boilerplate.

Downloads: 0 This Week

Last Update: 2026-05-11
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
5

Pixeltable

Data Infrastructure providing an approach to multimodal AI workloads

...Developers define data transformations and AI operations using computed columns on tables, allowing pipelines to evolve incrementally as new data or models are added. The framework supports multimodal content including images, video, text, and audio, enabling applications such as retrieval-augmented generation systems, semantic search, and multimedia analytics.

Downloads: 2 This Week

Last Update: 6 days ago
See Project
6

Spring AI Alibaba Examples

Spring AI Alibaba examples for building and testing AI apps

...It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph workflows, and retrieval-augmented generation. The examples highlight how to integrate AI models, manage prompts, handle memory, and build multi-model or multi-agent workflows. Developers can explore individual project folders for detailed instructions and implementation guidance. Spring AI Alibaba Examples also supports experimentation through playground modules and encourages contributions to expand real-world AI use cases and improve development practices.

1 Review

Downloads: 3 This Week

Last Update: 5 days ago
See Project
7

MediaPipe Solutions

Cross-platform, customizable ML solutions

MediaPipe is an open-source framework developed by Google for building cross-platform machine learning pipelines that process audio, video, and other streaming data in real time. The system provides developers with tools and reusable components that allow them to combine multiple machine learning models with preprocessing and postprocessing logic into efficient perception pipelines. These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web...

Downloads: 0 This Week

Last Update: 2026-04-23
See Project
8

OmAgent

Build multimodal language agents for fast prototype and production

OmAgent is an open-source Python framework designed to simplify the development of multimodal language agents that can reason, plan, and interact with different types of data sources. The framework provides abstractions and infrastructure for building AI agents that operate on text, images, video, and audio while maintaining a relatively simple interface for developers. Instead of forcing developers to implement complex orchestration logic manually, the system manages task scheduling, worker coordination, and node optimization behind the scenes. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
9

Jina-Serve

Build multimodal AI applications with cloud-native stack

Jina Serve is an open-source framework designed for building, deploying, and scaling AI services and machine learning pipelines in production environments. The framework allows developers to create microservices that expose machine learning models through APIs that communicate using protocols such as HTTP, gRPC, and WebSockets. It is built with a cloud-native architecture that supports deployment on local machines, containerized environments, or large orchestration platforms such as...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

Acontext

Context data platform for building observable, self-learning AI agents

Acontext is a cloud-native context data platform designed to support the development and operation of advanced AI agents. It provides a unified system to store and manage contexts, multimodal messages, artifacts, and task workflows, enabling developers to engineer context effectively for their agent products. The platform observes agent tasks and user feedback in real time, offering robust observability into workflows and helping teams understand how agents perform over time. Acontext also...

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
11

Audio AI Timeline

A timeline of the latest AI models for audio generation

Audio AI Timeline is a curated project that organizes the development of audio-related artificial intelligence into a structured and accessible historical timeline. Rather than functioning as a model training framework, it serves as an informational resource that maps key papers, systems, models, datasets, and milestones across areas such as speech synthesis, music generation, audio understanding, source separation, and general audio machine learning. ...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
12

Modular Audio Recognition Framework

MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.

3 Reviews

Downloads: 1 This Week

Last Update: 2015-10-06
See Project
13

InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16
See Project
14

Speect

Speect is a multilingual TTS system. It offers a full text-to-speech system with various API's, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG. Speect is free and open...

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
15

BayesianCortex

simple algorithm for a realtime interactive visual cortex for painting

A paint program where the canvas is the visual cortex of a simple kind of artificial intelligence. You paint with the mouse into its dreams and it responds by changing what you painted gradually. There will also be an API for using it with other programs as a general high-dimensional space. Each pixel's brightness is its own dimension. Bayesian nodes have exactly 3 childs because that is all thats needed to do NAND in a fuzzy way as Bayes' Rule which is NAND at certain extremes. NAND can be...

Downloads: 0 This Week

Last Update: 2013-05-30
See Project
16

Scalable Language API

Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.

Downloads: 0 This Week

Last Update: 2018-01-22
See Project
17

Speedy Composer

Speedy Composer – Artificial Neural Network Melody Composer.

Thank you for your interest in Speedy Composer. Speedy Composer is an automated application for composing melodies for Speedy Net members. We recently made changes to the source code of Speedy Net, and converted it into the Python language and Django framework. Since Speedy Composer was originally written in PHP, it is not adapted to work with Speedy Net in its current form. So unfortunately we were forced to temporarily close the app Speedy Composer. But don't worry, we kept backups of all...

Downloads: 0 This Week

Last Update: 2020-09-24
See Project
18

Italian FESTIVAL

Development of the Italian Version of FESTIVAL Text to Speech synthesis system

Downloads: 9 This Week

Last Update: 2014-05-18
See Project
19

MusiComp

musicomp is a program which most important element is an evolutionary algorithm which uses data mining methods as a fitness function to generate monophone melodies.

Downloads: 0 This Week

Last Update: 2014-06-20
See Project
20

MiMo-V2.5

Omnimodal AI model for agents, coding, and long-context tasks

...It also integrates advanced components such as multi-token prediction modules and specialized vision and audio encoders, making it well-suited for autonomous agents and software development.

Downloads: 0 This Week

Last Update: 2026-05-04
See Project
21

wav2vec2-large-xlsr-53-russian

Russian ASR model fine-tuned on Common Voice and CSS10 datasets

...It is ideal for Russian voice transcription tasks in research, accessibility, and interface development. The training was made possible with compute support from OVHcloud, and the training scripts are publicly available for replication.

Downloads: 0 This Week

Last Update: 2025-07-01
See Project