training free download

Showing 81 open source projects for "training"

View related business solutions

Multimedia Windows Clear Filters & Widen Search

Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

AudioCraft

Audiocraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling.

Downloads: 6 This Week

Last Update: 2025-10-13
See Project
2

Segmentation Models

Segmentation models with pretrained backbones. PyTorch

...High-level API (just two lines to create a neural network) 9 models architectures for binary and multi class segmentation (including legendary Unet) 124 available encoders (and 500+ encoders from timm) All encoders have pre-trained weights for faster and better convergence. Popular metrics and losses for training routines. All encoders have pretrained weights. Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). It is not necessary in case you train the whole model, not only the decoder. Pytorch Image Models (a.k.a. timm) has a lot of pretrained models and interface which allows using these models as encoders in smp, however, not all models are supported. ...

Downloads: 0 This Week

Last Update: 2025-04-17
See Project
3

SCAIL

Towards Studio-Grade Character Animation via In-Context Learning of 3D

...Given its listing alongside other ZAI projects like speech recognition and text-to-speech systems, SCAIL likely emphasizes scalable, composable AI learning frameworks that support researchers and practitioners in experimenting with learning algorithms, datasets, and model components. The repository structure suggests a focus on flexibility and extensibility, with potential integration into other ZAI tooling for training or analysis.

Downloads: 0 This Week

Last Update: 2026-01-30
See Project
4

Diagrams

Diagram as Code for prototyping cloud system architectures

Diagrams lets you draw the cloud system architecture in Python code. It was born for prototyping a new system architecture without any design tools. You can also describe or visualize the existing system architecture as well. Diagram as Code allows you to track the architecture diagram changes in any version control system. Diagrams currently support main major providers including AWS, Azure, GCP, Kubernetes, Alibaba Cloud, Oracle Cloud, etc. It also supports On-Premise nodes, SaaS and major...

Downloads: 0 This Week

Last Update: 2025-11-22
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
5

tksolfege ear training program

Music ear training exercises

Tksolfege is an ear training program for learning to recognize chords, intervals, perform rhythm dictation, solfege dictation and singing solfege sequences. As you will discover, it is not an easy program to install and setup on your computer. You will also require to install the tcl/tk interpreter, fluidsynth, and at least one soundfont file. On Windows 11, you may also need to configure the operating system to show the file extension and the hidden directories.

Downloads: 25 This Week

Last Update: 2026-02-04
See Project
6

VCClient

Software that uses AI to perform real-time voice conversion

...The project also supports GPU acceleration, enabling faster inference and smoother real-time performance on compatible hardware. Additionally, it includes tools for training and managing voice models, giving users the ability to create personalized voice profiles.

Downloads: 11 This Week

Last Update: 2026-03-23
See Project
7

PML

The easiest way to use deep metric learning in your application

This library contains 9 modules, each of which can be used independently within your existing codebase, or combined together for a complete train/test workflow. To compute the loss in your training loop, pass in the embeddings computed by your model, and the corresponding labels. The embeddings should have size (N, embedding_size), and the labels should have size (N), where N is the batch size. The TripletMarginLoss computes all possible triplets within the batch, based on the labels you pass into it. Anchor-positive pairs are formed by embeddings that share the same label, and anchor-negative pairs are formed by embeddings that have different labels. ...

Downloads: 1 This Week

Last Update: 2025-08-17
See Project
8

The Cognitive3D SDK for Unity

Cognitive3D SDK for Unity, analytics for VR/AR/MR

The most powerful spatial analytics platform to help you understand user behavior within immersive experiences. Identify exactly what users are paying attention to and in what order. Measure training performance and outcomes with employees. Evaluate compliance to improve safety, reduce injury and errors. Collect distributed insights for CPG, AEC and retail environments. Prove out your hypotheses by conducting experiments in VR. Receive scientific accuracy in captured eye-tracking, fixations and biometrics.. Improve product development workflows to enable rapid insights. ...

Downloads: 0 This Week

Last Update: 2026-02-27
See Project
9

xeokit SDK

Open source JavaScript SDK for viewing high-detail 3D BIM

xeokit is an open-source 3D graphics SDK from xeolabs for BIM and AEC. Built to view huge models in the browser. Used by industry leaders. Visualize complex, federated AEC models at full precision, in all major browsers, including mobile. A complete JavaScript graphics toolkit for building BIM and AEC viewing applications. Open source components to build your own product, your way. Keep everything on your own servers. Get your models on the Web using a toolkit of production-proven components...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
8 Monitoring Tools in One APM. Install in 5 Minutes.
Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.

Start Free
10

Habitat-Sim

A flexible, high-performance 3D simulator for Embodied AI research

Habitat-Sim is a high-performance 3D simulator for embodied AI research, designed to run photorealistic indoor environments at thousands of frames per second. It offers GPU-accelerated rendering and a flexible sensor suite—RGB, depth, semantic segmentation, and more—so agents can perceive and act in realistic scenes. The engine is written in C++ with Python bindings and integrates physics, navigation meshes, and shortest-path planners to support tasks like point-goal navigation,...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
11

Audiomentations

A Python library for audio data augmentation

A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio and multichannel audio. Can be integrated in training pipelines in e.g. Tensorflow/Keras or Pytorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products. Mix in another sound, e.g. a background noise. Useful if your original sound is clean and you want to simulate an environment where background noise is present. ...

Downloads: 0 This Week

Last Update: 2025-09-13
See Project
12

AnimateDiff

Plug-n-play module turning text-to-image models into animation

AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. It supports various configurations to create animations with different visual styles, providing flexibility and ease of use for developers and artists interested in exploring dynamic, AI-generated animations.

1 Review

Downloads: 44 This Week

Last Update: 2025-03-06
See Project
13

EarQuiz Frequencies

Software for technical ear training on equalization

EarQuiz Frequencies is a software for ear training on equalization. Its goal is to help musicians, audio professionals, hobbyists and students learn how to hear frequency bands. Available for Windows 10, 11 (x64), macOS 11 or higher (both for Intel and Apple Silicon) and Linux. This application is based on (and deeply inspired by) the world-renowned Golden Ears method of David Moulton, whose course is half dedicated to building this essential critical listening skill.

1 Review

Downloads: 4 This Week

Last Update: 2025-01-10
See Project
14

byzorgan

Specialized sound synthesizer with Byzantine Church music scales

This software integrates a small, specialized synthesizer and vocal processor. It can be used to learn Byzantine Church singing. You can play from the keyboard, mouse or touch screen. MIDI input is also available. Voice functions include: pitch highlighting, synthesizer control by voice, pitch correction and voice-to-ison conversion. On the screen there are labels with symbols of Byzantine notes. There is a metronome. The program is oriented on the Chrysanthos tuning of the diatonic scale:...

4 Reviews

Downloads: 31 This Week

Last Update: 2025-07-28
See Project
15

Youtube Downloader & Splitter - Ad-free

Another training development

Warning please update yt-dlp module manully!!!! This application development started in mid 2020. Easy to use and one of the fastest method to get youtube music videos to mp3 Please use it fairly and use it legal way Qt6.6 based for Linux dotnet (8) +Avalonia UI for Linux develops still in progress.

Downloads: 0 This Week

Last Update: 2025-07-20
See Project
16

MMDeploy

OpenMMLab Model Deployment Framework

...All kinds of modules in the SDK can be extended, such as Transform for image processing, Net for Neural Network inference, Module for postprocessing and so on. Install and build your target backend. ONNX Runtime is a cross-platform inference and training accelerator compatible with many popular ML/DNN frameworks. Please read getting_started for the basic usage of MMDeploy.

Downloads: 0 This Week

Last Update: 2023-12-25
See Project
17

DALL-E 2 - Pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

...Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as the denoising network) To train DALLE-2 is a 3 step process, with the training of CLIP being the most important. To train CLIP, you can either use x-clip package, or join the LAION discord, where a lot of replication efforts are already underway. Then, you will need to train the decoder, which learns to generate images based on the image embedding coming from the trained CLIP.

Downloads: 1 This Week

Last Update: 2023-10-19
See Project
18

Asteroid

The PyTorch-based audio source separation toolkit for researchers

...Add a new filterbank, separator architecture, dataset or even recipe very easily. Recipes provide an easy way to reproduce results with data preparation, system design, training and evaluation in a single script. This is an essential tool for the community! The default logger is TensorBoard in all the recipes. From the recipe folder, you can run the following to visualize the logs of all your runs.

Downloads: 0 This Week

Last Update: 2023-10-12
See Project
19

MMAction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

...We support 27 different algorithms and 20 different datasets for the four major tasks. We provide detailed documentation and API reference, as well as unit tests. We support Multigrid on Kinetics400, achieve 76.07% Top-1 accuracy and accelerate training speed.

Downloads: 0 This Week

Last Update: 2023-10-12
See Project
20

Roop

One-click face swap

Take a video and replace the face with a face of your choice. You only need one image of the desired face. No dataset, and no training.

Downloads: 138 This Week

Last Update: 2024-09-03
See Project
21

MMClassification

OpenMMLab Image Classification Toolbox and Benchmark

MMClassification is an open-source image classification toolbox based on PyTorch. It is a part of the OpenMMLab project. Supports DenseNet, VAN and PoolFormer, and provide pre-trained models. Supports training on IPU. Supports a series of CSP networks, such as CSP-ResNet, CSP-ResNeXt and CSP-DarkNet. MMClassification is an open source project that is contributed by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as users who give valuable feedback. ...

Downloads: 0 This Week

Last Update: 2023-07-31
See Project
22

Lyra

A Very Low-Bitrate Codec for Speech Compression

lyra is a neural audio codec designed to deliver intelligible, natural-sounding speech at extremely low bitrates, making real-time communication viable on constrained networks. It replaces hand-engineered codecs with learned models that capture speech characteristics more efficiently and reconstruct waveforms with a neural vocoder. The system targets mobile-class hardware, balancing latency and quality so it can run in real-time on phones. Its architecture is resilient to packet loss and...

Downloads: 3 This Week

Last Update: 2025-10-09
See Project
23

audioFlux

A library for audio and music analysis, feature extraction.

...It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training, and is used to study various tasks in the audio field such as Classification, Separation, Music Information Retrieval(MIR) and ASR etc.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
24

Video Pre-Training

Learning to Act by Watching Unlabeled Online Videos

The Video PreTraining (VPT) repository provides code and model artifacts for a project where agents learn to act by watching human gameplay videos—specifically, gameplay of Minecraft—using behavioral cloning. The idea is to learn general priors of control from large-scale, unlabeled video data, and then optionally fine-tune those priors for more goal-directed behavior via environment interaction. The repository contains demonstration models of different widths, fine-tuned variants (e.g. for...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
25

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems in post. ...

Downloads: 5 This Week

Last Update: 2022-09-03
See Project