deep learning toolbox free download

Showing 28 open source projects for "deep learning toolbox"

1

satellite-image-deep-learning

Resources for deep learning with satellite & aerial imagery

This page lists resources for performing deep learning on satellite imagery. To a lesser extent, classical machine learning (e.g. random forests) is also discussed, as are classical image processing techniques. Note that there is a huge volume of academic literature published on these topics; this repository does not seek to index it all, but rather lists approachable resources with published code that will benefit both the research and developer communities.

Downloads: 0 This Week

Last Update: 2025-07-05
See Project
2

Audiomentations

A Python library for audio data augmentation

A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio and multichannel audio. Can be integrated in training pipelines in e.g. Tensorflow/Keras or Pytorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products. Mix in another sound, e.g. a background noise. Useful if your original sound is clean and you want to simulate an environment where background noise is present. ...

Downloads: 0 This Week

Last Update: 2025-09-13
See Project
3

PML

The easiest way to use deep metric learning in your application

This library contains 9 modules, each of which can be used independently within your existing codebase, or combined together for a complete train/test workflow. To compute the loss in your training loop, pass in the embeddings computed by your model, and the corresponding labels. The embeddings should have size (N, embedding_size), and the labels should have size (N), where N is the batch size. The TripletMarginLoss computes all possible triplets within the batch, based on the labels you...

Downloads: 0 This Week

Last Update: 2025-08-17
See Project
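
The PML entry above describes passing embeddings of size (N, embedding_size) and labels of size (N) to a loss that considers all possible triplets in the batch. The following is a minimal pure-Python sketch of that idea, not the library's own implementation: the function names, the Euclidean distance, the margin value, and the mean over all triplets are assumptions made for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def all_triplets(labels):
    """Yield every (anchor, positive, negative) index triple implied by the labels.

    The positive shares the anchor's label (and is a different sample);
    the negative has a different label.
    """
    n = len(labels)
    for a in range(n):
        for p in range(n):
            if p == a or labels[p] != labels[a]:
                continue
            for neg in range(n):
                if labels[neg] != labels[a]:
                    yield a, p, neg


def triplet_margin_loss(embeddings, labels, margin=0.2):
    """Mean hinge loss max(0, d(a, p) - d(a, n) + margin) over all triplets.

    embeddings: list of N vectors, each of length embedding_size
    labels:     list of N class labels
    Returns 0.0 when the batch contains no valid triplet.
    """
    if len(embeddings) != len(labels):
        raise ValueError("embeddings and labels must have the same length N")
    losses = []
    for a, p, neg in all_triplets(labels):
        d_ap = euclidean(embeddings[a], embeddings[p])
        d_an = euclidean(embeddings[a], embeddings[neg])
        losses.append(max(0.0, d_ap - d_an + margin))
    return sum(losses) / len(losses) if losses else 0.0


if __name__ == "__main__":
    # Hypothetical batch: N = 3 samples, embedding_size = 2.
    batch = [[0.0, 0.0], [0.0, 1.0], [3.0, 0.0]]
    batch_labels = [0, 0, 1]
    print(list(all_triplets(batch_labels)))
    print(triplet_margin_loss(batch, batch_labels, margin=5.0))
```

Enumerating every triplet costs on the order of N^3 distance evaluations, so this sketch is only meant to show the shape contract between embeddings and labels on small batches.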
4

AudioCraft

Audiocraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides...

Downloads: 4 This Week

Last Update: 2025-10-13
See Project
5

OpenCV

Open Source Computer Vision Library

The Open Source Computer Vision Library has >2500 algorithms, extensive documentation and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, and iOS, and in your browser through JavaScript. Languages: C++, Python, Julia, JavaScript. Homepage: https://opencv.org. Q&A forum: https://forum.opencv.org/. Documentation: https://docs.opencv.org. Source code: https://github.com/opencv. Please pay special attention to our tutorials!...

123 Reviews

Downloads: 9,114 This Week

Last Update: 2025-12-31
See Project
6

Clarity AI Upscaler

AI Image Upscaler & Enhancer

Clarity AI Upscaler is an open-source AI image enhancement tool designed to increase the resolution and visual quality of images using modern generative techniques. The system uses deep learning models based on diffusion and other image generation methods to reconstruct high-resolution versions of low-resolution images while preserving important visual details. Unlike traditional interpolation-based upscaling algorithms, the system generates additional visual information that improves perceived clarity and sharpness. ...

Downloads: 11 This Week

Last Update: 2026-03-12
See Project
7

Transcoder

Hardware-accelerated video transcoding using Android MediaCodec APIs

...Unlike traditional speech translation systems that rely on multi-stage pipelines, Transcoder directly translates one speaker’s video into another language while preserving facial expressions, lip-sync, and vocal identity. Designed for real-time use and production-grade pipelines, Transcoder combines advanced deep learning models with GPU acceleration to deliver high-quality translations across languages. It’s built with researchers and developers in mind, offering tools for testing, evaluating, and deploying AI-driven media localization.

Downloads: 1 This Week

Last Update: 2025-03-25
See Project
8

MMDeploy

OpenMMLab Model Deployment Framework

MMDeploy is an open-source deep learning model deployment toolset. It is a part of the OpenMMLab project. Models can be exported and run in several backends, and more will be compatible. All kinds of modules in the SDK can be extended, such as Transform for image processing, Net for Neural Network inference, Module for postprocessing and so on. Install and build your target backend.

Downloads: 0 This Week

Last Update: 2023-12-25
See Project
9

MMAction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

OpenMMLab's next generation video understanding toolbox and benchmark. MMAction2 is an open-source toolbox for video understanding based on PyTorch. It is a part of the OpenMMLab project. Modular design: We decompose a video understanding framework into different components. One can easily construct a customized video understanding framework by combining different modules. Support four major video understanding tasks: MMAction2 implements various algorithms for multiple video understanding...

Downloads: 0 This Week

Last Update: 2023-10-12
See Project
10

MMClassification

OpenMMLab Image Classification Toolbox and Benchmark

MMClassification is an open-source image classification toolbox based on PyTorch. It is a part of the OpenMMLab project. Supports DenseNet, VAN and PoolFormer, and provides pre-trained models. Supports training on IPU. Supports a series of CSP networks, such as CSP-ResNet, CSP-ResNeXt and CSP-DarkNet. MMClassification is an open source project that is contributed by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or...

Downloads: 0 This Week

Last Update: 2023-07-31
See Project
11

AI Upscaler for Blender

AI Upscaler for Blender using Real-ESRGAN

...Upscaling is done entirely on the CPU. Blender renders a low-resolution image. The Real-ESRGAN Upscaler upscales the low-resolution image to a higher-resolution image. Real-ESRGAN is a deep learning upscaler that uses neural networks to achieve excellent results by adding in detail when it upscales.

Downloads: 1 This Week

Last Update: 2023-08-08
See Project
12

audioFlux

A library for audio and music analysis, feature extraction.

audioFlux is a deep learning tool library for audio and music analysis and feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. The extracted features can be provided to deep learning networks for training, and the library is used to study various tasks in the audio field such as classification, separation, music information retrieval (MIR) and ASR.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
13

EnCodec

State-of-the-art deep learning based audio codec

Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...

Downloads: 0 This Week

Last Update: 2025-10-12
See Project
14

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure.

Downloads: 2 This Week

Last Update: 2022-09-03
See Project
15

KAIR

Image Restoration Toolbox (PyTorch). Training and testing codes

Image restoration toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSR/GAN, SwinIR.

Downloads: 7 This Week

Last Update: 2022-08-11
See Project
16

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Downloads: 1 This Week

Last Update: 2021-04-08
See Project
17

XZVoice

Free and open source text-to-speech software

...Technically, multi-level rhythmic pauses are taken into account to produce a natural synthesized rhythm, and acoustic and linguistic parameters are used together to build multiple automatic prediction models based on deep learning. The pronunciation model is trained on massive audio data, so the synthetic sound is real, full, cadenced, and expressive, and the MOS score has reached the professional level in the industry.

Downloads: 0 This Week

Last Update: 2022-10-04
See Project
18

TTS

Deep learning for text to speech

TTS is a library for advanced Text-to-Speech generation. Built on the latest research, it was designed to achieve the best trade-off among ease-of-training, speed, and quality. TTS comes with pre-trained models, tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, Tensorflow and TFLite. Tools to curate Text2Speech datasets under dataset_analysis. Demo server for model testing. Notebooks for extensive model...

Downloads: 1 This Week

Last Update: 2021-10-18
See Project
19

Surface Defect Detection Dataset Papers

Constantly summarizing open source datasets and critical papers

...A reasonable imaging scheme helps to obtain images with uniform illumination and clearly reflect the surface defects of the object. In recent years, many defect detection methods based on deep learning have also been widely used in various industrial scenarios.

Downloads: 0 This Week

Last Update: 2022-08-17
See Project
20

GIMP ML

AI for GNU Image Manipulation Program

...In addition, GIMP-ML also aims to bring the benefits of using deep learning networks used for computer vision tasks to routine image processing workflows.

Downloads: 9 This Week

Last Update: 2022-08-19
See Project
21

FastoCloud PRO

IPTV/NVR/CCTV/Video cloud https://fastocloud.com

IPTV/Video cloud. Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian); GPU/CPU Encode/Decode/Post Processing; Stream statistics; CCTV; Adaptive hls streams; Load balancing; Temporary urls; HLS push; EPG scanning; Subtitles to text conversions; AD insertion; Logo overlay; Video effects; Relays; Timeshifts; Catchups; Playlists; Restream/Transcode from online streaming services like Youtube, Twitch; Mozaic; Many Outputs; Physical Inputs; Streaming Protocols; File Formats; Presets; Vods/Series server-side support; Pay per view channels; Channels on demand; HTTP Live Streaming (HLS) server-side support; Public API, client server communication via JSON RPC Protocol gzip compression; Deep learning video analysis. Supported deep learning frameworks: Tensorflow, NCSDK, Caffe. ML Hardware:

Downloads: 1 This Week

Last Update: 2020-06-20
See Project
22

PRMLT

Matlab code of machine learning algorithms in book PRML

This Matlab package implements machine learning algorithms described in the great textbook Pattern Recognition and Machine Learning by C. Bishop (PRML). It is written purely in the Matlab language. It is self-contained, with no external dependencies. This package requires Matlab R2016b or later, since it utilizes a new Matlab syntax called implicit expansion (a.k.a. broadcasting).

Downloads: 0 This Week

Last Update: 2021-10-27
See Project
23

Image Super-Resolution (ISR)

Super-scale your images and run experiments with Residual Dense

The goal of this project is to upscale and improve the quality of low-resolution images. This project contains Keras implementations of different Residual Dense Networks for Single Image Super-Resolution (ISR) as well as scripts to train these networks using content and adversarial loss components. Docker scripts and Google Colab notebooks are available to carry out training and prediction. Also, we provide scripts to facilitate training on the cloud with AWS and Nvidia-docker with only a few...

Downloads: 1 This Week

Last Update: 2022-03-31
See Project
24

YouTube-8M

Starter code for working with the YouTube-8M dataset

...The code demonstrates how to process frame-level features, train logistic and deep learning models, evaluate them using metrics like global Average Precision (gAP) and mean Average Precision (mAP), and export trained models for MediaPipe inference.

Downloads: 0 This Week

Last Update: 1 day ago
See Project
25

TensorSpace.js

Neural network 3D visualization framework

TensorSpace is a neural network 3D visualization framework built using TensorFlow.js, Three.js and Tween.js. TensorSpace provides Keras-like APIs to build deep learning layers, load pre-trained models, and generate a 3D visualization in the browser. From TensorSpace, it is intuitive to learn what the model structure is, how the model is trained and how the model predicts the results based on the intermediate information. After preprocessing the model, TensorSpace supports the visualization of pre-trained models from TensorFlow, Keras and TensorFlow.js. ...

Downloads: 0 This Week

Last Update: 2022-02-18
See Project