Page 2 | deep free download

Showing 60 open source projects for "deep"

View related business solutions

Multimedia Linux Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

EnCodec

State-of-the-art deep learning based audio codec

Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...

Downloads: 0 This Week

Last Update: 2025-10-12
See Project
2

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure.

Downloads: 1 This Week

Last Update: 2022-09-03
See Project
3

G2SConverter

Convert models from GoldSource engine to Source engine with AI

...To improve the quality of the texture, it is first Upscaled using RealESRGAN. The user can select scaling factor: x2, x4 or x8. After the Upscaling procedure, the texture is deblured using the Scale-recurrent Network for Deep Image Deblurring. An example of a processed texture is shown in the following image (parameters used: scaling-factor = 4 and deblur iterations = 4) besides upscaling and debluring the utility also generates normal maps for each texture. This is implemented using the DeepBump by HugoTiny model. Examples of normal maps are shown in the following images.

Downloads: 0 This Week

Last Update: 2023-03-30
See Project
4

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Downloads: 4 This Week

Last Update: 2021-04-08
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
5

CinePaint

Deep paint and stop-motion animation software

CinePaint 1.4.4 Windows version released 2021/5/6. Linux and MacOS versions coming. CinePaint is a deep paint image retouching tool that supports higher color fidelity than ordinary painting tools. CinePaint opens high fidelity image file formats such as DPX, 16-bit TIFF, and OpenEXR, and conventional formats like JPEG and PNG. -- Robin Rowe CinePaint Project Manager Beverly Hills, California robin.rowe@cinepaint.org imdb.me/robinrowe 323-535-0952

14 Reviews

Downloads: 37 This Week

Last Update: 2021-05-30
See Project
6

XZVoice

Free and open source text-to-speech software

...Technically, multi-level rhythmic pauses are taken into account to achieve the purpose of natural synthesizing rhythm, and comprehensively use acoustic parameters and linguistic parameters to establish multiple automatic prediction models based on deep learning. Using massive audio data to train the pronunciation model, the synthetic sound is real, full, cadenced, and expressive, and the MOS score has reached the professional level in the industry.

Downloads: 0 This Week

Last Update: 2022-10-04
See Project
7

TTS

Deep learning for text to speech

TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed, and quality. TTS comes with pre-trained models, tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, Tensorflow and TFLite. Tools to curate Text2Speech datasets underdataset_analysis. Demo server for model testing. Notebooks for extensive model...

Downloads: 2 This Week

Last Update: 2021-10-18
See Project
8

vdx

An intuitive CLI for processing video, powered by FFmpeg

...It supports chaining multiple transformations in a single command, enabling streamlined workflows for video editing and automation. vdx relies on Node.js and FFmpeg, making it portable across systems where these dependencies are available. Its focus on usability makes it accessible for both developers and content creators who want quick transformations without deep FFmpeg knowledge. Overall, it acts as a productivity layer that abstracts FFmpeg complexity into simple CLI commands.

Downloads: 0 This Week

Last Update: 2026-04-26
See Project
9

VRWorkout

High-intensity virtual reality workout game

The game tracks your running speed and awards more points the faster you go. Arm movements to the beat of the music and fast ducking to avoid bad targets. Shallow or deep squats depending on your comfort level. Engange your core muscles by reaching for higher targets. Pushups followed by jumps get your heart pumping. Classic pushup, one-armed planks and side planks. Fast and small jumps similar to skip rope jumping. A “workout first” VR music game with hand tracking based full-body exercises in boot camp style workouts, your fitness wins regardless of your score. ...

Downloads: 0 This Week

Last Update: 2022-01-21
See Project
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.

Try for Free
10

Surface Defect Detection Dataset Papers

Constantly summarizing open source dataset and critical papers

...A reasonable imaging scheme helps to obtain images with uniform illumination and clearly reflect the surface defects of the object. In recent years, many defect detection methods based on deep learning have also been widely used in various industrial scenarios.

Downloads: 0 This Week

Last Update: 2022-08-17
See Project
11

DeepMind Lab

A customizable 3D platform for agent-based AI research

...DeepMind Lab provides a suite of challenging 3D navigation and puzzle-solving tasks for learning agents. Its primary purpose is to act as a testbed for research in artificial intelligence, especially deep reinforcement learning. If you use DeepMind Lab in your research and would like to cite the DeepMind Lab environment, we suggest you cite the DeepMind Lab paper. To enable compiler optimizations, pass the flag --compilation_mode=opt, or -c opt for short, to each bazel build, bazel test and bazel run command. The flag is omitted from the examples here for brevity, but it should be used for real training and evaluation where performance matters. ...

Downloads: 0 This Week

Last Update: 2021-08-25
See Project
12

GIMP ML

AI for GNU Image Manipulation Program

...In addition, GIMP-ML also aims to bring the benefits of using deep learning networks used for computer vision tasks to routine image processing workflows.

Downloads: 4 This Week

Last Update: 2022-08-19
See Project
13

quick-media

media(audio/image/qrcode/markdown/html/svg/png) support

...It supports batch processing, making it efficient for handling multiple media files in a single workflow. quick-media is designed for developers and content creators who want to automate repetitive media tasks without deep FFmpeg expertise. Its modular structure allows integration into scripts or larger pipelines. Overall, it serves as a productivity tool for managing multimedia processing tasks efficiently.

Downloads: 0 This Week

Last Update: 2026-05-01
See Project
14

Xabe.FFmpeg

.NET Standard wrapper for FFmpeg. It allows to process media

...It includes utilities for retrieving media information through FFprobe, enabling detailed analysis of files. Xabe.FFmpeg is designed to work across platforms and integrates easily into C# projects without requiring deep knowledge of FFmpeg commands. It also supports progress tracking and event handling during processing tasks. Overall, it provides a developer-friendly approach to multimedia processing in .NET environments.

Downloads: 1 This Week

Last Update: 2026-04-27
See Project
15

Consistent Depth

We estimate dense, flicker-free, geometrically consistent depth

Consistent Depth is a research project developed by Facebook Research that presents an algorithm for reconstructing dense and geometrically consistent depth information for all pixels in a monocular video. The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a...

Downloads: 0 This Week

Last Update: 6 days ago
See Project
16

DeepFaceLab

The leading software for creating deepfakes

...DeepFaceLab is an open-source deepfake system that enables users to swap the faces on images and on video. It offers an imperative and easy-to-use pipeline that even those without a comprehensive understanding of the deep learning framework or model implementation can use; and yet also provides a flexible and loose coupling structure for those who want to strengthen their own pipeline with other features without having to write complicated boilerplate code. DeepFaceLab can achieve results with high fidelity that are indiscernible by mainstream forgery detection approaches. ...

1 Review

Downloads: 17,221 This Week

Last Update: 2023-09-07
See Project
17

FastoCloud PRO

IPTV/NVR/CCTV/Video cloud https://fastocloud.com

IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch Mozaic Many Outputs Physical Inputs Streaming Protocols File Formats Presets Vods/Series server-side support Pay per view channels Channels on demand HTTP Live Streaming (HLS) server-side support Public API, client server communication via JSON RPC Protocol gzip compression Deep learning video analysis Supported deep learning frameworks: Tensorflow NCSDK Caffe ML Hardware:

Downloads: 3 This Week

Last Update: 2020-06-20
See Project
18

VirtualDub2

An enhanced version of the original VirtualDub

A streamlined video editor with simple frame-by-frame timeline. Free, open-source, portable. The default package is ready for use with many essential plugins included, and can be further extended with 3rd party codecs and filters. See Wiki for detailed description: https://sourceforge.net/p/vdfiltermod/wiki. Previous name of this project: VirtualDub FilterMod

16 Reviews

Downloads: 1,072 This Week

Last Update: 2020-03-19
See Project
19

Image Super-Resolution (ISR)

Super-scale your images and run experiments with Residual Dense

The goal of this project is to upscale and improve the quality of low-resolution images. This project contains Keras implementations of different Residual Dense Networks for Single Image Super-Resolution (ISR) as well as scripts to train these networks using content and adversarial loss components. Docker scripts and Google Colab notebooks are available to carry training and prediction. Also, we provide scripts to facilitate training on the cloud with AWS and Nvidia-docker with only a few...

Downloads: 6 This Week

Last Update: 2022-03-31
See Project
20

YouTube-8M

Starter code for working with the YouTube-8M dataset

...It was developed to support the YouTube-8M Video Understanding Challenge (hosted on Kaggle and featured at ICCV 2019), enabling researchers and practitioners to benchmark video classification models on large-scale datasets with over millions of labeled videos. The code demonstrates how to process frame-level features, train logistic and deep learning models, evaluate them using metrics like global Average Precision (gAP) and mean Average Precision (mAP), and export trained models for MediaPipe inference.

Downloads: 1 This Week

Last Update: 6 days ago
See Project
21

TenorSpace.js

Neural network 3D visualization framework

TensorSpace is a neural network 3D visualization framework built using TensorFlow.js, Three.js and Tween.js. TensorSpace provides Keras-like APIs to build deep learning layers, load pre-trained models, and generate a 3D visualization in the browser. From TensorSpace, it is intuitive to learn what the model structure is, how the model is trained and how the model predicts the results based on the intermediate information. After preprocessing the model, TensorSpace supports the visualization of pre-trained models from TensorFlow, Keras and TensorFlow.js. ...

Downloads: 0 This Week

Last Update: 2022-02-18
See Project
22

waifu2x

Single-image super-resolution for anime-style art

Single-Image Super-Resolution for Anime-Style Art using Deep Convolutional Neural Networks. And it supports photo. You can train your own model, change image size, reduce image noise, upscale and customize your image's style. It provides the option of converting and downloading your edited images.

Downloads: 5 This Week

Last Update: 2023-09-07
See Project
23

High Speed Parallel Imaging (HSPI)

HSPI does parallel capture and sky surveys.

HSPI offers to the astronomer the possibility to capture pictures from the CCD camera in a fast and intuitive way, keeping under control all the instrumentation in posses. HSPI can control the CCD camera, the mount, the focuser, the filter wheel and the dome from a single interface, and not only this: these features can be added on as many observatories and instrumentations you want. HSPI is ASCOM and INDI compatible, so if you want to use particular setups like Linux computers connected...

Downloads: 0 This Week

Last Update: 2017-07-18
See Project
24

lxnstack

A program to align and stack astronomical images.

Lxnstack is a program designed to align and stack astronomical images (both planetary and deep-sky). The features of latest version are: * Several stacking methods (average, median, sigma-clipping, ...) * Two automatic align method: align-points; phase correlation, * One manual align method. * sub-pixel precision (0.1 pixel for phase correlation and 0.01 pixel for manual alignment). * Automatic image derotation

Downloads: 0 This Week

Last Update: 2016-10-09
See Project
25

MusicPlayer

Music player - endlessly plays your music

This music player is supposed to be simple and all centered around an infinite intelligent queue (some other players call this PartyShuffle or DJ mode). You can manually add songs to it - but if you don't or the queue gets too short, it will automatically intelligently fill it with further songs. The intelligent queue decision currently is based on: * song ratings * context-based choices, e.g. related songs more likely Other features of this player: * open source, simplified BSD...

Downloads: 0 This Week

Last Update: 2014-11-09
See Project