Showing 60 open source projects for "deep"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    EnCodec

    EnCodec

    State-of-the-art deep learning based audio codec

    Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Coqui STT

    Coqui STT

    The deep learning toolkit for speech-to-text

    Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    G2SConverter

    G2SConverter

    Convert models from GoldSource engine to Source engine with AI

    ...To improve the quality of the texture, it is first Upscaled using RealESRGAN. The user can select scaling factor: x2, x4 or x8. After the Upscaling procedure, the texture is deblured using the Scale-recurrent Network for Deep Image Deblurring. An example of a processed texture is shown in the following image (parameters used: scaling-factor = 4 and deblur iterations = 4) besides upscaling and debluring the utility also generates normal maps for each texture. This is implemented using the DeepBump by HugoTiny model. Examples of normal maps are shown in the following images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    CinePaint

    CinePaint

    Deep paint and stop-motion animation software

    CinePaint 1.4.4 Windows version released 2021/5/6. Linux and MacOS versions coming. CinePaint is a deep paint image retouching tool that supports higher color fidelity than ordinary painting tools. CinePaint opens high fidelity image file formats such as DPX, 16-bit TIFF, and OpenEXR, and conventional formats like JPEG and PNG. -- Robin Rowe CinePaint Project Manager Beverly Hills, California robin.rowe@cinepaint.org imdb.me/robinrowe 323-535-0952
    Downloads: 37 This Week
    Last Update:
    See Project
  • 6
    XZVoice

    XZVoice

    Free and open source text-to-speech software

    ...Technically, multi-level rhythmic pauses are taken into account to achieve the purpose of natural synthesizing rhythm, and comprehensively use acoustic parameters and linguistic parameters to establish multiple automatic prediction models based on deep learning. Using massive audio data to train the pronunciation model, the synthetic sound is real, full, cadenced, and expressive, and the MOS score has reached the professional level in the industry.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    TTS

    TTS

    Deep learning for text to speech

    TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed, and quality. TTS comes with pre-trained models, tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, Tensorflow and TFLite. Tools to curate Text2Speech datasets underdataset_analysis. Demo server for model testing. Notebooks for extensive model...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    vdx

    vdx

    An intuitive CLI for processing video, powered by FFmpeg

    ...It supports chaining multiple transformations in a single command, enabling streamlined workflows for video editing and automation. vdx relies on Node.js and FFmpeg, making it portable across systems where these dependencies are available. Its focus on usability makes it accessible for both developers and content creators who want quick transformations without deep FFmpeg knowledge. Overall, it acts as a productivity layer that abstracts FFmpeg complexity into simple CLI commands.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    VRWorkout

    VRWorkout

    High-intensity virtual reality workout game

    The game tracks your running speed and awards more points the faster you go. Arm movements to the beat of the music and fast ducking to avoid bad targets. Shallow or deep squats depending on your comfort level. Engange your core muscles by reaching for higher targets. Pushups followed by jumps get your heart pumping. Classic pushup, one-armed planks and side planks. Fast and small jumps similar to skip rope jumping. A “workout first” VR music game with hand tracking based full-body exercises in boot camp style workouts, your fitness wins regardless of your score. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10
    Surface Defect Detection Dataset Papers

    Surface Defect Detection Dataset Papers

    Constantly summarizing open source dataset and critical papers

    ...A reasonable imaging scheme helps to obtain images with uniform illumination and clearly reflect the surface defects of the object. In recent years, many defect detection methods based on deep learning have also been widely used in various industrial scenarios.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    DeepMind Lab

    DeepMind Lab

    A customizable 3D platform for agent-based AI research

    ...DeepMind Lab provides a suite of challenging 3D navigation and puzzle-solving tasks for learning agents. Its primary purpose is to act as a testbed for research in artificial intelligence, especially deep reinforcement learning. If you use DeepMind Lab in your research and would like to cite the DeepMind Lab environment, we suggest you cite the DeepMind Lab paper. To enable compiler optimizations, pass the flag --compilation_mode=opt, or -c opt for short, to each bazel build, bazel test and bazel run command. The flag is omitted from the examples here for brevity, but it should be used for real training and evaluation where performance matters. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    GIMP ML

    GIMP ML

    AI for GNU Image Manipulation Program

    ...In addition, GIMP-ML also aims to bring the benefits of using deep learning networks used for computer vision tasks to routine image processing workflows.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    quick-media

    quick-media

    media(audio/image/qrcode/markdown/html/svg/png) support

    ...It supports batch processing, making it efficient for handling multiple media files in a single workflow. quick-media is designed for developers and content creators who want to automate repetitive media tasks without deep FFmpeg expertise. Its modular structure allows integration into scripts or larger pipelines. Overall, it serves as a productivity tool for managing multimedia processing tasks efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Xabe.FFmpeg

    Xabe.FFmpeg

    .NET Standard wrapper for FFmpeg. It allows to process media

    ...It includes utilities for retrieving media information through FFprobe, enabling detailed analysis of files. Xabe.FFmpeg is designed to work across platforms and integrates easily into C# projects without requiring deep knowledge of FFmpeg commands. It also supports progress tracking and event handling during processing tasks. Overall, it provides a developer-friendly approach to multimedia processing in .NET environments.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Consistent Depth

    Consistent Depth

    We estimate dense, flicker-free, geometrically consistent depth

    Consistent Depth is a research project developed by Facebook Research that presents an algorithm for reconstructing dense and geometrically consistent depth information for all pixels in a monocular video. The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    DeepFaceLab

    DeepFaceLab

    The leading software for creating deepfakes

    ...DeepFaceLab is an open-source deepfake system that enables users to swap the faces on images and on video. It offers an imperative and easy-to-use pipeline that even those without a comprehensive understanding of the deep learning framework or model implementation can use; and yet also provides a flexible and loose coupling structure for those who want to strengthen their own pipeline with other features without having to write complicated boilerplate code. DeepFaceLab can achieve results with high fidelity that are indiscernible by mainstream forgery detection approaches. ...
    Downloads: 17,221 This Week
    Last Update:
    See Project
  • 17

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch Mozaic Many Outputs Physical Inputs Streaming Protocols File Formats Presets Vods/Series server-side support Pay per view channels Channels on demand HTTP Live Streaming (HLS) server-side support Public API, client server communication via JSON RPC Protocol gzip compression Deep learning video analysis Supported deep learning frameworks: Tensorflow NCSDK Caffe ML Hardware:
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    VirtualDub2

    VirtualDub2

    An enhanced version of the original VirtualDub

    A streamlined video editor with simple frame-by-frame timeline. Free, open-source, portable. The default package is ready for use with many essential plugins included, and can be further extended with 3rd party codecs and filters. See Wiki for detailed description: https://sourceforge.net/p/vdfiltermod/wiki. Previous name of this project: VirtualDub FilterMod
    Leader badge
    Downloads: 1,072 This Week
    Last Update:
    See Project
  • 19
    Image Super-Resolution (ISR)

    Image Super-Resolution (ISR)

    Super-scale your images and run experiments with Residual Dense

    The goal of this project is to upscale and improve the quality of low-resolution images. This project contains Keras implementations of different Residual Dense Networks for Single Image Super-Resolution (ISR) as well as scripts to train these networks using content and adversarial loss components. Docker scripts and Google Colab notebooks are available to carry training and prediction. Also, we provide scripts to facilitate training on the cloud with AWS and Nvidia-docker with only a few...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    YouTube-8M

    YouTube-8M

    Starter code for working with the YouTube-8M dataset

    ...It was developed to support the YouTube-8M Video Understanding Challenge (hosted on Kaggle and featured at ICCV 2019), enabling researchers and practitioners to benchmark video classification models on large-scale datasets with over millions of labeled videos. The code demonstrates how to process frame-level features, train logistic and deep learning models, evaluate them using metrics like global Average Precision (gAP) and mean Average Precision (mAP), and export trained models for MediaPipe inference.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    TenorSpace.js

    TenorSpace.js

    Neural network 3D visualization framework

    TensorSpace is a neural network 3D visualization framework built using TensorFlow.js, Three.js and Tween.js. TensorSpace provides Keras-like APIs to build deep learning layers, load pre-trained models, and generate a 3D visualization in the browser. From TensorSpace, it is intuitive to learn what the model structure is, how the model is trained and how the model predicts the results based on the intermediate information. After preprocessing the model, TensorSpace supports the visualization of pre-trained models from TensorFlow, Keras and TensorFlow.js. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    waifu2x

    waifu2x

    Single-image super-resolution for anime-style art

    Single-Image Super-Resolution for Anime-Style Art using Deep Convolutional Neural Networks. And it supports photo. You can train your own model, change image size, reduce image noise, upscale and customize your image's style. It provides the option of converting and downloading your edited images.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    High Speed Parallel Imaging (HSPI)

    High Speed Parallel Imaging (HSPI)

    HSPI does parallel capture and sky surveys.

    HSPI offers to the astronomer the possibility to capture pictures from the CCD camera in a fast and intuitive way, keeping under control all the instrumentation in posses. HSPI can control the CCD camera, the mount, the focuser, the filter wheel and the dome from a single interface, and not only this: these features can be added on as many observatories and instrumentations you want. HSPI is ASCOM and INDI compatible, so if you want to use particular setups like Linux computers connected...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    lxnstack

    lxnstack

    A program to align and stack astronomical images.

    Lxnstack is a program designed to align and stack astronomical images (both planetary and deep-sky). The features of latest version are: * Several stacking methods (average, median, sigma-clipping, ...) * Two automatic align method: align-points; phase correlation, * One manual align method. * sub-pixel precision (0.1 pixel for phase correlation and 0.01 pixel for manual alignment). * Automatic image derotation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    MusicPlayer

    MusicPlayer

    Music player - endlessly plays your music

    This music player is supposed to be simple and all centered around an infinite intelligent queue (some other players call this PartyShuffle or DJ mode). You can manually add songs to it - but if you don't or the queue gets too short, it will automatically intelligently fill it with further songs. The intelligent queue decision currently is based on: * song ratings * context-based choices, e.g. related songs more likely Other features of this player: * open source, simplified BSD...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo