Showing 68 open source projects for "deep"

  • 1
    satellite-image-deep-learning

    Resources for deep learning with satellite & aerial imagery

    ...You can also follow me on Twitter and LinkedIn where I aim to post frequent updates on my new discoveries, and I have created a dedicated group on LinkedIn. I have also started a blog here and have published a post on the history of this repository called Dissecting the satellite-image-deep-learning repo.
    Downloads: 0 This Week
  • 2
    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitles (hardsubs) from videos

    Extracts hard-coded subtitles (hardsubs) from videos and generates srt files. It is a deep learning-based video subtitle extraction framework that covers subtitle region detection and subtitle content extraction. Text recognition runs locally through OCR, so there is no need to apply for a third-party API or to call online OCR services such as Baidu and Ali. ...
    Downloads: 55 This Week
  • 3
    Audiomentations

    A Python library for audio data augmentation

    A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio and multichannel audio. Can be integrated into training pipelines in, e.g., TensorFlow/Keras or PyTorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products. Mix in another sound, e.g. a background noise. Useful if your original sound is clean and you want to simulate an environment where background noise is present. ...
    Downloads: 2 This Week
  • 4
    The Ocular Migraine

    The Ocular Migraine {Dev Mode} Master Control Program

    The Ocular Migraine is a powerful utility application designed for Meta Quest VR headsets that provides deep control over system-level performance, recording, and application behavior through an advanced interface. It operates as a “master control program” that leverages ADB-level access to modify device settings that are typically hidden from standard users, enabling extensive customization of CPU, GPU, and rendering parameters. The application allows users to create custom performance profiles that can automatically apply when launching specific games or applications, optimizing both visuals and performance. ...
    Downloads: 62 This Week
  • 5
    DirectX-Graphics-Samples

    Samples that demonstrate how to build graphics intensive applications

    ...In the Samples directory, you will find samples that attempt to break off specific features and specific usage scenarios into bite-sized chunks. For example, the ExecuteIndirect sample will show you just enough about execute indirect to get started with that feature without diving too deep into multiengine whereas the nBodyGravity sample will delve into multiengine without touching on the execute indirect feature etc. By doing this, we hope to make it easier to get started with DirectX 12. D3D12 Mesh Shaders sample demonstrates how Mesh shaders can be used to increase the flexibility and performance of the geometry pipeline. ...
    Downloads: 33 This Week
  • 6
    Clarity AI Upscaler

    AI Image Upscaler & Enhancer

    Clarity AI Upscaler is an open-source AI image enhancement tool designed to increase the resolution and visual quality of images using modern generative techniques. The system uses deep learning models based on diffusion and other image generation methods to reconstruct high-resolution versions of low-resolution images while preserving important visual details. Unlike traditional interpolation-based upscaling algorithms, the system generates additional visual information that improves perceived clarity and sharpness. ...
    Downloads: 16 This Week
  • 7
    bfxr

    Flash + AIR sound effects generator. Based on Sfxr.

    ...Its purpose is to enable users, especially game developers and sound designers, to quickly generate retro, 8-bit/“chiptune” style sound effects (“bleeps”, “booms”, “zaps”, etc.) without deep knowledge of audio signal processing. It offers an interactive GUI through which you can tweak many parameters (oscillators, envelopes, filters, etc.) to sculpt custom sound effects; you can preview in real time, export, and iterate. The project includes libraries, HTML templates, and both ActionScript and JavaScript code. It has been well-received (over a thousand stars), but as of 2025, it has been superseded by a newer version called bfxr2, which is a JavaScript reworking of the original.
    Downloads: 14 This Week
  • 8
    PersonaLive

    Expressive Portrait Image Animation for Live Streaming

    PersonaLive is an open-source diffusion-based portrait animation framework focused on generating expressive, long-duration animated sequences in real time, primarily for live streaming or interactive applications. It leverages deep generative models that condition on a static reference image and a driving input (such as motion or expression cues) to produce a seamless animated portrait sequence that can run indefinitely without segmentation artifacts. The framework prioritizes low-latency and streamable output, making it suitable for real-time creative workflows, broadcast overlays, or interactive avatars on consumer-grade GPUs. ...
    Downloads: 5 This Week
  • 9
    StaxRip

    Video encoding GUI for Windows

    StaxRip is a powerful, open-source video and audio encoding GUI for Windows that orchestrates industry-standard console tools (such as x265, FFmpeg, mkvmerge) and frame-server systems (like AviSynth+ or VapourSynth) to allow users to transcode, mux, remux, or process media files with fine-grained control. It is not a “one-click” encoder; instead, it grants the user deep control over encoding settings, filtering, resizing, cropping, subtitles, audio processing, container formats, and more — making it a tool of choice for videophiles, enthusiasts, and anyone needing high-quality and customized media output. Because StaxRip automates the invocation of complex command-line tools via a GUI, it lowers the barrier for less technical users while offering advanced configuration for experts. ...
    Downloads: 23 This Week
  • 10
    nunif

    Misc; latest version of waifu2x; 2D video to stereo 3D video

    nunif is a deep learning–based image processing framework focused on image upscaling, restoration, denoising, and enhancement tasks using neural network models. The project provides a collection of AI-powered utilities designed primarily for anime-style artwork, illustrations, and high-quality image restoration workflows. It includes command-line tools and graphical interfaces for applying trained neural models to improve image resolution and visual clarity while minimizing artifacts. nunif supports GPU acceleration and batch processing, making it suitable for creators, archivists, and enthusiasts handling large image collections. ...
    Downloads: 2 This Week
  • 11
    Transcoder

    Hardware-accelerated video transcoding using Android MediaCodec APIs

    ...Unlike traditional speech translation systems that rely on multi-stage pipelines, Transcoder directly translates one speaker’s video into another language while preserving facial expressions, lip-sync, and vocal identity. Designed for real-time use and production-grade pipelines, Transcoder combines advanced deep learning models with GPU acceleration to deliver high-quality translations across languages. It’s built with researchers and developers in mind, offering tools for testing, evaluating, and deploying AI-driven media localization.
    Downloads: 2 This Week
  • 12
    fooyin

    A customisable music player

    ...Its library management tools offer advanced filtering, tagging, and playlist organization for large music collections. Designed primarily for Linux, it emphasizes performance and deep customization over simplicity.
    Downloads: 1 This Week
  • 13
    OpenBrand

    Extract brand assets (logos, colors, backdrops) from any website

    ...The project is built with extensibility in mind, allowing developers to integrate additional AI models or design pipelines to expand its capabilities. It provides a structured approach to branding by combining automation with user input, enabling rapid prototyping of brand concepts without requiring deep design expertise. OpenBrand also emphasizes modularity and reproducibility, making it suitable for teams that want to standardize branding processes across projects.
    Downloads: 0 This Week
  • 14
    FFmpegCommand

    Command library suitable for Android. It implements audio and video

    ...It supports common operations such as transcoding, trimming, and format conversion. FFmpegCommand is particularly useful for users who want to leverage FFmpeg’s power without needing deep technical knowledge. It also enables experimentation by allowing users to test configurations before execution. Overall, it bridges the gap between ease of use and advanced media processing functionality.
    Downloads: 0 This Week
  • 15
    OpenCV

    Open Source Computer Vision Library

    The Open Source Computer Vision Library has more than 2,500 algorithms, extensive documentation, and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, iOS, and in your browser through JavaScript.
    Languages: C++, Python, Julia, JavaScript
    Homepage: https://opencv.org
    Q&A forum: https://forum.opencv.org/
    Documentation: https://docs.opencv.org
    Source code: https://github.com/opencv
    Please pay special attention to our tutorials!...
    Downloads: 45,534 This Week
  • 16
    Image-Editor

    AI based photo editing website for changing image background

    ...With cv2, you can easily read, write, filter, and display images, and much more. Image-Editor uses MediaPipe's selfie_segmentation model for background removal in real-time video streams. This advanced model uses deep neural networks to detect and remove the background.
    Downloads: 1 This Week
  • 17
    RawTherapee

    A powerful cross-platform raw photo processing program

    RawTherapee is a powerful, cross-platform raw photo processing application targeted at enthusiast and semi-professional photographers who want deep control over image development. It’s written primarily in C++ with a GTK+ front-end, providing a high-precision 32-bit floating-point processing engine along with multiple demosaicing algorithms and advanced color-management features. The program supports a very wide range of camera raw formats (via a patched version of dcraw) as well as non-raw formats such as JPEG, TIFF, and PNG. ...
    Downloads: 1 This Week
  • 18
    PML

    The easiest way to use deep metric learning in your application

    This library contains 9 modules, each of which can be used independently within your existing codebase, or combined together for a complete train/test workflow. To compute the loss in your training loop, pass in the embeddings computed by your model, and the corresponding labels. The embeddings should have size (N, embedding_size), and the labels should have size (N), where N is the batch size. The TripletMarginLoss computes all possible triplets within the batch, based on the labels you...
    Downloads: 0 This Week
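
    The PML description above says the loss takes embeddings of size (N, embedding_size) and labels of size (N), and that triplets are formed within the batch from the labels. A minimal sketch of that idea in plain Python follows. It is not the library's code: the function name, the Euclidean distance, the margin value, and the plain mean over triplets are all assumptions made for illustration, and the real library works on PyTorch tensors.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def batch_all_triplet_margin_loss(embeddings, labels, margin=0.05):
    """Mean hinge loss over every (anchor, positive, negative) triplet.

    Hypothetical helper, not part of any library.
    embeddings: N vectors of length embedding_size.
    labels: N class labels, one per embedding.
    margin: assumed value, chosen only for illustration.
    """
    if len(embeddings) != len(labels):
        raise ValueError("embeddings and labels must have the same length N")

    n = len(labels)
    losses = []
    for a in range(n):
        for p in range(n):
            # A positive shares the anchor's label but is a different sample.
            if p == a or labels[p] != labels[a]:
                continue
            for neg in range(n):
                # A negative has a different label from the anchor.
                if labels[neg] == labels[a]:
                    continue
                d_ap = euclidean(embeddings[a], embeddings[p])
                d_an = euclidean(embeddings[a], embeddings[neg])
                losses.append(max(0.0, d_ap - d_an + margin))

    # With no valid triplets in the batch, report zero loss.
    return sum(losses) / len(losses) if losses else 0.0


if __name__ == "__main__":
    batch = [[0.0, 0.0], [3.0, 4.0], [0.0, 1.0]]
    batch_labels = [0, 0, 1]
    print(batch_all_triplet_margin_loss(batch, batch_labels, margin=1.0))
```

    In this sketch a batch with no same-label pairs yields no triplets and a loss of zero, which is one reason label-balanced batches matter for this kind of loss.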
  • 19
    Kornia

    Open Source Differentiable Computer Vision Library

    ...Inspired by existing packages, this library is composed of a subset of packages containing operators that can be inserted within neural networks to train models to perform image transformations, epipolar geometry, depth estimation, and low-level image processing, such as filtering and edge detection, that operate directly on tensors. With Kornia we fill the gap between classical and deep computer vision by implementing standard and advanced vision algorithms for AI. Our libraries and initiatives always follow the community's needs.
    Downloads: 0 This Week
  • 20
    AudioCraft

    AudioCraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides...
    Downloads: 7 This Week
  • 21
    Ultralight

    Lightweight, high-performance HTML renderer for game developers

    ...Ultralight is engineered for peak performance, ensuring minimal CPU and memory usage. Customize low-level platform functionality, integrate JavaScript directly with native code, dive deep into performance tuning, and more. Built for maximum portability, optimized for PCs, game consoles, TVs, and embedded systems.
    Downloads: 5 This Week
  • 22
    Waifu2x-Extension-GUI

    Photo/Video/GIF enlargement using machine learning

    Image, GIF, and video super-resolution using deep convolutional neural networks.
    Built-in image processing algorithms: Waifu2x / SRMD / RealSR / Anime4K / ACNet
    Built-in image processing engines: Waifu2x-caffe / Waifu2x-converter / Waifu2x-ncnn-vulkan / SRMD-ncnn-vulkan / RealSR-ncnn-vulkan / Anime4KCPP
    GitHub: https://github.com/AaronFeng753/Waifu2x-Extension-GUI
    Downloads: 496 This Week
  • 23
    Fractal Zoomer

    A Fractal Zoomer with various functions.

    An application that lets you render some of the best-known fractal functions, like the Mandelbrot set and many more! It comes with a lot of options to further enhance your fractal experience! It's easy to use and does not require installation. A Java version higher than 1.8 must be installed.
    Downloads: 21 This Week
  • 24
    Voxal voice changer

    Transform your voice in real time with Voxal Voice Changer

    Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real time with stunning AI-powered quality. Whether you're looking to have fun, protect your privacy, or create engaging content,...
    Downloads: 31 This Week
  • 25
    QuickBeat

    Fully Customizable Full-Fledged Audio Player Supporting Many Formats

    Full-fledged music player and local library manager. Enjoy your media collection in one unified interface, with access to your local network, Deezer, Youtube-DL, and remote files. Never miss information on your remote media thanks to remote tag reading. Tweak your music how you want using a variety of built-in audio effects, and if that doesn't suit you, import your Winamp plugins. Immerse yourself with our selection of visualizers, and if you get bored with them, import your winamp,...
    Downloads: 7 This Week