73 projects for "image processing toolbox for..." with 2 filters applied:

  • 1
    MiniMax-01

    Large-language-model & vision-language-model based on Linear Attention

    MiniMax-01 is the official repository for two flagship models: MiniMax-Text-01, a long-context language model, and MiniMax-VL-01, a vision-language model built on top of it. MiniMax-Text-01 uses a hybrid attention architecture that blends Lightning Attention, standard softmax attention, and Mixture-of-Experts (MoE) routing to achieve both high throughput and long-context reasoning. It has 456 billion total parameters with 45.9 billion activated per token and is trained with advanced parallel...
    Downloads: 0 This Week
  • 2

    libsombrero

    Astronomical object/structure detection from 1D and 2D data sets.

    Sombrero is a fast wavelet image processing and object detection C library for astronomical images. Sombrero is named after the "Mexican Hat" shape of the wavelet masks used in image convolution and is released under the GNU LGPL.
    Downloads: 0 This Week
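
    The description above mentions "Mexican Hat" wavelet masks used in image convolution. As a rough illustration only (this is not libsombrero's own code or API), the sketch below builds such a mask in Python from the commonly used 2D Mexican hat (Ricker) formula. The function name `mexican_hat_kernel` and its `size` and `sigma` parameters are assumptions made for this example.

```python
import math


def mexican_hat_kernel(size, sigma):
    """Return a size x size 2D Mexican hat (Ricker) mask as a list of rows.

    Hypothetical helper for illustration; not part of libsombrero.
    Formula: psi(x, y) = (1 - r) * exp(-r) / (pi * sigma^4),
    where r = (x^2 + y^2) / (2 * sigma^2).
    """
    if size % 2 == 0:
        raise ValueError("size must be odd so the mask has a center pixel")
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            r = (x * x + y * y) / (2.0 * sigma * sigma)
            row.append((1.0 - r) * math.exp(-r) / (math.pi * sigma ** 4))
        kernel.append(row)
    return kernel


if __name__ == "__main__":
    # Print a small mask: positive peak in the center, negative ring around it.
    for row in mexican_hat_kernel(5, 1.0):
        print(" ".join(f"{value:+.4f}" for value in row))
```

    Convolving an image with a mask like this responds strongly to blob-like structures whose size roughly matches `sigma`, which is why the shape suits point-source detection.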
  • 3
    MATLAB Deep Learning Model Hub

    Discover pretrained models for deep learning in MATLAB

    Discover pretrained models for deep learning in MATLAB. Pretrained image classification networks have already learned to extract powerful and informative features from natural images. Use them as a starting point to learn a new task using transfer learning. Inputs are RGB images; the output is the predicted label and score.
    Downloads: 1 This Week
  • 4
    VideoCrafter2

    Overcoming Data Limitations for High-Quality Video Diffusion Models

    VideoCrafter is an open-source video generation and editing toolbox designed to create high-quality video content. It features models for both text-to-video and image-to-video generation. The system is optimized for generating videos from textual descriptions or still images, leveraging advanced diffusion models. VideoCrafter2, an upgraded version, improves on its predecessor by enhancing motion dynamics and concept combinations, especially in low-data scenarios.
    Downloads: 7 This Week
  • 5
    HunyuanVideo-I2V

    A Customizable Image-to-Video Model based on HunyuanVideo

    HunyuanVideo-I2V is a customizable image-to-video generation framework developed by Tencent, extending the capabilities of HunyuanVideo. It allows for high-quality video creation from still images, using PyTorch and providing pre-trained model weights, inference code, and customizable training options. The system includes LoRA training code for adding special effects and enhancing video realism, aiming to offer versatile and scalable solutions for generating videos from static image inputs.
    Downloads: 3 This Week
  • 6
    MediaPipe Face Detection

    Detect faces in an image

    The MediaPipe Face Detection model is a high-performance, real-time face detection solution that uses machine learning to identify faces in images and video streams. It is optimized for mobile and embedded platforms, offering fast and accurate face detection while maintaining a small memory footprint. This model supports multiple face detections and is highly efficient, making it suitable for a variety of applications such as augmented reality, user authentication, and facial expression analysis.
    Downloads: 5 This Week
  • 7
    Deep Learning Models

    A collection of various deep learning architectures, models, and tips

    ...The code favors readability and pedagogy: components are organized so you can trace data flow through layers, losses, optimizers, and evaluation. Examples span fundamental architectures—MLPs, CNNs, RNN/Transformers—and practical tasks like image classification or text modeling. Reproducible training scripts and configuration files make it straightforward to rerun experiments or adapt them to your own datasets. The repo often pairs implementations with notes on design choices and trade-offs, turning it into both a toolbox and a learning resource. It’s suitable for students, researchers prototyping ideas, and practitioners who want clean baselines before adding complexity.
    Downloads: 0 This Week
  • 8
    ADAMS

    ADAMS is a workflow engine for building complex knowledge workflows.

    ...This allows rapid development and easy maintenance of large workflows, with hundreds or thousands of operators. Operators include machine learning (WEKA, MOA, MEKA) and image processing (ImageJ, JAI, BoofCV, LIRE and Gnuplot). R is available through Rserve. A WEKA webservice allows other frameworks to use WEKA models. Fast prototyping is possible with Groovy and Jython. Read/write support is provided for various databases and spreadsheet applications.
    Downloads: 1 This Week
  • 9
    Computer vision projects

    computer vision projects | Fun AI projects related to computer vision

    Computer vision projects is an open-source collection of computer vision projects and experiments that demonstrate practical applications of modern AI techniques in image processing, robotics, and real-time visual analysis. The repository includes multiple demonstration systems implemented using languages such as Python and C++, covering topics ranging from object detection to embedded vision systems. Many of the projects illustrate how computer vision algorithms can interact with hardware platforms, including robotics systems and edge computing devices. ...
    Downloads: 1 This Week
  • 10
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. It is a command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, Maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, such as text within a document within a zip archive. Here you...
    Downloads: 1 This Week
  • 11
    Riffusion

    Real-time music generation using stable diffusion techniques AI

    Riffusion (hobby) is a Python-based open source library designed for real-time music and audio generation using stable diffusion techniques. Riffusion (hobby) works by generating and manipulating spectrogram images, which are then converted into playable audio clips, effectively bridging image-based diffusion models with sound synthesis. It implements a diffusion pipeline that supports prompt interpolation, allowing smooth transitions between different musical styles or prompts over time. Riffusion (hobby) serves as the core implementation for audio and image processing, providing essential building blocks for generating music from text prompts. ...
    Downloads: 3 This Week
  • 12
    FrankMocap

    A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

    FrankMocap is a monocular 3D human capture system that estimates body, hand, and optionally face pose from a single RGB image or video. It regresses parametric human models (e.g., SMPL/SMPL-X) directly, producing temporally stable meshes and joint angles suitable for animation or analytics. The pipeline couples a robust 2D keypoint detector with 3D mesh regression networks and priors that keep results anatomically plausible. It can run frame-by-frame or with temporal smoothing, and includes demo apps for live webcam capture as well as batch processing. ...
    Downloads: 2 This Week
  • 13
    hora

    Efficient approximate nearest neighbor search algorithm collections

    hora is an open-source high-performance vector similarity search library designed for large-scale machine learning and information retrieval systems. The project focuses on approximate nearest neighbor search, a fundamental technique used in modern AI applications such as recommendation systems, image search, and semantic search engines. Hora implements multiple efficient indexing algorithms that allow systems to rapidly search through high-dimensional vectors produced by machine learning...
    Downloads: 0 This Week
  • 14
    course-v3

    The 3rd edition of course.fast.ai

    ...The materials demonstrate how to train neural networks using the fastai library and the PyTorch deep learning framework, enabling learners to quickly create applications such as image classifiers, natural language processing models, and recommendation systems.
    Downloads: 0 This Week
  • 15
    LTI-Lib is an object oriented computer vision library written in C++ for Windows/MS-VC++ and Linux/gcc. It provides lots of functionality to solve mathematical problems, many image processing algorithms, some classification tools and much more...
    Downloads: 0 This Week
  • 16

    Spectral Python

    A Python module for hyperspectral image processing

    Spectral Python (SPy) is a Python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.
    Downloads: 0 This Week
  • 17
    VoiceShot API - PHP SDK

    PHP SDK for processing phone calls and SMS through the VoiceShot API.

    VoiceShot's API allows you to quickly integrate both incoming and outgoing phone calling and text messaging services into your applications. From your own applications, you can easily place and receive interactive telephone calls and text messages. Put callers in touch with the data and people they want when they want it. Send notification phone calls and text messages, automate customer service calls/texts, provide order status and integrate with your own custom applications to provide...
    Downloads: 0 This Week
  • 18
    VoiceShot API - .NET SDK

    .NET SDK for processing phone calls and SMS through the VoiceShot API.

    VoiceShot's API allows you to quickly integrate both incoming and outgoing phone calling and text messaging services into your applications. From your own applications, you can easily place and receive interactive telephone calls and text messages. Put callers in touch with the data and people they want when they want it. Send notification phone calls and text messages, automate customer service calls/texts, provide order status and integrate with your own custom applications to provide...
    Downloads: 0 This Week
  • 19
    VoiceShot API - ASP SDK

    ASP SDK for processing phone calls and SMS through the VoiceShot API.

    VoiceShot's API allows you to quickly integrate both incoming and outgoing phone calling and text messaging services into your applications. From your own applications, you can easily place and receive interactive telephone calls and text messages. Put callers in touch with the data and people they want when they want it. Send notification phone calls and text messages, automate customer service calls/texts, provide order status and integrate with your own custom applications to provide...
    Downloads: 0 This Week
  • 20
    lbpcascade_animeface

    A face detector for anime/manga using OpenCV

    lbpcascade_animeface is a computer vision project that provides a trained LBP cascade classifier specifically optimized for detecting anime-style faces in images. Built using OpenCV’s cascade classifier framework, the project adapts traditional face detection techniques to stylized anime and manga artwork, where conventional human face detectors often fail. It is commonly used in anime image analysis, automated cropping tools, avatar systems, illustration indexing, and preprocessing...
    Downloads: 0 This Week
  • 21
    OpenPR
    OpenPR stands for Open Pattern Recognition project and is intended to be an open source library for algorithms of image processing, computer vision, natural language processing, pattern recognition, machine learning and the related fields.
    Downloads: 2 This Week
  • 22
    FastPhotoStyle

    Style transfer, deep learning, feature transform

    ...It is computationally efficient due to its closed-form solution, allowing fast processing compared to iterative optimization-based methods. The framework is particularly useful in applications such as photo editing, film post-processing, and dataset augmentation where realism is critical. By preserving structural details and avoiding distortions, it produces results that are visually consistent with natural images.
    Downloads: 0 This Week
  • 23
    EasyPR

    An easy, flexible, and accurate plate recognition project

    EasyPR is an open-source license plate recognition system designed to detect and recognize vehicle license plates from images using computer vision and machine learning techniques. The project focuses primarily on recognizing Chinese license plates but also demonstrates general approaches to automatic number plate recognition systems. Built on top of the OpenCV computer vision library, EasyPR provides algorithms for detecting license plate regions in images, segmenting characters, and...
    Downloads: 0 This Week
  • 24
    Scilab Image Processing Toolbox

    Advanced image processing toolbox for Scilab on Unix/Linux/Mac OS

    SIP is the image processing and computer vision package for Scilab, a free MATLAB-like programming environment. SIP reads and writes images in formats such as JPEG, PNG, and BMP. It performs filtering, segmentation, edge detection, morphology, and shape analysis. Download from Git: http://siptoolbox.sourceforge.net/devel
    Downloads: 8 This Week
  • 25

    Service Grid - Language Grid Base System

    SOA infrastructure initially developed by NICT Language Grid Project

    Service Grid is an infrastructure for accumulating and sharing Web services. Resources with complicated intellectual property issues are wrapped as Web services and shared on the Service Grid. If you release your software by using the software of this project, please include the following description in the documents or on the website. * This software uses the [SOFTWARE] by the Language Grid project (http://langrid.org/). [SOFTWARE] is one of: * Service Grid Server Software...
    Downloads: 0 This Week