Search Results for "wavelets image processing"

33 projects for "wavelets image processing" with 2 filters applied:

Filters applied: Artificial Intelligence, ChromeOS

1

HunyuanDiT

Diffusion Transformer with Fine-Grained Chinese Understanding

HunyuanDiT is a high-capability text-to-image diffusion transformer with bilingual (Chinese/English) understanding and multi-turn dialogue capability. It trains a diffusion model in latent space using a transformer backbone and integrates a Multimodal Large Language Model (MLLM) to refine captions and support conversational image generation. It supports adapters like ControlNet, IP-Adapter, LoRA, and can run under constrained VRAM via distillation versions. LoRA, ControlNet (pose, depth,...

Downloads: 9 This Week

Last Update: 2025-11-27
See Project
2

DeepSeek-OCR

Contexts Optical Compression

...It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body text, interpreting tables, or recognizing handwritten versus printed words. It supports local deployment, enabling organizations concerned about privacy or latency to run the pipeline on-premises rather than send sensitive documents to third-party cloud services. The codebase is written in Python with a focus on modularity: you can swap preprocessing, recognition, and post-processing components as needed for custom workflows.

Downloads: 9 This Week

Last Update: 2025-10-25
See Project
3

OpenAI Quickstart Node

Node.js example app from the OpenAI API quickstart tutorial

...The repository provides structured sample code for a variety of API endpoints, including chat completions, assistants, embeddings, fine-tuning, moderation, batch processing, and image generation. Each folder contains runnable scripts that demonstrate both basic usage and more advanced scenarios. By following the examples, developers can quickly understand how to authenticate with an API key, send requests, and handle responses within a Node.js environment. The project is a practical starting point for building AI-powered applications, serving as a foundation for experimentation and integration into larger projects. ...

Downloads: 1 This Week

Last Update: 3 days ago
See Project
4

MATLAB Deep Learning Model Hub

Discover pretrained models for deep learning in MATLAB

Discover pretrained models for deep learning in MATLAB. Pretrained image classification networks have already learned to extract powerful and informative features from natural images. Use them as a starting point to learn a new task using transfer learning. Inputs are RGB images; the output is the predicted label and score.

Downloads: 0 This Week

Last Update: 2024-10-11
See Project
5

Step3-VL-10B

Multimodal model achieving SOTA performance

...It achieves this efficiency and strong performance through unified pre-training on a massive 1.2 trillion-token multimodal corpus that jointly optimizes a language-aligned perception encoder with a powerful decoder, creating deep synergy between image processing and text understanding.

Downloads: 4 This Week

Last Update: 2 days ago
See Project
6

DreamCraft3D

Official implementation of DreamCraft3D

DreamCraft3D is DeepSeek’s generative 3D modeling framework / model family that likely extends earlier text-to-3D approaches (e.g. Shap-E or Point-E style models) with more capability, control, or expression. The name suggests a “dream crafting” metaphor: users probably supply textual or image prompts and generate 3D assets (point clouds, meshes, scenes). The repository includes model code, inference scripts, sample prompts, and possibly dataset preparation pipelines. It may integrate rendering or post-processing modules (e.g. mesh smoothing, texturing) to make the outputs closer to ready for use. Because 3D generation is hardware-intensive, the repository likely also includes optimizations like quantization, pruning, or inference accelerations (e.g. using FlashMLA or DeepEP) to make the generation pipeline faster or more efficient. ...

Downloads: 2 This Week

Last Update: 2025-10-03
See Project
7

Depth Pro

Sharp Monocular Metric Depth in Less Than a Second

Depth Pro is a foundation model for zero-shot metric monocular depth estimation, producing sharp, high-frequency depth maps with absolute scale from a single image. Unlike many prior approaches, it does not require camera intrinsics or extra metadata, yet still outputs metric depth suitable for downstream 3D tasks. Apple highlights both accuracy and speed: the model can synthesize a ~2.25-megapixel depth map in around 0.3 seconds on a standard GPU, enabling near real-time applications. The...

Downloads: 2 This Week

Last Update: 2025-10-08
See Project
8

GLM-4.5V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding,...

Downloads: 1 This Week

Last Update: 4 days ago
See Project
9

Dolphin

Document Image Parsing via Heterogeneous Anchor Prompting

Dolphin, maintained by ByteDance, is a multimodal model for document image parsing. It follows an analyze-then-parse approach: it first analyzes the page layout to produce a sequence of elements in reading order, then parses those elements in parallel using heterogeneous anchors and task-specific prompts.

Downloads: 0 This Week

Last Update: 2025-12-17
See Project
10

OAGI Python SDK

Python SDK for the Computer Use model Lux, developed by OpenAGI

OAGI Python SDK is a Python client library for the Lux computer-use model that turns Lux into a programmable automation layer for operating human-facing software via vision and actions. It exposes the OAGI API in an ergonomic way, letting you trigger Lux in three main modes: Tasker for precise scripted sequences, Actor for fast one-shot tasks, and Thinker for open-ended, multi-step objectives. The SDK is designed around “computer use” as a paradigm, where the AI actually navigates...

Downloads: 1 This Week

Last Update: 1 day ago
See Project
11

MiniMax-01

Large-language-model & vision-language-model based on Linear Attention

MiniMax-01 is the official repository for two flagship models: MiniMax-Text-01, a long-context language model, and MiniMax-VL-01, a vision-language model built on top of it. MiniMax-Text-01 uses a hybrid attention architecture that blends Lightning Attention, standard softmax attention, and Mixture-of-Experts (MoE) routing to achieve both high throughput and long-context reasoning. It has 456 billion total parameters with 45.9 billion activated per token and is trained with advanced parallel...

Downloads: 0 This Week

Last Update: 2025-12-01
See Project
12

ADAMS

ADAMS is a workflow engine for building complex knowledge workflows.

...This allows rapid development and easy maintenance of large workflows, with hundreds or thousands of operators. Operators include machine learning (WEKA, MOA, MEKA) and image processing (ImageJ, JAI, BoofCV, LIRE and Gnuplot). R is available via Rserve. A WEKA webservice allows other frameworks to use WEKA models. Fast prototyping with Groovy and Jython. Read/write support for various databases and spreadsheet applications.

Downloads: 5 This Week

Last Update: 2024-03-21
See Project
13

HunyuanVideo-I2V

A Customizable Image-to-Video Model based on HunyuanVideo

HunyuanVideo-I2V is a customizable image-to-video generation framework developed by Tencent, extending the capabilities of HunyuanVideo. It allows for high-quality video creation from still images, using PyTorch and providing pre-trained model weights, inference code, and customizable training options. The system includes LoRA training code for adding special effects and enhancing video realism, aiming to offer versatile and scalable solutions for generating videos from static image inputs.

1 Review

Downloads: 9 This Week

Last Update: 2025-03-10
See Project
14

MediaPipe Face Detection

Detect faces in an image

The MediaPipe Face Detection model is a high-performance, real-time face detection solution that uses machine learning to identify faces in images and video streams. It is optimized for mobile and embedded platforms, offering fast and accurate face detection while maintaining a small memory footprint. This model supports multiple face detections and is highly efficient, making it suitable for a variety of applications such as augmented reality, user authentication, and facial expression analysis.

Downloads: 4 This Week

Last Update: 2025-03-19
See Project
15

FrankMocap

A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

FrankMocap is a monocular 3D human capture system that estimates body, hand, and optionally face pose from a single RGB image or video. It regresses parametric human models (e.g., SMPL/SMPL-X) directly, producing temporally stable meshes and joint angles suitable for animation or analytics. The pipeline couples a robust 2D keypoint detector with 3D mesh regression networks and priors that keep results anatomically plausible. It can run frame-by-frame or with temporal smoothing, and includes demo apps for live webcam capture as well as batch processing. ...

Downloads: 1 This Week

Last Update: 2025-10-07
See Project
16

Spectral Python

A Python module for hyperspectral image processing

Spectral Python (SPy) is a Python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.

4 Reviews

Downloads: 0 This Week

Last Update: 2020-08-01
See Project
17

OpenPR

OpenPR stands for Open Pattern Recognition project and is intended to be an open source library for algorithms of image processing, computer vision, natural language processing, pattern recognition, machine learning and the related fields.

Downloads: 0 This Week

Last Update: 2018-05-15
See Project
18

Training Image Operators from Samples

Tools to train Image Operators automatically from a set of samples.

TRIOS - Training Image Operators from Samples is a set of tools to bring Image Processing closer to scientists in general. It is capable of estimating an operator between two images using only pairs of samples that contain an input image and the desired output. The operator is saved to a file and can be applied to any image.

Downloads: 0 This Week

Last Update: 2017-07-31
See Project
19

ImageApp - Java Advanced Imaging GUI

An IDE for people interested in Machine Vision/Image Processing. Written in Java, using JAI. It allows users to view image data and also provides a drag-and-drop environment in which users can create and execute graphs of JAI operators.

Downloads: 0 This Week

Last Update: 2017-01-21
See Project
20

libcrn

libcrn is a document image processing library written in C++11 for Linux, Windows, Mac OS X and Google Android. It is a toolbox that makes it easy to create software such as OCR engines and layout analysis tools.

Downloads: 0 This Week

Last Update: 2016-10-23
See Project
21

WebDjVuTextEd

Edit the OCR text layer of DjVu documents in a web browser

WebDjVuTextEd lets you edit the text layer of OCR'ed DjVu documents in a web browser. You can modify the structure (paragraphs, lines, words...), create, delete, and edit text nodes, modify their container boxes with the mouse, and run a spellchecker. The program does not read DjVu files directly; it requires exported XML text data and images. When used without a web server, you can open and save local files, but cannot take advantage of auto-save and spell checking. Note that current SVN...

Downloads: 0 This Week

Last Update: 2015-11-21
See Project
22

QVision: Computer Vision Library for Qt

Computer vision and image processing library for Qt.

This library contains among other things a set of graphical widgets for video output, performance evaluation and augmented reality. The library also provides classes for several data types usually required by computer vision and image processing applications such as vectors, matrices, quaternions and images. Thanks to a large number of wrapper functions these objects can be used with highly efficient functionality from third party libraries such as OpenCV, GNU Scientific Library, Computational Geometry Algorithms Library, Intel's Math Kernel Library and Integrated Performance Primitives, the Octave library, etc...

Downloads: 1 This Week

Last Update: 2013-07-02
See Project
23

Math Transformations Library

A library analogous to those included in MATLAB without the need for external libraries; just right for embedded use or static linking. MTL was used to build a 3D scanner. MTL consists of parts: B - Basic Functions, Matrices, Images, Hypermodels (3D models and up); N - Numeric Functions ranging from linear regression over nonlinear optimization to singular-value computation; I - Image filters and image enhancement; H - Hardware related (optional part), does require additional libraries and is only...

Downloads: 0 This Week

Last Update: 2016-11-02
See Project
24

Flavia Plant Leaf Recognition System

The purpose of this program is to teach a computer to classify plants by their leaves. You just need to input the image of a leaf (acquired from a scanner or camera), and the computer can tell you what kind of plant it is.

Downloads: 6 This Week

Last Update: 2013-04-17
See Project
25

contraCaptcha

ContraCaptcha is a Java-based library for solving (image-based) captchas. It provides classes for image processing and character recognition. It also includes solvers for some of the popular captcha systems in use. DISCLAIMER: for learning purposes only (;

Downloads: 0 This Week

Last Update: 2016-07-23
See Project
