Search Results for "image processing framework" - Page 10

Showing 1778 open source projects for "image processing framework"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    SAHI

    SAHI

    A lightweight vision library for performing large object detection

    ...Detection of small objects and objects far away in the scene is a major challenge in surveillance applications. Such objects are represented by small number of pixels in the image and lack sufficient details, making them difficult to detect using conventional detectors. In this work, an open-source framework called Slicing Aided Hyper Inference (SAHI) is proposed that provides a generic slicing aided inference and fine-tuning pipeline for small object detection.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Ray

    Ray

    A unified framework for scalable computing

    Modern workloads like deep learning and hyperparameter tuning are compute-intensive and require distributed or parallel execution. Ray makes it effortless to parallelize single machine code — go from a single CPU to multi-core, multi-GPU or multi-node with minimal code changes. Accelerate your PyTorch and Tensorflow workload with a more resource-efficient and flexible distributed execution framework powered by Ray. Accelerate your hyperparameter search workloads with Ray Tune. Find the best...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Open Vision Agents by Stream

    Open Vision Agents by Stream

    Build Vision Agents quickly with any model or video provider

    Open Vision Agents by Stream is an open source framework from Stream for building real time, multimodal AI agents that watch, listen, and respond to live video streams. It focuses on combining video understanding models, such as YOLO and Roboflow based detectors, with real time large language models like OpenAI Realtime and Gemini Live to create interactive experiences. The framework uses Stream’s ultra low latency edge network so agents can join sessions quickly and maintain very low audio and video latency while processing frames and generating responses. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Readest

    Readest

    Readest is a modern, feature-rich ebook reader

    Readest is a project meant to facilitate reading, studying, or consuming content by integrating reading tools with AI-powered assistance. Although the repository is not as widely documented or popular as some, the idea is that Readest supports features to help with reading comprehension — likely combining OCR / text retrieval, translation, note-taking, or summarization for reading materials (eBooks, articles, PDFs). The goal appears to be to let users feed in arbitrary reading material and...
    Downloads: 15 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 5
    GeoStats.jl

    GeoStats.jl

    An extensible framework for geospatial data science

    GeoStats.jl is a Julia framework for geospatial data science and geostatistical modeling. It’s fully implemented in Julia and designed to provide an extensible, high-performance stack that handles spatial domains, interpolation, simulation, learning, and visualization. The package is modular: it breaks out geometry, spatial domains, transforms, variograms, covariance models, and modeling into subpackages (e.g., GeoStatsBase, GeoStatsModels, GeoStatsTransforms). Users can represent...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    TEN

    TEN

    Open-source framework for conversational voice AI agents

    TEN (Transformative Extensions Network) is an open source framework designed to empower developers to build real-time multimodal AI agents capable of voice, video, text, image, and data-stream interaction with ultra-low latency. It includes a full ecosystem, TEN Turn Detection, TEN Agent, and TMAN Designer, allowing developers to rapidly assemble human-like, responsive agents that can see, speak, hear, and interact.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Caesium - Image Compressor
    !! THIS PROJECT HAS BEEN MOVED!! https://github.com/Lymphatus/caesium-image-compressor Caesium reduces the size of your picture up to 90%, preserving the original visual quality. Allows you to save a lot of space and easily upload your pictures on the web in a moment. The software is user-friendly with a simple and clear interface.
    Downloads: 101 This Week
    Last Update:
    See Project
  • 8
    Vinix

    Vinix

    An effort to write a modern, fast, and useful operating system

    Vinix is an experimental Unix-like operating system written in V, a statically typed compiled programming language known for its simplicity and performance. The project aims to create a minimalistic and efficient OS that adheres to Unix principles while leveraging the modern features of the V language. Vinix serves as both a proof of concept for system programming in V and a platform for exploring OS design concepts.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 9
    ergo

    ergo

    Framework for creating microservices using technologies of Erlang/OTP

    ...The easiest way to create an OTP-designed application in Golang. The goal of this project is to leverage Erlang/OTP experience with Golang performance. The ideal framework for creating complex and distributed solutions (machine learning, data processing pipeline, etc.) being simple and reliable. You don't have to reinvent the wheel. There are ready-to-use implemented design patterns. Two processes can be linked to each other. Termination one terminates another. Any process can monitor the service node. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    Haystack

    Haystack

    Haystack is an open source NLP framework to interact with your data

    Apply the latest NLP technology to your own data with the use of Haystack's pipeline architecture. Implement production-ready semantic search, question answering, summarization and document ranking for a wide range of NLP applications. Evaluate components and fine-tune models. Ask questions in natural language and find granular answers in your documents using the latest QA models with the help of Haystack pipelines. Perform semantic search and retrieve ranked documents according to meaning,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Wan Move

    Wan Move

    Motion-controllable Video Generation via Latent Trajectory Guidance

    Wan Move is an open-source research codebase for motion-controllable video generation that focuses on enabling fine-grained control of motion within generative video models. It is designed to guide the temporal evolution of visual content by leveraging latent trajectory guidance, allowing users to manipulate how objects move over time without modifying the underlying generative architecture. By representing motion information as dense point trajectories and integrating them into the latent...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Agentic Data Scientist

    Agentic Data Scientist

    An end-to-end Data Scientist

    ...Each agent is designed to independently call functions, interact with data sources, and adapt to uncertainties during processing, enabling iterative refinement of models without manual coordination. The framework supports interoperability with existing data tools and libraries, letting the agents leverage libraries like pandas, scikit-learn, and visualization frameworks to perform real computations rather than mock demonstrations.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Lingvo

    Lingvo

    Framework for building neural networks

    Lingvo is a TensorFlow based framework focused on building and training sequence models, especially for language and speech tasks. It was originally developed for internal research and later open sourced to support reproducible experiments and shared model implementations. The framework provides a structured way to define models, input pipelines, and training configurations using a common interface for layers, which encourages reuse across different tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    GraphicsMagick

    Swiss army knife of image processing

    GraphicsMagick provides a set of commandline tools and programming APIs for manipulating, editing, and converting raster and vector images. It is derived from ImageMagick, with the objective of providing better stability and performance than ImageMagick while retaining the original MIT/X11 license.
    Leader badge
    Downloads: 3,309 This Week
    Last Update:
    See Project
  • 15
    dxos

    dxos

    TypeScript implementation of the DXOS protocols, SDK, toolchain

    DXOS is a decentralized operating system framework that empowers developers to build local-first, collaborative applications without relying on central servers. By providing a comprehensive SDK and toolchain, DXOS facilitates the creation of apps that prioritize user privacy, offline functionality, and seamless peer-to-peer synchronization. Its flagship application, Composer, exemplifies the platform's capabilities by enabling users to organize and sync knowledge across devices, with support for real-time collaboration and local AI processing.​
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    OpenNext

    OpenNext

    Open-source Next.js adapter for AWS

    This project packages Next.js applications to run natively on AWS’s serverless stack, mapping framework features onto services like Lambda, CloudFront, and S3. It translates routing, API handlers, and middleware into deployable artifacts, handling edge cases such as image optimization, ISR, and streaming responses. Static assets are uploaded to object storage and cached at the edge, while dynamic routes are executed in short-lived compute with cold-start-aware build settings. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    DataChain

    DataChain

    AI-data warehouse to enrich, transform and analyze unstructured data

    ...Datachain can persist features of Python objects returned by AI models, and enables vectorized analytical operations over them. The typical use cases are data curation, LLM analytics and validation, image segmentation, pose detection, and GenAI alignment. Datachain is especially helpful if batch operations can be optimized – for instance, when synchronous API calls can be parallelized or where an LLM API offers batch processing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    MiniMax-01

    MiniMax-01

    Large-language-model & vision-language-model based on Linear Attention

    MiniMax-01 is the official repository for two flagship models: MiniMax-Text-01, a long-context language model, and MiniMax-VL-01, a vision-language model built on top of it. MiniMax-Text-01 uses a hybrid attention architecture that blends Lightning Attention, standard softmax attention, and Mixture-of-Experts (MoE) routing to achieve both high throughput and long-context reasoning. It has 456 billion total parameters with 45.9 billion activated per token and is trained with advanced parallel...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    TensorFlow

    TensorFlow

    TensorFlow is an open source library for machine learning

    ...Available across all common operating systems (desktop, server and mobile), TensorFlow provides stable APIs for Python and C as well as APIs that are not guaranteed to be backwards compatible or are 3rd party for a variety of other languages. The platform can be easily deployed on multiple CPUs, GPUs and Google's proprietary chip, the tensor processing unit (TPU). TensorFlow expresses its computations as dataflow graphs, with each node in the graph representing an operation. Nodes take tensors—multidimensional arrays—as input and produce tensors as output. The framework allows for these algorithms to be run in C++ for better performance, while the multiple levels of APIs let the user determine how high or low they wish the level of abstraction to be in the models produced. ...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 20
    Learning Interpretability Tool

    Learning Interpretability Tool

    Interactively analyze ML models to understand their behavior

    The Learning Interpretability Tool (LIT, formerly known as the Language Interpretability Tool) is a visual, interactive ML model-understanding tool that supports text, image, and tabular data. It can be run as a standalone server, or inside of notebook environments such as Colab, Jupyter, and Google Cloud Vertex AI notebooks.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    Tiled

    Tiled

    Flexible level editor

    Tiled is a free and open source, easy to use, and flexible level editor. Tiled is highly flexible. It can be used to create maps of any size, with no restrictions on tile size, or the number of layers or tiles that can be used. Maps, layers, tiles, and objects can all be assigned arbitrary properties. Tiled's map format (TMX) is easy to understand and allows multiple tilesets to be used in any map. Tilesets can be modified at any time. Tiled is a general purpose tile map editor for all...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 22
    Apache Sedona

    Apache Sedona

    Cluster computing framework for processing large-scale geospatial data

    Apache Sedona™ is a cluster computing system for processing large-scale spatial data. Sedona extends existing cluster computing systems, such as Apache Spark and Apache Flink, with a set of out-of-the-box distributed Spatial Datasets and Spatial SQL that efficiently load, process, and analyze large-scale spatial data across machines. According to our benchmark and third-party research papers, Sedona runs 2X - 10X faster than other Spark-based geospatial data systems on computation-intensive...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    RuleGo

    RuleGo

    Component orchestration rule engine framework for Go

    RuleGo is a high-performance, component-based rule engine implemented in Go. It’s lightweight, embeddable, orchestration-ready, and built for flexible composition of business logic into reusable components. No external middleware dependencies, efficient data processing and linkage on low-cost devices, suitable for IoT edge computing. Embedded and Standalone Deployment modes. Supports embedding RuleGo into existing applications. It can also be deployed independently as middleware, providing...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    Waifu2x-Extension-GUI

    Waifu2x-Extension-GUI

    Photo/Video/GIF enlargement using machine learning

    Image & GIF & Video Super-Resolution using Deep Convolutional Neural Networks. Built-in image processing algorithm: Waifu2x / SRMD / RealSR / Anime4K / ACNet Built-in image processing engine: Waifu2x-caffe / Waifu2x-converter / Waifu2x-ncnn-vulkan / SRMD-ncnn-vulkan / RealSR-ncnn-vulkan / Anime4KCPP Github: https://github.com/AaronFeng753/Waifu2x-Extension-GUI
    Leader badge
    Downloads: 726 This Week
    Last Update:
    See Project
  • 25
    Weaviate

    Weaviate

    Weaviate is a cloud-native, modular, real-time vector search engine

    Weaviate in a nutshell: Weaviate is a vector search engine and vector database. Weaviate uses machine learning to vectorize and store data, and to find answers to natural language queries. With Weaviate you can also bring your custom ML models to production scale. Weaviate in detail: Weaviate is a low-latency vector search engine with out-of-the-box support for different media types (text, images, etc.). It offers Semantic Search, Question-Answer-Extraction, Classification, Customizable...
    Downloads: 2 This Week
    Last Update:
    See Project