Showing 286 open source projects for "segmentation"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    ProStack

    ProStack

    ProStack - a platform for image processing and analysis

    ProStack - a platform for image processing and analysis. It implements various image processing methods as separate modules, that can be joined in a complex image processing scenario by use of a graphical user interface. RPMs are available at https://build.opensuse.org/project/repositories/home:mackoel:compbio
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Flying-Bird-Wallpaper

    Flying-Bird-Wallpaper

    Flying Bird Wallpaper is a feature-rich desktop wallpaper application

    Flying Bird Wallpaper is a feature-rich desktop wallpaper application that supports multiple wallpaper types including images, videos, rhythm wallpapers, and solid colors, making your desktop unique and vibrant.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    ChatbotX

    ChatbotX

    A self-hosted, white-label alternative to ManyChat for agencies.

    ChatbotX is an open-source omnichannel chatbot platform built as a modern alternative to ManyChat for agencies and businesses that want full control through self-hosting, white-labeling, and reselling. Build AI-powered workflows with a visual flow builder, live chat inbox, CRM, broadcasts, sequences, analytics, and advanced automation tools. Connect across WhatsApp, Facebook, Instagram, Telegram, Zalo, Email, and Webchat with support for rich messaging, comment-to-DM, triggers, webhooks, and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Tally

    Tally

    Your favorite dark mode word counter, now with grammar checking!

    Tally - Word Counter is a free online tool to count the number of characters, words, paragraphs, and lines in your text. It can also show counts for different types of characters like letters, digits, spaces, punctuation, and symbols/special characters. Make sure you have the right number of words for your essay or post by counting them instantly with Tally.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    PyDenseCRF

    PyDenseCRF

    Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs

    ...The project allows developers and researchers to integrate Dense CRF inference into Python-based machine learning pipelines, particularly for computer vision tasks such as image segmentation and labeling. Conditional Random Fields are probabilistic graphical models used to model contextual relationships between neighboring pixels or features, improving prediction consistency across images. By implementing a fully connected CRF model with Gaussian edge potentials, the library enables efficient inference across all pixel pairs in an image rather than only local neighborhoods. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    DeepSeek MoE

    DeepSeek MoE

    Towards Ultimate Expert Specialization in Mixture-of-Experts Language

    DeepSeek-MoE (“DeepSeek MoE”) is the DeepSeek open implementation of a Mixture-of-Experts (MoE) model architecture meant to increase parameter efficiency by activating only a subset of “expert” submodules per input. The repository introduces fine-grained expert segmentation and shared expert isolation to improve specialization while controlling compute cost. For example, their MoE variant with 16.4B parameters claims comparable or better performance to standard dense models like DeepSeek 7B or LLaMA2 7B using about 40% of the total compute. The repo publishes both Base and Chat variants of the 16B MoE model (deepseek-moe-16b) and provides evaluation results across benchmarks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    tdsft

    tdsft

    TDSFT (Two-Dimensional Segmentation Fusion Tool)

    ...Numerous algorithms have been developed over time, but to date, there is no validated method for this procedure. Therefore, research is still active in this area. Two-Dimensional Segmentation Fusion Tool (TDSFT) is an open-source tool developed in MATLAB and distributed as a standalone application for MAC, Linux, and Windows, which offers a simple and extensible interface where numerous algorithms are proposed to "mediate" (e.g., process and fuse) multiple segmentations. TDSFT is a tool made with ease of use as a fixed point, to support and help medical specialists during their work. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Detectron

    Detectron

    FAIR's research platform for object detection research

    Detectron is an object detection and instance segmentation research framework that popularized many modern detection models in a single, reproducible codebase. Built on Caffe2 with custom CUDA/C++ operators, it provided reference implementations for models like Faster R-CNN, Mask R-CNN, RetinaNet, and Feature Pyramid Networks. The framework emphasized a clean configuration system, strong baselines, and a “model zoo” so researchers could compare results under consistent settings. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ControlNet

    ControlNet

    Let us control diffusion models

    ...Rather than training from scratch, ControlNet “locks” the weights of a pre-trained diffusion model and introduces a parallel trainable branch that learns additional conditions—like edges, depth maps, segmentation, human pose, scribbles, or other guidance signals. This allows the system to control where and how the model should focus during generation, enabling users to steer layout, structure, and content more precisely than prompt text alone. The project includes many trained model variants that accept different types of conditioning (e.g., canny edge input, normal maps, skeletal pose) and produce improved fidelity in stable diffusion outputs. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    ICCV2023-Paper-Code-Interpretation

    ICCV2023-Paper-Code-Interpretation

    ICCV2021/2019/2017 Paper/Code/Interpretation/Live Broadcast Collection

    ...The repository organizes papers and implementations into categories, allowing readers to explore different areas of computer vision research such as detection, segmentation, and generative models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    FastViT

    FastViT

    This repository contains the official implementation of research

    ...The models use lightweight attention and carefully engineered blocks to minimize token mixing costs while preserving representation power. Training and inference recipes highlight straightforward integration into common vision tasks such as classification, detection, and segmentation. The codebase provides reference implementations and checkpoints that make it easy to evaluate or fine-tune on downstream datasets. In practice, FastViT offers drop-in backbones that reduce compute and memory pressure without exotic training tricks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    funNLP

    funNLP

    Resources, corpora, and tools for Chinese natural language processing

    ...The repository is organized into categories such as sentiment analysis, text classification, named entity recognition, knowledge graphs, and various lexicons (e.g. sensitive words, emotion dictionaries, stopwords). It also includes links to academic papers, open-source model implementations, and practical utilities like word segmentation or text cleaning scripts. The project is highly community-oriented, frequently updated with contributions and new resources, and it’s widely used in both academic and applied NLP research. Its value lies in providing not just tools but also curated, domain-specific data, which can be hard to find elsewhere.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    daily-paper-computer-vision

    daily-paper-computer-vision

    Document papers compiled daily in computer vision/deep learning

    This repo is a running feed of computer-vision research, tracking new papers and notable results so practitioners can keep up without scouring multiple sites. It’s organized chronologically and often thematically, making it easy to scan what’s new in detection, segmentation, recognition, generative vision, 3D, and video understanding. The cadence is intentionally frequent, reflecting how quickly CV advances and how hard it is to maintain awareness while working full time. By aggregating paper titles and references in one place, it reduces the overhead of deciding what to read next and helps you spot trends early. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TaskMatrix

    TaskMatrix

    Enable sending and receiving images during chatting

    ...Originally introduced alongside the Visual ChatGPT concept, TaskMatrix acts as an orchestration framework where a central language model delegates subtasks to domain-specific AI systems such as image generators, segmentation tools, or recognition models. The architecture focuses on modularity, allowing new APIs and foundation models to be integrated as interchangeable task-solving components. The project also explores low-code human-AI interaction workflows that improve controllability and transparency during complex task execution.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Lightning Flash

    Lightning Flash

    Flash enables you to easily configure and run complex AI recipes

    ...All data loading in Flash is performed via a from_* classmethod on a DataModule. Which DataModule to use and which from_* methods are available depends on the task you want to perform. For example, for image segmentation where your data is stored in folders, you would use the from_folders method of the SemanticSegmentationData class. Our tasks come loaded with pre-trained backbones and (where applicable) heads. You can view the available backbones to use with your task using available_backbones. With Flash, swapping among 40+ optimizers and 15 + schedulers recipes are simple.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Golang HLS Streamer

    Golang HLS Streamer

    A server that exposes a directory for video streaming

    Golang HLS Streamer is a Go-based implementation of HTTP Live Streaming (HLS) functionality, designed to handle media segmentation and playlist generation for streaming applications. It provides tools for creating and managing HLS streams, including segmenting video into smaller chunks and generating M3U8 playlists. The project is intended for developers building streaming servers or media delivery systems. It focuses on performance and simplicity, leveraging Go’s concurrency model to handle streaming tasks efficiently. gohls can be integrated into backend services to enable adaptive streaming workflows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    NÜWA - Pytorch

    NÜWA - Pytorch

    Implementation of NÜWA, attention network for text to video synthesis

    ...However, I will continue on with NUWA, extending it to use multi-headed codes + hierarchical causal transformer. I think that direction is untapped for improving on this line of work. In the paper, they also present a way to condition the video generation based on segmentation mask(s). You can easily do this as well, given you train a VQGanVAE on the sketches beforehand. Then, you will use NUWASketch instead of NUWA, which can accept the sketch VAE as a reference. This repository will also offer a variant of NUWA that can produce both video and audio. For now, the audio will need to be encoded manually.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    MMTracking

    MMTracking

    OpenMMLab Video Perception Toolbox

    ...We are the first open-source toolbox that unifies versatile video perception tasks include video object detection, multiple object tracking, single object tracking and video instance segmentation. We decompose the video perception framework into different components and one can easily construct a customized method by combining different modules. MMTracking interacts with other OpenMMLab projects. It is built upon MMDetection that we can capitalize any detector only through modifying the configs. All operations run on GPUs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    BEVFormer

    BEVFormer

    Implementation of BEVFormer, a camera-only framework

    3D visual perception tasks, including 3D detection and map segmentation based on multi-camera images, are essential for autonomous driving systems. In this work, we present a new framework termed BEVFormer, which learns unified BEV representations with spatiotemporal transformers to support multiple autonomous driving perception tasks. In a nutshell, BEVFormer exploits both spatial and temporal information by interacting with spatial and temporal space through predefined grid-shaped BEV queries. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Official YOLOv7

    Official YOLOv7

    YOLOv7: Trainable bag-of-freebies sets new state-of-the-art

    ...YOLOv7 introduced training-time improvements that raise accuracy without increasing inference cost, which is why the project became important in real-time detection research. It supports multiple model sizes and related tasks such as object detection and instance segmentation through associated branches or weights. It is useful for researchers, engineers, and developers building detection systems for video, edge devices, robotics, analytics, and industrial vision.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    PoreMeth

    PoreMeth is a tool for the identification of DMRs from Nanopore data.

    PoreMeth is a tool for the detection of differentially methylated regions (DMRs) from nanopore sequencing data of two samples. PoreMeth tool is based on a heterogeneous form of the shifting level model, which integrates distances between consecutive CpGs in the segmentation algorithm. PoreMeth is fed with ∆β values and generates genomic segments with increased (hyper-) or decreased (hypo-) methylation levels between two samples, which are then evaluated for statistical significance using the Wilcoxon-rank sum test.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Mask2Former

    Mask2Former

    Code release for "Masked-attention Mask Transformer

    Mask2Former is a unified segmentation architecture that handles semantic, instance, and panoptic segmentation with one model and one training recipe. Its core idea is to cast segmentation as mask classification: a transformer decoder predicts a set of mask queries, each with an associated class score, eliminating the need for task-specific heads. A pixel decoder fuses multi-scale features and feeds masked attention in the transformer so each query focuses computation on its current spatial support. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Unet

    Unet

    Source code for unet-pytorch, which can train its own model

    ...Its README notes that U-Net is better suited to datasets with fewer features and shallow visual structures, such as medical image segmentation, rather than complex VOC-style scenes. It is useful for developers and students who want a clear U-Net implementation for segmentation experiments, custom masks, and biomedical-style image analysis.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24

    avio

    Python version of ffplay with built-in AI

    See the Files tab above for installation instructions
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    DeepLabv3 Plus

    DeepLabv3 Plus

    Encoder-Decoder with Atrous Separable Convolution

    ...The project also supports multi-GPU training, multiple backbones, learning rate schedules with step and cosine options, optimizer selection, and adaptive learning rate behavior based on batch size. It is useful for users who want a stronger semantic segmentation baseline than U-Net for scene-level segmentation tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo