Showing 20 open source projects for "segmentation"

View related business solutions
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    MatImage

    MatImage

    Image Processing library for Matlab

    matImage is an open-source MATLAB library for image processing and analysis. It provides a variety of tools for image enhancement, segmentation, and feature extraction. It’s especially useful for users working on biomedical images or those needing detailed image analysis in MATLAB.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Albumentations

    Albumentations

    Fast image augmentation library and an easy-to-use wrapper

    ...Albumentations efficiently implements a rich variety of image transform operations that are optimized for performance, and does so while providing a concise, yet powerful image augmentation interface for different computer vision tasks, including object classification, segmentation, and detection. Albumentations supports different computer vision tasks such as classification, semantic segmentation, instance segmentation, object detection, and pose estimation. Albumentations works well with data from different domains: photos, medical images, satellite imagery, manufacturing and industrial applications, Generative Adversarial Networks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    ncnn

    ncnn

    High-performance neural network inference framework for mobile

    ...It is cross-platform and supports most commonly used CNN networks, including Classical CNN (VGG AlexNet GoogleNet Inception), Face Detection (MTCNN RetinaFace), Segmentation (FCN PSPNet UNet YOLACT), and more. ncnn is currently being used in a number of Tencent applications, namely: QQ, Qzone, WeChat, and Pitu.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 4
    spaCy

    spaCy

    Industrial-strength Natural Language Processing (NLP)

    spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it was designed to be used for real world applications-- for building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks, with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    fvcore

    fvcore

    Collection of common code shared among different research projects

    fvcore is a lightweight utility library that factors out common performance-minded components used across Facebook/Meta computer-vision codebases. It provides numerics and loss layers (e.g., focal loss, smooth-L1, IoU/GIoU) implemented for speed and clarity, along with initialization helpers and normalization layers for building PyTorch models. Its common modules include timers, logging, checkpoints, registry patterns, and configuration helpers that reduce boilerplate in research code. A...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Official YOLOv7

    Official YOLOv7

    YOLOv7: Trainable bag-of-freebies sets new state-of-the-art

    ...YOLOv7 introduced training-time improvements that raise accuracy without increasing inference cost, which is why the project became important in real-time detection research. It supports multiple model sizes and related tasks such as object detection and instance segmentation through associated branches or weights. It is useful for researchers, engineers, and developers building detection systems for video, edge devices, robotics, analytics, and industrial vision.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Unet

    Unet

    Source code for unet-pytorch, which can train its own model

    ...Its README notes that U-Net is better suited to datasets with fewer features and shallow visual structures, such as medical image segmentation, rather than complex VOC-style scenes. It is useful for developers and students who want a clear U-Net implementation for segmentation experiments, custom masks, and biomedical-style image analysis.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    DeepLabv3 Plus

    DeepLabv3 Plus

    Encoder-Decoder with Atrous Separable Convolution

    ...The project also supports multi-GPU training, multiple backbones, learning rate schedules with step and cosine options, optimizer selection, and adaptive learning rate behavior based on batch size. It is useful for users who want a stronger semantic segmentation baseline than U-Net for scene-level segmentation tasks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Nerfies

    Nerfies

    This is the code for Deformable Neural Radiance Fields

    ...This lets the system generate photorealistic novel views of nonrigid subjects—faces, bodies, cloth—while preserving fine detail and consistent lighting. The training pipeline handles imperfect captures by modeling camera poses, exposure variations, and background segmentation, producing stable geometry and appearance. A set of utilities manages dataset preparation, pose estimation, and checkpoints so researchers can reproduce results on their own footage. The work sits at the intersection of graphics and vision, showing how learned volumetric rendering can handle human motion without dense markers or studio rigs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Minkowski Engine

    Minkowski Engine

    Auto-diff neural network library for high-dimensional sparse tensors

    The Minkowski Engine is an auto-differentiation library for sparse tensors. It supports all standard neural network layers such as convolution, pooling, unspooling, and broadcasting operations for sparse tensors. The Minkowski Engine supports various functions that can be built on a sparse tensor. We list a few popular network architectures and applications here. To run the examples, please install the package and run the command in the package root directory. Compressing a neural network to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TNN

    TNN

    Uniform deep learning inference framework for mobile

    TNN, a high-performance, lightweight neural network inference framework open sourced by Tencent Youtu Lab. It also has many outstanding advantages such as cross-platform, high performance, model compression, and code tailoring. The TNN framework further strengthens the support and performance optimization of mobile devices on the basis of the original Rapidnet and ncnn frameworks. At the same time, it refers to the high performance and good scalability characteristics of the industry's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    ...Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc.. Provide a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). Trainer provides a variety of built-in Callback functions to facilitate experiment recording, exception capture, etc. Automatic download of some datasets and pre-trained models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SwiftOCR

    SwiftOCR

    Fast and simple OCR library written in Swift

    SwiftOCR is a fast and simple OCR library written in Swift. It uses a neural network for image recognition. As of now, SwiftOCR is optimized for recognizing short, one-line long alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it exceeds. Tesseract is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Replica Dataset

    Replica Dataset

    High-fidelity indoor 3D dataset for AI simulation and robotics

    Replica Dataset is a high-quality 3D dataset of realistic indoor environments designed to advance research in computer vision, robotics, and embodied AI. Developed by Facebook Research (now Meta AI), it features accurate geometric reconstructions, high-resolution and high dynamic range textures, and comprehensive semantic annotations. Each environment contains detailed models of real-world spaces, including rooms, furniture, glass, and mirror surfaces. The dataset also provides semantic and...
    Downloads: 1,023 This Week
    Last Update:
    See Project
  • 15
    Talk-OS

    Talk-OS

    Open source version of jianliao

    talk-os is an open-source team chat platform that delivers real-time messaging, channels, and file sharing for collaborative work. It organizes conversations into public rooms and private groups, supports mentions and notifications, and provides a persistent message history with search so teams can catch up quickly. The system is designed around live updates, pushing new messages and presence changes to connected clients without page reloads. File attachments, emojis, and basic rich content...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    WrapImaJ

    Multi-platform API for Image Processing systems in Life Sciences

    WrapImaJ purposes to be a multi-platform wrapper for different Image Processing systems for: - using the Java programming language. The purpose of WrapImaJ is not to combine an exhaustive collection of all functionalities of different imaging system, but to offer a simple, concise Application Programming Interface (API) - allowing to develop imaging software, the source code of which is independent from the underlying imaging system on which it relies. In it's current form, it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    CRFasRNN

    CRFasRNN

    Semantic image segmentation method described in the ICCV 2015 paper

    CRF-RNN is a deep neural architecture that integrates fully connected Conditional Random Fields (CRFs) with Convolutional Neural Networks (CNNs) by reformulating mean-field CRF inference as a Recurrent Neural Network. This fusion enables end-to-end training via backpropagation for semantic image segmentation tasks, eliminating the need for separate, offline post-processing steps. Our work allows computers to recognize objects in images, what is distinctive about our work is that we also recover the 2D outline of objects. Currently we have trained this model to recognize 20 classes. This software allows you to test our algorithm on your own images – have a try and see if you can fool it, if you get some good examples you can send them to us. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Medical Data Segmentation Toolkit
    MDSTk is a collection of 2D/3D image processing tools aimed at medical images. It contains routines for volume data processing (3D filtering, segmentation, etc.) as well as fast low-level vector graphics library for surface and tetrahedral meshing. MDSTk has been forked by 3Dim Laboratory s.r.o. to provide better support and further push its development forward. Please check VPL (Voxel Processing Library) project available on Bitbucket (https://bitbucket.org/3dimlab/vpl).
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    A collection of open source libraries and tools that provide solutions for common problems in processing Arabic text, especially in web applications. text normalization, phrase segmentation, text indexing, stop word lists, common spelling mistakes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    ocrlib
    ...In library: contour recognition; contour vectorisation; matrix letters feature recognition; web based GUI; assembler core on SS3 instruction; xml support; detect page rotation and segmentation;
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo