Showing 23 open source projects for "segmentation"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    MatImage

    MatImage

    Image Processing library for Matlab

    matImage is an open-source MATLAB library for image processing and analysis. It provides a variety of tools for image enhancement, segmentation, and feature extraction. It’s especially useful for users working on biomedical images or those needing detailed image analysis in MATLAB.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Albumentations

    Albumentations

    Fast image augmentation library and an easy-to-use wrapper

    ...Albumentations efficiently implements a rich variety of image transform operations that are optimized for performance, and does so while providing a concise, yet powerful image augmentation interface for different computer vision tasks, including object classification, segmentation, and detection. Albumentations supports different computer vision tasks such as classification, semantic segmentation, instance segmentation, object detection, and pose estimation. Albumentations works well with data from different domains: photos, medical images, satellite imagery, manufacturing and industrial applications, Generative Adversarial Networks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    CO3D (Common Objects in 3D)

    CO3D (Common Objects in 3D)

    Tooling for the Common Objects In 3D dataset

    ...It builds upon the original CO3Dv1 dataset, expanding both scale and quality—featuring 2× more sequences and 4× more frames, with improved image fidelity, more accurate segmentation masks, and enhanced annotations for object-centric 3D reconstruction. CO3Dv2 enables research in multi-view 3D reconstruction, novel view synthesis, and geometry-aware representation learning. Each of the thousands of sequences in CO3Dv2 captures a common object (from categories like cars, chairs, or plants) from multiple real-world viewpoints. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    ncnn

    ncnn

    High-performance neural network inference framework for mobile

    ...It is cross-platform and supports most commonly used CNN networks, including Classical CNN (VGG AlexNet GoogleNet Inception), Face Detection (MTCNN RetinaFace), Segmentation (FCN PSPNet UNet YOLACT), and more. ncnn is currently being used in a number of Tencent applications, namely: QQ, Qzone, WeChat, and Pitu.
    Downloads: 22 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 5
    UCO3D

    UCO3D

    Uncommon Objects in 3D dataset

    uCO3D is a large-scale 3D vision dataset and toolkit centered on turn-table videos of everyday objects drawn from the LVIS taxonomy. It provides about 170,000 full videos per object instance rather than still frames, along with per-video annotations including object masks, calibrated camera poses, and multiple flavors of point clouds. Each sequence also ships with a precomputed 3D Gaussian Splat reconstruction, enabling fast, differentiable rendering workflows and modern implicit/point-based...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    spaCy

    spaCy

    Industrial-strength Natural Language Processing (NLP)

    spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it was designed to be used for real world applications-- for building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks, with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    fvcore

    fvcore

    Collection of common code shared among different research projects

    fvcore is a lightweight utility library that factors out common performance-minded components used across Facebook/Meta computer-vision codebases. It provides numerics and loss layers (e.g., focal loss, smooth-L1, IoU/GIoU) implemented for speed and clarity, along with initialization helpers and normalization layers for building PyTorch models. Its common modules include timers, logging, checkpoints, registry patterns, and configuration helpers that reduce boilerplate in research code. A...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    smartstr

    a convenience ANSI C (C89) library for easy strings

    ...So simple, that it misses a "string" type - instead, the programmer has to compose with arrays of characters. This may often lead to bugs: buffer overflows because the string outgrew its allocated space, segmentation faults when something or someone forgot to end the char array with a null terminator, memory leaks, etc. Because I was tired of fighting with stupid strings all the time, I created smartstr.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Official YOLOv7

    Official YOLOv7

    YOLOv7: Trainable bag-of-freebies sets new state-of-the-art

    ...YOLOv7 introduced training-time improvements that raise accuracy without increasing inference cost, which is why the project became important in real-time detection research. It supports multiple model sizes and related tasks such as object detection and instance segmentation through associated branches or weights. It is useful for researchers, engineers, and developers building detection systems for video, edge devices, robotics, analytics, and industrial vision.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Unet

    Unet

    Source code for unet-pytorch, which can train its own model

    ...Its README notes that U-Net is better suited to datasets with fewer features and shallow visual structures, such as medical image segmentation, rather than complex VOC-style scenes. It is useful for developers and students who want a clear U-Net implementation for segmentation experiments, custom masks, and biomedical-style image analysis.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    DeepLabv3 Plus

    DeepLabv3 Plus

    Encoder-Decoder with Atrous Separable Convolution

    ...The project also supports multi-GPU training, multiple backbones, learning rate schedules with step and cosine options, optimizer selection, and adaptive learning rate behavior based on batch size. It is useful for users who want a stronger semantic segmentation baseline than U-Net for scene-level segmentation tasks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Nerfies

    Nerfies

    This is the code for Deformable Neural Radiance Fields

    ...This lets the system generate photorealistic novel views of nonrigid subjects—faces, bodies, cloth—while preserving fine detail and consistent lighting. The training pipeline handles imperfect captures by modeling camera poses, exposure variations, and background segmentation, producing stable geometry and appearance. A set of utilities manages dataset preparation, pose estimation, and checkpoints so researchers can reproduce results on their own footage. The work sits at the intersection of graphics and vision, showing how learned volumetric rendering can handle human motion without dense markers or studio rigs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Minkowski Engine

    Minkowski Engine

    Auto-diff neural network library for high-dimensional sparse tensors

    The Minkowski Engine is an auto-differentiation library for sparse tensors. It supports all standard neural network layers such as convolution, pooling, unspooling, and broadcasting operations for sparse tensors. The Minkowski Engine supports various functions that can be built on a sparse tensor. We list a few popular network architectures and applications here. To run the examples, please install the package and run the command in the package root directory. Compressing a neural network to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TNN

    TNN

    Uniform deep learning inference framework for mobile

    TNN, a high-performance, lightweight neural network inference framework open sourced by Tencent Youtu Lab. It also has many outstanding advantages such as cross-platform, high performance, model compression, and code tailoring. The TNN framework further strengthens the support and performance optimization of mobile devices on the basis of the original Rapidnet and ncnn frameworks. At the same time, it refers to the high performance and good scalability characteristics of the industry's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    ...Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc.. Provide a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). Trainer provides a variety of built-in Callback functions to facilitate experiment recording, exception capture, etc. Automatic download of some datasets and pre-trained models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SwiftOCR

    SwiftOCR

    Fast and simple OCR library written in Swift

    SwiftOCR is a fast and simple OCR library written in Swift. It uses a neural network for image recognition. As of now, SwiftOCR is optimized for recognizing short, one-line long alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it exceeds. Tesseract is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Replica Dataset

    Replica Dataset

    High-fidelity indoor 3D dataset for AI simulation and robotics

    Replica Dataset is a high-quality 3D dataset of realistic indoor environments designed to advance research in computer vision, robotics, and embodied AI. Developed by Facebook Research (now Meta AI), it features accurate geometric reconstructions, high-resolution and high dynamic range textures, and comprehensive semantic annotations. Each environment contains detailed models of real-world spaces, including rooms, furniture, glass, and mirror surfaces. The dataset also provides semantic and...
    Downloads: 263 This Week
    Last Update:
    See Project
  • 18
    Talk-OS

    Talk-OS

    Open source version of jianliao

    talk-os is an open-source team chat platform that delivers real-time messaging, channels, and file sharing for collaborative work. It organizes conversations into public rooms and private groups, supports mentions and notifications, and provides a persistent message history with search so teams can catch up quickly. The system is designed around live updates, pushing new messages and presence changes to connected clients without page reloads. File attachments, emojis, and basic rich content...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    WrapImaJ

    Multi-platform API for Image Processing systems in Life Sciences

    WrapImaJ purposes to be a multi-platform wrapper for different Image Processing systems for: - using the Java programming language. The purpose of WrapImaJ is not to combine an exhaustive collection of all functionalities of different imaging system, but to offer a simple, concise Application Programming Interface (API) - allowing to develop imaging software, the source code of which is independent from the underlying imaging system on which it relies. In it's current form, it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    CRFasRNN

    CRFasRNN

    Semantic image segmentation method described in the ICCV 2015 paper

    CRF-RNN is a deep neural architecture that integrates fully connected Conditional Random Fields (CRFs) with Convolutional Neural Networks (CNNs) by reformulating mean-field CRF inference as a Recurrent Neural Network. This fusion enables end-to-end training via backpropagation for semantic image segmentation tasks, eliminating the need for separate, offline post-processing steps. Our work allows computers to recognize objects in images, what is distinctive about our work is that we also recover the 2D outline of objects. Currently we have trained this model to recognize 20 classes. This software allows you to test our algorithm on your own images – have a try and see if you can fool it, if you get some good examples you can send them to us. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Medical Data Segmentation Toolkit
    MDSTk is a collection of 2D/3D image processing tools aimed at medical images. It contains routines for volume data processing (3D filtering, segmentation, etc.) as well as fast low-level vector graphics library for surface and tetrahedral meshing. MDSTk has been forked by 3Dim Laboratory s.r.o. to provide better support and further push its development forward. Please check VPL (Voxel Processing Library) project available on Bitbucket (https://bitbucket.org/3dimlab/vpl).
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    A collection of open source libraries and tools that provide solutions for common problems in processing Arabic text, especially in web applications. text normalization, phrase segmentation, text indexing, stop word lists, common spelling mistakes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    ocrlib
    ...In library: contour recognition; contour vectorisation; matrix letters feature recognition; web based GUI; assembler core on SS3 instruction; xml support; detect page rotation and segmentation;
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo