Search Results for "segmentation" - Page 2

Sort By:

Showing 286 open source projects for "segmentation"

View related business solutions

Mac Clear Filters & Widen Search

Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

Transformers

State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX

...Text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like speech recognition and audio classification. Transformers provides APIs to quickly download and use those pretrained models on a given text, fine-tune them on your own datasets and then share them with the community on our model hub. At the same time, each python module defining an architecture is fully standalone and can be modified to enable quick research experiments.

Downloads: 5 This Week

Last Update: 2026-06-15
See Project
2

segment-geospatial

A Python package for segmenting geospatial data with the SAM

The segment-geospatial package draws its inspiration from segment-anything-eo repository authored by Aliaksandr Hancharenka. To facilitate the use of the Segment Anything Model (SAM) for geospatial data, I have developed the segment-anything-py and segment-geospatial Python packages, which are now available on PyPI and conda-forge. My primary objective is to simplify the process of leveraging SAM for geospatial data analysis by enabling users to achieve this with minimal coding effort. I...

Downloads: 1 This Week

Last Update: 7 days ago
See Project
3

GeoAI

GeoAI: Artificial Intelligence for Geospatial Data

...It provides a unified framework that combines machine learning libraries such as PyTorch and Transformers with geospatial tools, allowing users to process satellite imagery, aerial photos, and vector datasets in a streamlined workflow. The platform supports a wide range of tasks including image classification, object detection, segmentation, and change detection, making it suitable for applications in environmental monitoring, urban planning, and disaster response. GeoAI simplifies complex workflows by offering high-level APIs that abstract data preprocessing, model training, and inference, reducing the technical barrier for users who are not experts in both AI and geospatial systems.

Downloads: 0 This Week

Last Update: 2026-06-02
See Project
4

The Hypersim Dataset

Photorealistic Synthetic Dataset for Holistic Indoor Scene

...It provides richly annotated renderings—RGB, depth, surface normals, instance and semantic segmentations, and material/lighting metadata—produced from high-fidelity virtual environments. The dataset spans diverse furniture layouts, room types, and camera trajectories, enabling robust training for geometry, segmentation, and SLAM-adjacent tasks. Rendering pipelines and utilities allow researchers to reproduce sequences, generate novel views, or extract task-specific supervision. Because the data are perfectly labeled and controllable, Hypersim is well suited for pretraining and for studying domain transfer to real imagery. The repository acts as both a dataset index and a set of scripts for downloading, managing, and evaluating on standardized splits.

Downloads: 0 This Week

Last Update: 2026-01-09
See Project
$300 Free Credits to Build on Google Cloud
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free
5

DINOv2

PyTorch code and models for the DINOv2 self-supervised learning

...The core promise is that a single pretrained backbone can transfer well to many downstream tasks—from linear probing on classification to retrieval, detection, and segmentation—often requiring little or no fine-tuning. The repository includes code for training, evaluating, and feature extraction, with utilities to run k-NN or linear evaluation baselines to assess representation quality. Pretrained checkpoints cover multiple model sizes so practitioners can trade accuracy for speed and memory depending on their deployment constraints.

Downloads: 1 This Week

Last Update: 2026-06-03
See Project
6

Zapret

Advanced open source tool for bypassing DPI-based censorship

Zapret is an open source, cross-platform tool designed to help users bypass and evade Deep Packet Inspection (DPI)-based traffic blocking and censorship systems. Rather than acting as a traditional proxy or VPN, it works by manipulating network packets and traffic streams locally to confuse or disrupt DPI mechanisms used by ISPs and network filters, making it possible to access restricted websites and services without relaying traffic through third-party servers. Zapret implements multiple...

Downloads: 167 This Week

Last Update: 2026-03-12
See Project
7

Ultralytics

Ultralytics YOLO

Ultralytics is a comprehensive computer vision framework that provides state-of-the-art implementations of the YOLO (You Only Look Once) family of models, enabling developers to perform tasks such as object detection, segmentation, classification, tracking, and pose estimation within a unified system. It is designed to be fast, accurate, and easy to use, offering both command-line and Python-based interfaces for training, validation, and deployment of machine learning models. The framework supports a full end-to-end workflow, including dataset preparation, model training, evaluation, and export to various deployment formats. ...

Downloads: 0 This Week

Last Update: 1 day ago
See Project
8

imgclsmob Deep learning networks

Sandbox for training deep learning networks

...The project serves as a sandbox for training and evaluating a wide variety of neural network architectures used in image analysis. It includes implementations of models used for tasks such as image classification, object detection, semantic segmentation, and pose estimation. The repository also contains scripts that help train models, evaluate performance, and convert trained networks between different frameworks. Several deep learning frameworks are supported, allowing researchers to experiment with architectures in different environments. The project is frequently used by developers who want to study modern convolutional neural network designs and compare their performance across datasets.

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
9

SlowFast

Video understanding codebase from FAIR for reproducing video models

...Together, these two pathways complement each other, allowing the network to model both appearance and motion without excessive computational cost. The architecture is modular and supports tasks like action recognition, temporal localization, and video segmentation, performing strongly on benchmarks like Kinetics and AVA. The repository provides training recipes, pretrained models, and distributed pipelines optimized for large-scale video datasets.

Downloads: 0 This Week

Last Update: 2026-03-16
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

DQO Data Quality Operations Center

Data Quality Operations Center

...DQO comes with around 100 predefined data quality checks which helps you monitor the quality of your data. Table and column-level checks which allows writing your own SQL queries. Daily and monthly date partition testing. Data segmentation by up to 9 different data streams. Build-in scheduling. Calculation of data quality KPIs which can be displayed on multiple built-in data quality dashboards.

Downloads: 0 This Week

Last Update: 2026-01-05
See Project
11

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything

...The software integrates an AI-powered labeling engine that allows users to generate annotations automatically with the assistance of modern vision models such as Segment Anything and various object detection frameworks. It supports labeling tasks across images and videos and enables developers to prepare training datasets for tasks such as object detection, segmentation, classification, tracking, and pose estimation. The tool is built with an interactive graphical interface that simplifies annotation workflows and allows users to draw and edit labels directly on visual data. It also supports a wide range of export formats compatible with popular machine learning pipelines, making it easier to integrate with training frameworks.

Downloads: 102 This Week

Last Update: 2026-06-19
See Project
12

CO3D (Common Objects in 3D)

Tooling for the Common Objects In 3D dataset

...It builds upon the original CO3Dv1 dataset, expanding both scale and quality—featuring 2× more sequences and 4× more frames, with improved image fidelity, more accurate segmentation masks, and enhanced annotations for object-centric 3D reconstruction. CO3Dv2 enables research in multi-view 3D reconstruction, novel view synthesis, and geometry-aware representation learning. Each of the thousands of sequences in CO3Dv2 captures a common object (from categories like cars, chairs, or plants) from multiple real-world viewpoints. ...

Downloads: 1 This Week

Last Update: 6 days ago
See Project
13

Advanced AI explainability for PyTorch

Advanced AI Explainability for computer vision

...These visualization techniques allow developers and researchers to better understand how convolutional neural networks and transformer-based vision models make predictions. The library supports a wide variety of tasks including image classification, object detection, semantic segmentation, and similarity analysis. It also provides metrics and evaluation tools that help measure the reliability and quality of the generated explanations. By integrating easily with PyTorch models, the library allows developers to diagnose model errors, detect biases in datasets, and improve model transparency.

Downloads: 0 This Week

Last Update: 2026-06-14
See Project
14

Kubernetes Network Policy Recipes

Example recipes for Kubernetes Network Policies that you can just copy

...These recipes help secure Kubernetes clusters by ensuring that pods communicate only with allowed peers, reduce attack surfaces, and enforce least-privilege connectivity at the network layer. The recipes scale from simple “deny all traffic by default” policies to more advanced micro-segmentation patterns for multi-tier apps (frontend/backends) and platform-level isolation for CI/CD systems or service meshes.

Downloads: 0 This Week

Last Update: 2025-12-12
See Project
15

Segment Anything

Provides code for running inference with the SegmentAnything Model

Segment Anything (SAM) is a foundation model for image segmentation that’s designed to work “out of the box” on a wide variety of images without task-specific fine-tuning. It’s a promptable segmenter: you guide it with points, boxes, or rough masks, and it predicts high-quality object masks consistent with the prompt. The architecture separates a powerful image encoder from a lightweight mask decoder, so the heavy vision work can be computed once and the interactive part stays fast. ...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
16

BlenderProc

Blender pipeline for photorealistic training image generation

...In general, one run of your script first loads or constructs a 3D scene, then sets some camera poses inside this scene and renders different types of images (RGB, distance, semantic segmentation, etc.) for each of those camera poses. Usually, you will run your script multiple times, each time producing a new scene and rendering e.g. 5-20 images from it. With a little more experience, it is also possible to change scenes during a single script call, read here how this is done. As blenderproc runs in blenders separate python environment, debugging your blenderproc script cannot be done in the same way as with any other python script.

Downloads: 1 This Week

Last Update: 2024-10-22
See Project
17

DeepDetect

Deep Learning API and Server in C++14 support for Caffe, PyTorch

...While the Open Source Deep Learning Server is the core element, with REST API, and multi-platform support that allows training & inference everywhere, the Deep Learning Platform allows higher level management for training neural network models and using them as if they were simple code snippets. Ready for applications of image tagging, object detection, segmentation, OCR, Audio, Video, Text classification, CSV for tabular data and time series. Neural network templates for the most effective architectures for GPU, CPU, and Embedded devices. Training in a few hours and with small data thanks to 25+ pre-trained models. Full Open Source, with an ecosystem of tools (API clients, video, annotation, ...) ...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
18

InternGPT

Open source demo platform where you can easily showcase your AI models

...Unlike traditional chat systems that rely solely on text prompts, InternGPT allows users to interact with visual content using both language and nonverbal signals such as pointing or highlighting objects within images. The framework connects multiple specialized AI models that perform tasks such as object detection, segmentation, captioning, and visual editing while coordinating them through a central conversational interface. This architecture enables the system to plan actions, execute visual operations, and return results in a coherent dialogue with the user.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
19

SAHI

A lightweight vision library for performing large object detection

A lightweight vision library for performing large-scale object detection & instance segmentation. Object detection and instance segmentation are by far the most important fields of applications in Computer Vision. However, detection of small objects and inference on large images are still major issues in practical usage. Here comes the SAHI to help developers overcome these real-world problems with many vision utilities.

Downloads: 0 This Week

Last Update: 2026-06-08
See Project
20

NetBird

Connect your devices into a secure WireGuard-based overlay network

NetBird is an open-source solution that builds a WireGuard®-based overlay network enabling seamless, encrypted peer-to-peer connectivity without the complexity of firewall rules, port forwarding, or centralized VPN gateways. It integrates access control features such as SSO and MFA for secure, policy-driven networking. Every machine in the network runs NetBird Agent (or Client) that manages WireGuard. Every agent connects to Management Service that holds network state, manages peer IPs, and...

Downloads: 36 This Week

Last Update: 12 hours ago
See Project
21

Wink-NLP

Developer friendly Natural Language Processing

Wink-NLP is a lightweight and fast natural language processing library for JavaScript, optimized for browser and Node.js environments.

Downloads: 0 This Week

Last Update: 2025-06-30
See Project
22

Hazm

Persian NLP Toolkit

Hazm is a natural language processing (NLP) library for Persian text, offering various tools for text preprocessing, tokenization, part-of-speech tagging, and more.

Downloads: 0 This Week

Last Update: 2026-04-01
See Project
23

DwarFS

A fast high compression read-only file system for Linux, Windows

The Deduplicating Warp-speed Advanced Read-only File System. A fast high compression read-only file system for Linux and Windows. DwarFS is a read-only file system with a focus on achieving very high compression ratios in particular for very redundant data. This probably doesn't sound very exciting, because if it's redundant, it should compress well. However, I found that other read-only, compressed file systems don't do a very good job at making use of this redundancy. See here for a...

Downloads: 39 This Week

Last Update: 2026-04-01
See Project
24

CleanVision

Automatically find issues in image datasets

...CleanVision helps you automatically identify common types of data issues lurking in image datasets. This package currently detects issues in the raw images themselves, making it a useful tool for any computer vision task such as: classification, segmentation, object detection, pose estimation, keypoint detection, generative modeling, etc.

Downloads: 0 This Week

Last Update: 2026-01-05
See Project
25

CoolPotOS

CoolPotOS for ia32 / amd64

...Written in C and Assembly, it focuses on bootstrapping, kernel entry, and simple console output. Though in early stages, CoolPotOS showcases the boot process, memory segmentation, and essential CPU features, serving as a great learning resource for those new to operating system internals.

Downloads: 2 This Week

Last Update: 2026-03-16
See Project