Showing 12 open source projects for "segmentation"

  • 1
    Segmentation Models

    Segmentation models with pretrained backbones, implemented in PyTorch.

    A high-level API (just two lines to create a neural network). 9 model architectures for binary and multiclass segmentation (including the legendary Unet). 124 available encoders (and 500+ encoders from timm). All encoders have pretrained weights for faster and better convergence. Popular metrics and losses for training routines.
    Downloads: 0 This Week
  • 2
    MediaPipe

    Cross-platform, customizable ML solutions for live and streaming media

    MediaPipe offers open-source, cross-platform, customizable ML solutions for live and streaming media. Provides segmentation masks for prominent humans in the scene. MediaPipe Face Mesh is a face geometry solution that estimates 468 3D face landmarks in real time, even on mobile devices. It employs machine learning (ML) to infer the 3D surface geometry, requiring only a single camera input without the need for a dedicated depth sensor. Utilizing lightweight model architectures together with GPU acceleration throughout the pipeline, the solution delivers real-time performance critical for live experiences. ...
    Downloads: 80 This Week
  • 3
    Viral-Clips-Crew

    Your CrewAI Powered Video Editing Assistant

    ...It analyzes transcripts and video data to identify the most engaging or “viral” moments, reducing the need for manual editing. The system integrates tools like FFmpeg and AI models to handle segmentation, cropping, and formatting for vertical video platforms. It supports automation workflows that allow creators to produce multiple clips efficiently at scale. The project focuses on content repurposing, helping users adapt long videos into formats suitable for platforms like TikTok and YouTube Shorts. Its modular design allows customization of each processing stage, including selection logic and visual formatting. ...
    Downloads: 0 This Week
  • 4
    Av1an

    Cross-platform command-line AV1 / VP9 / HEVC / H264 encoding framework

    Av1an is a command-line encoding framework designed to efficiently encode video into the AV1 format using distributed and parallel processing techniques. It acts as an orchestrator that splits video files into chunks, processes them in parallel using encoders like SVT-AV1, AOM, or rav1e, and then merges the results into a final output. The tool is optimized for high-quality encoding while reducing total processing time through multi-core and multi-machine support. It includes scene detection...
    Downloads: 4 This Week
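
The split, parallel-encode, merge flow described for Av1an can be sketched roughly as follows. This is only an illustration of the general pattern, not Av1an's actual code: `split_into_chunks`, `encode_chunk`, and `encode_parallel` are hypothetical names, the scene cuts are supplied by hand rather than detected, and `encode_chunk` merely returns a made-up output file name where a real tool would launch an encoder process.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(total_frames, scene_cuts):
    """Return (start, end) frame ranges bounded by the given scene cuts."""
    cuts = sorted(c for c in set(scene_cuts) if 0 < c < total_frames)
    bounds = [0] + cuts + [total_frames]
    return list(zip(bounds[:-1], bounds[1:]))


def encode_chunk(chunk):
    """Hypothetical stand-in for running an encoder on one chunk.

    A real implementation would start an external encoder process here;
    this placeholder only builds an output file name for the chunk.
    """
    start, end = chunk
    return f"chunk_{start:06d}_{end:06d}.ivf"


def encode_parallel(total_frames, scene_cuts, workers=4):
    """Encode all chunks concurrently and return outputs in merge order."""
    chunks = split_into_chunks(total_frames, scene_cuts)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() yields results in input order, so the list is ready to merge.
        return list(pool.map(encode_chunk, chunks))


if __name__ == "__main__":
    print(encode_parallel(1000, [250, 600]))
```

Threads are used here on the assumption that each worker would mostly wait on an external encoder process; the ordered result list is what a final merge step would concatenate.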
  • 5

    Esegui SB

    Flexible video encoding script supporting multiple formats and codecs.

    ...Key features include: Flexible Video and Audio Encoding: Utilizes FFmpeg's libraries for various codecs and settings. Subtitle Handling: Include or exclude subtitles as needed. Customizable Settings: Adjust CRF values and aspect ratios to suit your needs. Video Segmentation and Merging: Split videos into segments and merge them back together. Track Disposition Management: Set default tracks for audio, video, and subtitles. Audio Track Extraction: Extract and encode audio tracks independently. Audio Normalization: Adjust audio levels to ensure consistent volume across tracks. The script leverages FFmpeg's built-in libraries and tools for these functions. ...
    Downloads: 0 This Week
  • 6
    Golang HLS Streamer

    A server that exposes a directory for video streaming

    Golang HLS Streamer is a Go-based implementation of HTTP Live Streaming (HLS) functionality, designed to handle media segmentation and playlist generation for streaming applications. It provides tools for creating and managing HLS streams, including segmenting video into smaller chunks and generating M3U8 playlists. The project is intended for developers building streaming servers or media delivery systems. It focuses on performance and simplicity, leveraging Go’s concurrency model to handle streaming tasks efficiently. gohls can be integrated into backend services to enable adaptive streaming workflows. ...
    Downloads: 0 This Week
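
To make the playlist-generation step concrete, here is a minimal sketch of how an M3U8 media playlist for already-segmented video might be written. It is in Python for illustration and is not taken from the Go project; `build_media_playlist` and the segment file names are hypothetical. The tags used (`#EXTM3U`, `#EXT-X-VERSION`, `#EXT-X-TARGETDURATION`, `#EXT-X-MEDIA-SEQUENCE`, `#EXTINF`, `#EXT-X-ENDLIST`) are standard HLS media playlist tags.

```python
import math


def build_media_playlist(segments, media_sequence=0):
    """Build an HLS media playlist from (uri, duration_seconds) pairs."""
    if not segments:
        raise ValueError("at least one segment is required")
    # Rounding the longest segment up gives a conservative target duration.
    target = max(math.ceil(duration) for _, duration in segments)
    lines = [
        "#EXTM3U",
        "#EXT-X-VERSION:3",
        f"#EXT-X-TARGETDURATION:{target}",
        f"#EXT-X-MEDIA-SEQUENCE:{media_sequence}",
    ]
    for uri, duration in segments:
        lines.append(f"#EXTINF:{duration:.3f},")
        lines.append(uri)
    # ENDLIST marks a finished (on-demand) playlist; a live one would omit it.
    lines.append("#EXT-X-ENDLIST")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_media_playlist([("seg0.ts", 6.0), ("seg1.ts", 4.5)]))
```

A server exposing a directory for streaming would typically serve this text alongside the listed segment files.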
  • 7

    avio

    Python version of ffplay with built-in AI

    See the Files tab above for installation instructions
    Downloads: 1 This Week
  • 8
    Robust Video Matting (RVM)

    Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX

    ...Unlike most existing methods that perform video matting frame-by-frame as independent images, our method uses a recurrent architecture to exploit temporal information in videos and achieves significant improvements in temporal coherence and matting quality. Furthermore, we propose a novel training strategy that enforces our network on both matting and segmentation objectives. This significantly improves our model's robustness. Our method does not require any auxiliary inputs such as a trimap or a pre-captured background image, so it can be widely applied to existing human matting applications. RVM is specifically designed for robust human video matting.
    Downloads: 12 This Week
  • 9

    avimmir

    (audio, video, image) Multimedia Multimodal Information Retrieval

    audio classification; speaker segmentation; speaker clustering; speaker recognition; spoken document retrieval; image retrieval; video retrieval; etc.
    Downloads: 0 This Week
  • 10
    Fully automated segmentation of moving objects from a stationary scene using pre-stored scene images. Needs no external parameters or settings; all parameters are derived internally from the pre-stored images. Implemented using the OpenCV framework.
    Downloads: 0 This Week
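
One common way to segment moving objects against a stationary scene, with thresholds derived from stored background images, is a per-pixel statistical background model. The sketch below illustrates that general idea only; the listing does not say which algorithm the project uses, and it relies on plain Python lists rather than OpenCV. The function names and the constants `k` and `min_sigma` are assumptions made for the example.

```python
from statistics import mean, pstdev


def build_background_model(background_frames):
    """Per-pixel (mean, std) over the stored grayscale scene images."""
    height = len(background_frames[0])
    width = len(background_frames[0][0])
    model = []
    for y in range(height):
        row = []
        for x in range(width):
            samples = [frame[y][x] for frame in background_frames]
            row.append((mean(samples), pstdev(samples)))
        model.append(row)
    return model


def segment_foreground(frame, model, k=3.0, min_sigma=1.0):
    """Return a binary mask: 1 where a pixel deviates from the background.

    The threshold for each pixel is k standard deviations of that pixel's
    background samples; min_sigma (an assumed floor) avoids a zero
    threshold when the stored images are identical at that pixel.
    """
    return [
        [
            1 if abs(pixel - mu) > k * max(sigma, min_sigma) else 0
            for pixel, (mu, sigma) in zip(frame_row, model_row)
        ]
        for frame_row, model_row in zip(frame, model)
    ]


if __name__ == "__main__":
    stored = [
        [[10, 10], [10, 10]],
        [[12, 12], [12, 12]],
        [[11, 11], [11, 11]],
    ]
    model = build_background_model(stored)
    print(segment_foreground([[11, 200], [12, 11]], model))
```

Because the per-pixel spread comes from the stored images themselves, the caller supplies no scene-specific settings, which matches the spirit of the description above.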
  • 11
    This project is concerned with the colorization of black-and-white movies. It will include image segmentation, motion detection, artificial intelligence for original color estimation, color distribution, and some grid computing.
    Downloads: 0 This Week
  • 12
    The Mimas Toolkit is a C++ real-time computer vision library. Algorithms include edge/corner detection, object recognition/tracking, LSI filters, segmentation, array operators, convolution, etc. Object-oriented wrappers for LAPACK, libxine, V4L, and FFTW are provided.
    Downloads: 0 This Week