Showing 5 open source projects for "image processing in java"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Secure remote access solution to your private network, in the cloud or on-prem. Icon
    Secure remote access solution to your private network, in the cloud or on-prem.

    Deliver secure remote access with OpenVPN.

    OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
    Get started — no credit card required.
  • 1
    Warlock-Studio

    Warlock-Studio

    AI-suite for image and video upscaling and enhancement. v4.0.1

    Warlock-Studio is a powerful, open-source desktop application for Windows that integrates state-of-the-art AI models for video and image enhancement. This suite provides a unified, high-performance interface for upscaling, restoration, and frame interpolation, making advanced enhancement workflows accessible and efficient. Version 4.0.1 continues this evolution with enhanced AI architecture and improved code stability for even better performance and reliability. Note: The SuperResolution-10...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    MediaPipe Face Detection

    MediaPipe Face Detection

    Detect faces in an image

    The MediaPipe Face Detection model is a high-performance, real-time face detection solution that uses machine learning to identify faces in images and video streams. It is optimized for mobile and embedded platforms, offering fast and accurate face detection while maintaining a small memory footprint. This model supports multiple face detections and is highly efficient, making it suitable for a variety of applications such as augmented reality, user authentication, and facial expression analysis.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Nanonets-OCR-s

    Nanonets-OCR-s

    State-of-the-art image-to-markdown OCR model

    Nanonets-OCR-s is an advanced image-to-markdown OCR model that transforms documents into structured and semantically rich markdown. It goes beyond basic text extraction by intelligently recognizing content types and applying meaningful tags, making the output ideal for Large Language Models (LLMs) and automated workflows. The model expertly converts mathematical equations into LaTeX syntax, distinguishing between inline and display modes for accuracy. It also generates descriptive <img> tags...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ERNIE-4.5-VL-28B-A3B-Base-PT

    ERNIE-4.5-VL-28B-A3B-Base-PT

    Pretrained multimodal MoE model for complex text and vision tasks

    ERNIE-4.5-VL-28B-A3B-Base-PT is a large-scale multimodal Mixture-of-Experts (MoE) model developed by Baidu, featuring 28 billion total parameters and 3 billion activated per token. It is pretrained to handle both text and image inputs, enabling it to excel in image-to-text and conversational AI tasks. The model uses a staged training strategy—starting with text-only training and then integrating vision components using ViT, adapters, and visual experts for robust cross-modal understanding...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    ERNIE-4.5-VL-424B-A47B-Base-PT

    ERNIE-4.5-VL-424B-A47B-Base-PT

    Multimodal MoE model fine-tuned for text and visual comprehension

    ERNIE-4.5-VL-424B-A47B-Base-PT is a powerful multimodal Mixture-of-Experts (MoE) model developed by Baidu and fine-tuned for enhanced performance across both text and visual tasks. It builds upon the pretraining of ERNIE 4.5, using modality-specific post-training techniques to optimize for general-purpose natural language processing and visual-language reasoning. The model employs a heterogeneous MoE architecture with modality-isolated routing and loss-balancing mechanisms to ensure efficient...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.