Showing 2 open source projects for "image text input"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    Portkey AI Gateway

    Portkey AI Gateway

    A blazing fast AI Gateway with integrated guardrails

    Portkey AI Gateway aims to offer a blazing fast, secure, and flexible gateway for interacting with a wide variety of models and enforcing guardrails. It presents a single, friendly API through which you can route to 200+ LLMs, while applying configurable input/output guardrails to enforce policies or restrict certain content. It supports automatic retries, fallbacks, load balancing across providers or keys, and request timeouts to avoid latency spikes. The gateway is multimodal: it can handle text, vision, audio, and image models under a common interface. It also offers features for governance: role-based access, compliance with standards (SOC2, HIPAA, GDPR), secure key management, and logging/analytics of usage, latency, errors, and cost. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    TagForge

    TagForge

    Cross-platform AI tagging and prompt engineering studio

    TagForge is a professional, cross-platform desktop application for AI prompt engineering and image tagging. Built on .NET 9 and Avalonia UI, it provides a sleek, high-performance workspace for crafting perfect prompts for Stable Diffusion, FLUX, Midjourney, and other generative AI models. ## Key Capabilities - Multi-model tagging: Specialized generators for Stable Diffusion (tags), FLUX/Midjourney (prose), and custom LLM workflows - Multimodal vision: Extract metadata or generate descriptive captions directly from images - Contextual chat assistant: Persistent, coding-capable AI helper with real-time provider features - Persona system: Create custom identities with dynamic input injection and role-based templates - Chat rules: Modular behavioral controls (Concise, Detailed, etc.) with CRUD interface - Agent orchestrator: Centralized management of API keys, endpoints, and model parameters
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB