Showing 2 open source projects for "image text input"

  • 1
    Portkey AI Gateway

    A blazing fast AI Gateway with integrated guardrails

    Portkey AI Gateway aims to offer a blazing fast, secure, and flexible gateway for interacting with a wide variety of models and enforcing guardrails. It presents a single, friendly API through which you can route to 200+ LLMs, while applying configurable input/output guardrails to enforce policies or restrict certain content. It supports automatic retries, fallbacks, load balancing across providers or keys, and request timeouts to avoid latency spikes. The gateway is multimodal: it can handle text, vision, audio, and image models under a common interface. It also offers features for governance: role-based access, compliance with standards (SOC2, HIPAA, GDPR), secure key management, and logging/analytics of usage, latency, errors, and cost. ...
    Downloads: 0 This Week
  • 2
    TagForge

    Cross-platform AI tagging and prompt engineering studio

    TagForge is a professional, cross-platform desktop application for AI prompt engineering and image tagging. Built on .NET 9 and Avalonia UI, it provides a sleek, high-performance workspace for crafting perfect prompts for Stable Diffusion, FLUX, Midjourney, and other generative AI models.

    ## Key Capabilities

    - Multi-model tagging: Specialized generators for Stable Diffusion (tags), FLUX/Midjourney (prose), and custom LLM workflows
    - Multimodal vision: Extract metadata or generate descriptive captions directly from images
    - Contextual chat assistant: Persistent, coding-capable AI helper with real-time provider features
    - Persona system: Create custom identities with dynamic input injection and role-based templates
    - Chat rules: Modular behavioral controls (Concise, Detailed, etc.) with CRUD interface
    - Agent orchestrator: Centralized management of API keys, endpoints, and model parameters
    Downloads: 0 This Week