Compare the Top AI Models that integrate with Adobe Firefly as of April 2026

This is a list of AI Models that integrate with Adobe Firefly. Use the filters on the left to narrow the results to products that have integrations with Adobe Firefly. View the products that work with Adobe Firefly in the table below.

What are AI Models for Adobe Firefly?

AI models are systems designed to simulate human intelligence by learning from data and solving complex tasks. They include specialized types like Large Language Models (LLMs) for text generation, image models for visual recognition and editing, and video models for processing and analyzing dynamic content. These models power applications such as chatbots, facial recognition, video summarization, and personalized recommendations. Their capabilities rely on advanced algorithms, extensive training datasets, and robust computational resources. AI models are transforming industries by automating processes, enhancing decision-making, and enabling creative innovations. Compare and read user reviews of the best AI Models for Adobe Firefly currently available using the table below. This list is updated regularly.

  • 1
    Nano Banana Pro
    Nano Banana Pro is Google DeepMind’s advanced evolution of the original Nano Banana, designed to deliver studio-quality image generation with far greater accuracy, text rendering, and world knowledge. Built on Gemini 3 Pro, it brings improved reasoning capabilities that help users transform ideas into detailed visuals, diagrams, prototypes, and educational content. It produces highly legible multilingual text inside images, making it ideal for posters, logos, storyboards, and international designs. The model can also ground images in real-time information, pulling from Google Search to create infographics for recipes, weather data, or factual explanations. With powerful consistency controls, Nano Banana Pro can blend up to 14 images and maintain recognizable details across multiple people or elements. Its enhanced creative editing tools let users refine lighting, adjust focus, manipulate camera angles, and produce final outputs in up to 4K resolution.
  • 2
    Chatterbox

    Resemble AI

    Chatterbox is a free, open source voice cloning AI model developed by Resemble AI, licensed under MIT. It enables zero-shot voice cloning using just 5 seconds of reference audio, eliminating the need for training. The model offers expressive speech synthesis with unique emotion control, allowing users to adjust the intensity from monotone to dramatically expressive with a single parameter. Chatterbox supports accent control and text-based controllability, ensuring high-quality, human-like text-to-speech conversion. It operates with faster-than-real-time inference, making it suitable for real-time applications, voice assistants, and interactive media. The model is built for production and designed for developers, featuring simple installation via pip and comprehensive documentation. Chatterbox includes built-in watermarking using Resemble AI’s PerTh (Perceptual Threshold) Watermarker, embedding data imperceptibly to protect generated audio content.
    Starting Price: $5 per month
  • 3
    GPT-Image-1
    OpenAI's Image Generation API, powered by the gpt-image-1 model, enables developers and businesses to integrate high-quality, professional-grade image generation directly into their tools and platforms. This model offers versatility, allowing it to create images across diverse styles, faithfully follow custom guidelines, leverage world knowledge, and accurately render text, unlocking countless practical applications across multiple domains. Leading enterprises and startups across industries, including creative tools, ecommerce, education, enterprise software, and gaming, are already using image generation in their products and experiences. It gives creators the choice and flexibility to experiment with different aesthetic styles. Users can generate and edit images from simple prompts, adjusting styles, adding or removing objects, expanding backgrounds, and more.
    Starting Price: $0.19 per image
  • 4
    Gemini 3 Pro Image
    Gemini 3 Pro Image is a high-capability, multimodal image-generation and editing system that lets users create, transform, and refine visuals through natural-language prompts or by combining multiple input images. It supports consistent character and object appearance across edits, precise local transformations (such as background blur, object removal, style transfer, or pose changes), and native world-knowledge understanding for context-aware results. Multi-image fusion merges several photo inputs into a cohesive new image, and its design workflow features include template-based outputs, brand-asset consistency, and repeated character or person-style appearances across scenes. It includes digital watermarking to tag AI-generated imagery and is available through the Gemini API, Google AI Studio, and Vertex AI.
  • 5
    GPT Image 1.5
    GPT Image 1.5 is OpenAI’s state-of-the-art image generation model built for precise, high-quality visual creation. It supports both text and image inputs and produces image or text outputs with strong adherence to prompts. The model improves instruction following, enabling more accurate image generation and editing results. GPT Image 1.5 is designed for professional and creative use cases that require reliability and visual consistency. It is available through multiple API endpoints, including image generation and image editing. Pricing is token-based, with separate rates for text and image inputs and outputs. GPT Image 1.5 offers a powerful foundation for developers building image-focused applications.
  • 6
    Gen-4

    Runway

    Runway Gen-4 is a next-generation AI model that transforms how creators generate consistent media content, from characters and objects to entire scenes and videos. It allows users to create cohesive, stylized visuals that maintain consistent elements across different environments, lighting, and camera angles, all with minimal input. Whether for video production, VFX, or product photography, Gen-4 provides unparalleled control over the creative process. The platform simplifies the creation of production-ready videos, offering dynamic and realistic motion while ensuring subject consistency across scenes, making it a powerful tool for filmmakers and content creators.
  • 7
    Nano Banana
    Nano Banana is Gemini’s fast, accessible image-creation model designed for quick, playful, and casual creativity. It lets users blend photos, maintain character consistency, and make small local edits with ease. The tool is perfect for transforming selfies, reimagining pictures with fun themes, or combining two images into one. With its ability to handle stylistic changes, it can turn photos into figurine-style designs, retro portraits, or aesthetic makeovers using simple prompts. Nano Banana makes creative experimentation easy and enjoyable, requiring no advanced skills or complex controls. It’s the ideal starting point for users who want simple, fast, and imaginative image editing inside the Gemini app.
  • 8
    Nano Banana 2
    Nano Banana 2 is Google DeepMind’s latest image generation model, combining the advanced capabilities of Nano Banana Pro with the high-speed performance of Gemini Flash. It delivers improved world knowledge, enabling more accurate subject rendering and data-driven visuals grounded in real-time information. The model enhances precision text rendering and translation, making it ideal for marketing assets, infographics, and localized content. Users benefit from stronger instruction following, ensuring complex prompts are captured accurately. Nano Banana 2 supports subject consistency across multiple characters and objects within a single workflow. It offers production-ready output with customizable aspect ratios and resolutions up to 4K. Available across Gemini, Search, AI Studio, Google Cloud, and more, Nano Banana 2 brings high-quality visual generation at lightning-fast speed.
  • 9
    Gemini 3.1 Flash Image
    Gemini 3.1 Flash Image is Google DeepMind’s latest image generation model, combining advanced Pro-level capabilities with lightning-fast performance. It delivers enhanced world knowledge, enabling more accurate subject rendering and data-informed visuals grounded in real-time information. The model improves precision text rendering and in-image translation, making it well-suited for marketing assets, infographics, and localized creative content. Stronger instruction following ensures complex prompts are executed with clarity and accuracy. Gemini 3.1 Flash Image maintains subject consistency across multiple characters and objects within a single workflow. It supports production-ready outputs with customizable aspect ratios and resolutions up to 4K. Available across Gemini, Search, AI Studio, Google Cloud, and more, it brings high-quality visual generation at Flash-level speed.
  • 10
    Imagen 4

    Google

    Imagen 4 is Google's most advanced image generation model, designed for creativity and photorealism. With improved clarity, sharper image details, and better typography, it allows users to bring their ideas to life faster and more accurately than ever before. It supports photo-realistic generation of landscapes, animals, and people, and offers a diverse range of artistic styles, from abstract to illustration. The new features also include ultra-fast processing, enhanced color rendering, and a mode for up to 10x faster image creation. Imagen 4 can generate images at up to 2K resolution, providing exceptional clarity and detail, making it ideal for both artistic and practical applications.
  • 11
    FLUX.2

    Black Forest Labs

    FLUX.2 is built for real production workflows, delivering high-quality visuals while maintaining character, product, and style consistency across multiple reference images. It handles structured prompts, brand-safe layouts, complex text rendering, and detailed logos with precision. The model supports multi-reference inputs, editing at up to 4 megapixels, and generates both photorealistic scenes and highly stylized compositions. With a focus on reliability, FLUX.2 processes real-world creative tasks—such as infographics, product shots, and UI mockups—with exceptional stability. It represents Black Forest Labs’ open-core approach, pairing frontier-level capability with open-weight models that invite experimentation. Across its variants, FLUX.2 provides flexible options for studios, developers, and researchers who need scalable, customizable visual intelligence.
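
The Chatterbox entry above mentions pip installation, zero-shot cloning from a short reference clip, and a single parameter for emotion intensity. The following is a minimal sketch of how those pieces might fit together, not an official example. It assumes the package is installed with `pip install chatterbox-tts` and exposes `ChatterboxTTS.from_pretrained` and `generate(..., audio_prompt_path=..., exaggeration=...)` as the project's README describes; check the current documentation before relying on it. The `clamp_exaggeration` helper, its 0.0 to 1.0 bounds, and the file names are hypothetical and chosen only for illustration.

```python
from pathlib import Path


def clamp_exaggeration(value: float, low: float = 0.0, high: float = 1.0) -> float:
    """Hypothetical helper: keep the emotion-intensity value inside assumed bounds.

    The 0.0-1.0 range is an assumption for this sketch, not a documented limit.
    """
    return max(low, min(high, value))


def clone_voice(
    text: str,
    reference_wav: str,
    out_path: str = "cloned.wav",
    exaggeration: float = 0.5,
    device: str = "cpu",
) -> Path:
    """Synthesize `text` in the voice of `reference_wav` (a few seconds of audio)."""
    # Imports are deferred so the helper above can be used without the
    # model package (and its weights) being installed.
    import torchaudio
    from chatterbox.tts import ChatterboxTTS

    # Downloads pretrained weights on first use; no per-voice training step.
    model = ChatterboxTTS.from_pretrained(device=device)

    # Lower exaggeration tends toward flat delivery, higher toward expressive.
    wav = model.generate(
        text,
        audio_prompt_path=reference_wav,
        exaggeration=clamp_exaggeration(exaggeration),
    )
    torchaudio.save(out_path, wav, model.sr)
    return Path(out_path)
```

Calling `clone_voice("Welcome back.", "my_voice_sample.wav", exaggeration=0.8)` would write `cloned.wav` under these assumptions; the first call needs network access to fetch the model weights.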