104 Integrations with OpenRouter

Browse the software that integrates with OpenRouter below. Compare the integrations by features, ratings, user reviews, and pricing. Here are the current OpenRouter integrations in 2026:

  • 1
    GLM-4.6

    Zhipu AI

    GLM-4.6 improves on its predecessor with stronger reasoning, coding, and agentic capabilities: it demonstrates clear improvements in inferential performance, supports tool use during inference, and integrates more effectively into agent frameworks. In benchmark tests spanning reasoning, coding, and agents, GLM-4.6 outperforms GLM-4.5 and shows competitive strength against models such as DeepSeek-V3.2-Exp and Claude Sonnet 4, though it still trails Claude Sonnet 4.5 in pure coding performance. In real-world tests using an extended “CC-Bench” suite across front-end development, tool building, data analysis, and algorithmic tasks, GLM-4.6 beats GLM-4.5 and approaches parity with Claude Sonnet 4, winning ~48.6% of head-to-head comparisons, while also achieving ~15% better token efficiency. GLM-4.6 is available via the Z.ai API, and developers can integrate it as an LLM backend or agent core using the platform’s API.
    Starting Price: Free
  • 2
    Gemini Enterprise
    Gemini Enterprise is a comprehensive AI platform built by Google Cloud designed to bring the full power of Google’s advanced AI models, agent-creation tools, and enterprise-grade data access into everyday workflows. The solution offers a unified chat interface that lets employees interact with internal documents, applications, data sources, and custom AI agents. At its core, Gemini Enterprise comprises six key components: the Gemini family of large multimodal models, an agent orchestration workbench (formerly Google Agentspace), pre-built starter agents, robust data-integration connectors to business systems, extensive security and governance controls, and a partner ecosystem for tailored integrations. It is engineered to scale across departments and enterprises, enabling users to build no-code or low-code agents that automate tasks, such as research synthesis, customer support response, code assist, contract analysis, and more, while operating within corporate compliance standards.
    Starting Price: $21 per month
  • 3
    Novelcrafter

    Novelcrafter

    Novelcrafter is an AI-powered writing platform designed to support authors throughout the entire storytelling process, from brainstorming and character building to drafting, reviewing, and exporting their work. The platform offers a dedicated “Codex” wiki where writers can catalogue characters, locations, lore, and world-building elements, ensuring consistency and easy reference. It supports structured planning modes (acts, chapters, scenes) and integrates seamlessly with the writing interface so authors can switch fluidly between planning and prose. AI tools are optional and fully customisable: authors can connect their own API keys (OpenAI, Claude, local LLMs) and set up tailored prompts, or work manually without AI support. Novelcrafter includes a distraction-free write mode, revision history, import/export of documents (Word, Markdown, HTML), and mobile compatibility for writing on the go.
    Starting Price: $4.64 per month
  • 4
    Scraib

    Scraib

    Scraib.app is an AI-powered writing partner built for macOS that lives in the menu bar and enables you to select any text in any application on your Mac, press Control + R, and instantly rewrite that text with improved grammar, clarity, and style. You can define custom rules to match your tone and style, and unlike standalone writing editors, Scraib works “in the flow” across any app, from Slack and Outlook to Pages, Word, Chrome, and Figma. It offers a high degree of privacy control; you can run it through your own AI provider (ChatGPT, Claude, Gemini, Ollama, OpenRouter, etc.), use your own API key, or even run it locally with supported models so that your data stays fully private. It is designed for minimal disruption; no switching to external tools, just a shortcut-based workflow to rewrite text where it already lives.
    Starting Price: $3.99 per month
  • 5
    Apollo

    Liquid AI

    Apollo is a lightweight mobile application designed for fully on-device, cloud-free AI interactions, enabling users to engage with advanced language and vision models securely, privately, and with low latency. It supports a library of small foundation models from the company’s LEAP platform, allowing users to draft messages, emails, chat with a private AI assistant, craft digital characters, or use image-to-text capabilities, all without an internet connection and with no data leaving the device. Apollo is optimized for real-time responsiveness and offline operation, ensuring that inference happens entirely locally, with no API calls, servers, or user-data logging involved. It serves as both a personal AI playground and a testing bed for developers using LEAP models, letting one “vibe-check” how a model performs on their own mobile hardware before broader deployment.
    Starting Price: Free
  • 6
    GLM-4.6V

    Zhipu AI

    GLM-4.6V is a state-of-the-art open source multimodal vision-language model from the Z.ai (GLM-V) family designed for reasoning, perception, and action. It ships in two variants: a full-scale version (106B parameters) for cloud or high-performance clusters, and a lightweight “Flash” variant (9B) optimized for local deployment or low-latency use. GLM-4.6V supports a native context window of up to 128K tokens during training, enabling it to process very long documents or multimodal inputs. Crucially, it integrates native Function Calling, meaning the model can take images, screenshots, documents, or other visual media as input directly (without manual text conversion), reason about them, and trigger tool calls, bridging “visual perception” with “executable action.” This enables a wide spectrum of capabilities, including interleaved image-and-text content generation (for example, combining document understanding with text summarization or generation of image-annotated responses).
    Starting Price: Free
  • 7
    GLM-4.1V

    Zhipu AI

    GLM-4.1V is a compact, powerful vision-language model designed for reasoning and perception across images, text, and documents. The 9-billion-parameter variant (GLM-4.1V-9B-Thinking) is built on the GLM-4-9B foundation and enhanced through a specialized training paradigm using Reinforcement Learning with Curriculum Sampling (RLCS). It supports a 64k-token context window and accepts high-resolution inputs (up to 4K images, any aspect ratio), enabling it to handle complex tasks such as optical character recognition, image captioning, chart and document parsing, video and scene understanding, GUI-agent workflows (e.g., interpreting screenshots, recognizing UI elements), and general vision-language reasoning. In benchmark evaluations at the 10B-parameter scale, GLM-4.1V-9B-Thinking achieved top performance on 23 of 28 tasks.
    Starting Price: Free
  • 8
    GLM-4.5V-Flash
    GLM-4.5V-Flash is an open source vision-language model, designed to bring strong multimodal capabilities into a lightweight, deployable package. It supports image, video, document, and GUI inputs, enabling tasks such as scene understanding, chart and document parsing, screen reading, and multi-image analysis. Compared to larger models in the series, GLM-4.5V-Flash offers a compact footprint while retaining core VLM capabilities like visual reasoning, video understanding, GUI task handling, and complex document parsing. It can serve in “GUI agent” workflows, meaning it can interpret screenshots or desktop captures, recognize icons or UI elements, and assist with automated desktop or web-based tasks. Although it forgoes some of the largest-model performance gains, GLM-4.5V-Flash remains versatile for real-world multimodal tasks where efficiency, lower resource usage, and broad modality support are prioritized.
    Starting Price: Free
  • 9
    GLM-4.5V

    Zhipu AI

    GLM-4.5V builds on the GLM-4.5-Air foundation, using a Mixture-of-Experts (MoE) architecture with 106 billion total parameters and 12 billion active parameters. It achieves state-of-the-art performance among open-source VLMs of similar scale across 42 public benchmarks, excelling in image, video, document, and GUI-based tasks. It supports a broad range of multimodal capabilities, including image reasoning (scene understanding, spatial recognition, multi-image analysis), video understanding (segmentation, event recognition), complex chart and long-document parsing, GUI-agent workflows (screen reading, icon recognition, desktop automation), and precise visual grounding (e.g., locating objects and returning bounding boxes). GLM-4.5V also introduces a “Thinking Mode” switch, allowing users to choose between fast responses or deeper reasoning when needed.
    Starting Price: Free
  • 10
    GLM-4.7

    Zhipu AI

    GLM-4.7 is an advanced large language model designed to significantly elevate coding, reasoning, and agentic task performance. It delivers major improvements over GLM-4.6 in multilingual coding, terminal-based tasks, and real-world software engineering benchmarks such as SWE-bench and Terminal Bench. GLM-4.7 supports “thinking before acting,” enabling more stable, accurate, and controllable behavior in complex coding and agent workflows. The model also introduces strong gains in UI and frontend generation, producing cleaner webpages, better layouts, and more polished slides. Enhanced tool-using capabilities allow GLM-4.7 to perform more effectively in web browsing, automation, and agent benchmarks. Its reasoning and mathematical performance has improved substantially, showing strong results on advanced evaluation suites. GLM-4.7 is available via Z.ai, API platforms, coding agents, and local deployment for flexible adoption.
    Starting Price: Free
  • 11
    Repo Prompt

    Repo Prompt

    Repo Prompt is a macOS-native AI coding assistant and context engineering tool that helps developers interact with, refine, and modify codebases using large language models. Users select specific files or folders, build structured prompts with exactly the relevant context, and review and apply AI-generated code changes as diffs rather than rewriting entire files, ensuring precise, auditable modifications. It provides a visual file explorer for project navigation, an intelligent context builder, CodeMaps that reduce token usage and help models understand project structure, and multi-model support so users can bring their own API keys for providers like OpenAI, Anthropic, Gemini, Azure, or others, keeping all processing local and private unless the user explicitly sends code to an LLM. Repo Prompt works as both a standalone chat/workflow interface and an MCP (Model Context Protocol) server for integration with AI editors.
    Starting Price: $14.99 per month
  • 12
    Knolli

    Knolli

    Knolli is an AI copilot platform that enables users to build, launch, and scale custom AI copilots and agents without writing code by turning knowledge, documents, datasets, and proprietary content into interactive, conversational assistants. It provides a no-code workspace where creators, teams, and businesses can describe their idea in plain language and have Knolli automatically structure uploaded content into a usable AI copilot, organize and protect data securely with encrypted private knowledge bases, and connect to tools such as CRMs, file storage, and databases to pull in live data for context-aware responses. It supports multi-agent architecture to run specialized agents inside one copilot, pre-built templates for common use cases, custom branding and white-labeling, and advanced analytics so users can monitor performance, usage, and ROI. Knolli also offers workflow automation, letting copilots automate multi-step tasks and integrate with existing systems.
    Starting Price: $39 per month
  • 13
    TexTab

    TexTab

    TexTab is a macOS productivity application that lets users turn any AI-driven task into an instant keyboard shortcut, enabling powerful text processing and automation without switching apps. It operates at the system level, so you can select text in any macOS application (browsers, email clients, code editors, documents) and trigger AI actions with a single keystroke, turning tasks like translation, summarization, rewriting, or formalizing into one-press commands. Users can create unlimited custom AI actions with unique shortcuts and connect to multiple AI providers (such as OpenAI, Anthropic, Groq, Perplexity, or OpenRouter) using their own API keys, so the data stays private and costs are controlled; API calls go directly to the provider with no TexTab servers in between. It also includes a one-click AI prompt enhancer and native plugins such as a pop-up AI chat, QR code generator, image converter, and color picker.
    Starting Price: Free
  • 14
    Agent Zero

    Agent Zero

    Agent Zero is an open source AI agent framework designed to run autonomous AI assistants that can perform complex tasks by interacting directly with a computer system. It provides an environment where AI agents operate with real system access, allowing them to execute commands, write and run code, browse the web, analyze data, and manage workflows as part of real-world automation processes. Instead of functioning as a simple chat interface, Agent Zero runs in its own virtual environment where it can interact with the operating system, install tools, execute scripts, and coordinate tasks across multiple components. It emphasizes transparency and control, allowing developers to view, modify, and customize how the agent behaves, what tools it can access, and how it processes information. Agent Zero uses a modular architecture that allows the agent to dynamically create and use tools while maintaining persistent memory.
    Starting Price: $2.65 per month
  • 15
    AI SpendOps

    AI SpendOps

    We give engineering, finance, and FinOps teams a single platform to track, attribute, and optimise LLM API spend across every provider. Costs are broken down by dimensions you define, matching how your business already reports its financials. Engineering teams get frictionless cost tracking without slowing anything down. CTOs get a single pane of glass to enforce model governance and prevent shadow usage. CFOs get finance-grade reporting for forecasting, budgeting, and chargebacks, attributed using their own reporting structure. FinOps teams get real-time, multi-provider cost data that slots straight into the workflows they already run for cloud. If your organisation uses LLM APIs and the board is asking "what are we spending and why?" we're the answer.
    Starting Price: £199
  • 16
    Hermes Agent

    Nous Research

    Hermes Agent by Nous Research is an open-source autonomous AI agent designed to run locally on your server and improve over time. It operates independently from traditional chatbots by maintaining persistent memory and learning from past interactions. The agent can integrate with multiple platforms such as Slack, Discord, Telegram, and WhatsApp through a unified gateway. Hermes supports automation tasks like scheduling reports, managing workflows, and executing commands using natural language. It also enables parallel task execution through subagents, improving efficiency for complex operations. With built-in tools for web browsing, search, and code execution, it provides a versatile environment for various tasks. Overall, Hermes Agent acts as a continuously evolving AI system that adapts to user needs and workflows.
    Starting Price: Free
  • 17
    Octrafic

    Octrafic

    Octrafic is an open source, AI-powered command-line tool designed for automated API testing and exploration, allowing users to interact with APIs using natural language instead of writing scripts or configuring complex testing frameworks. By pointing the tool to any HTTP API or OpenAPI specification, users can describe what they want to test in plain English, and the built-in AI agent automatically generates test scenarios, executes real HTTP requests, validates responses, and produces structured results. It handles the full testing workflow, including endpoint discovery, request construction, schema validation, and error detection, enabling developers to focus on testing logic rather than low-level implementation details. It supports real-time execution against live APIs, ensuring accurate status codes and behavior without relying on mocks, and can export results into styled PDF reports for sharing with teams or stakeholders.
    Starting Price: Free
  • 18
    Trinity-Large-Thinking
    Trinity Large Thinking is a frontier open source reasoning model developed by Arcee AI, designed specifically for complex, multi-step problem solving and autonomous agent workflows that require long-horizon planning and tool use. Built on a sparse Mixture-of-Experts architecture with roughly 400 billion total parameters but only about 13 billion active per token, the model achieves high efficiency while maintaining strong reasoning performance across tasks such as mathematical problem solving, code generation, and multi-step analysis. It introduces extended chain-of-thought reasoning capabilities, allowing the model to generate intermediate “thinking traces” before producing final answers, which improves accuracy and reliability in complex scenarios. Trinity Large Thinking supports a very large context window of up to 262K tokens, enabling it to process long documents, maintain state across extended interactions, and operate effectively in continuous agent loops.
    Starting Price: Free
  • 19
    GPT-5.5

    OpenAI

    GPT-5.5 is an advanced AI model designed to handle complex, real-world tasks with greater autonomy and efficiency. It quickly understands user intent and can execute multi-step workflows such as coding, research, data analysis, and document creation with minimal guidance. Instead of requiring step-by-step instructions, GPT-5.5 plans tasks, uses tools, evaluates outputs, and continues working until completion. It excels in knowledge work, software development, and analytical problem-solving, helping users move from idea to execution faster. The model is built to operate across tools and environments, making it highly effective for modern digital workflows. With strong reasoning and persistence, GPT-5.5 enables individuals and teams to complete demanding work more efficiently and accurately.
    Starting Price: $5 per 1M tokens (input)
  • 20
    GPT-5.5 Pro
    GPT-5.5 Pro is an advanced AI model designed to handle complex, real-world work with greater autonomy and efficiency. It understands user intent quickly and can execute multi-step tasks such as coding, research, data analysis, and document creation with minimal guidance. The model is built to plan, use tools, and refine its outputs until tasks are complete. It excels in knowledge work, software development, and analytical problem-solving. With strong reasoning and persistence, GPT-5.5 Pro can manage long-running workflows across tools and systems. It delivers high-quality results while maintaining speed and efficiency. Overall, it enables individuals and teams to complete demanding tasks faster and more accurately.
    Starting Price: $30 per 1M tokens (input)
  • 21
    GPT-5

    OpenAI

    GPT-5 is OpenAI’s most advanced AI model, delivering smarter, faster, and more useful responses across a wide range of topics including math, science, finance, and law. It features built-in thinking capabilities that allow it to provide expert-level answers and perform complex reasoning. GPT-5 can handle long context lengths and generate detailed outputs, making it ideal for coding, research, and creative writing. The model includes a ‘verbosity’ parameter for customizable response length and improved personality control. It integrates with business tools like Google Drive and SharePoint to provide context-aware answers while respecting security permissions. Available to everyone, GPT-5 empowers users to collaborate with an AI assistant that feels like a knowledgeable colleague.
    Starting Price: $1.25 per 1M tokens
  • 22
    AppFit

    AppFit

    From idea to deployment, AppFit provides everything you need to build successful web and mobile applications. Build full-stack web and mobile applications with AI assistance at every step of the development process. Generate code, design interfaces, and solve problems faster than ever before. Validate your app ideas with AI-powered market insights and analytics to find product-market fit faster. Understand your target audience and competition before writing a single line of code. Learn as you build with our gamified no-code editor. AppFit breaks down complex app development into bite-sized, interactive lessons that make learning fun and engaging, just like Duolingo does for languages. Create responsive web applications and native-like mobile apps from a single codebase. Save time and resources while reaching users on any device.
  • 23
    SheetMagic

    SheetMagic

    SheetMagic is a Google Sheets add-on that brings unlimited AI content generation and unlimited web scraping directly into your spreadsheets. It enables users to generate AI content and images via formulas, tapping into GPT-3.5 Turbo, GPT-4/GPT-4 Turbo/GPT-4o, DALL·E 3, and any LLM via OpenRouter, all without coding or markup fees. With SheetMagic you can clean, analyze, summarize, and classify data; scrape entire webpages, search engine result pages, meta titles, headings, paragraphs, and custom selectors; and automate the creation of bulk product descriptions, ad copy, sales emails, SEO-optimized content, and enriched lead lists from existing sheet data and scraped inputs. The add-on supports programmatic workflows, multi-language prompts, team sharing, audit trails, and real-time dashboards, streamlining repetitive tasks so you can focus on strategy rather than manual entry.
    Starting Price: $19 per month
  • 24
    Gemini 2.5 Flash Image
    Gemini 2.5 Flash Image is Google’s latest state-of-the-art image generation and editing model, now accessible via the Gemini API, Google AI Studio’s build mode, and Gemini Enterprise Agent Platform. It enables powerful creative control by allowing users to blend multiple input images into a single visual, maintain consistent characters or products across edits for rich storytelling, and apply precise, natural-language-based transformations, such as removing objects, changing poses, adjusting colors, or altering backgrounds. The model is backed by Gemini’s deep world knowledge, enabling it to understand and reinterpret scenes or diagrams in context, which unlocks dynamic use cases like educational tutors or scene-aware editing assistants. Demonstrated through customizable template apps in AI Studio (including photo editors, multi-image fusers, and interactive tools), the model supports rapid prototyping and remixing via prompts or UI.
  • 25
    ShipAhead

    Tom Han

    ShipAhead is a Nuxt boilerplate built to help founders and developers launch AI SaaS products in days instead of months. It comes pre-loaded with essential features like authentication, Stripe payments, analytics, security, SEO optimization, and internationalization, so you can focus on building your core product. With built-in integrations for AI, customer support, PWA, and deployment, ShipAhead eliminates the repetitive and time-consuming setup tasks that slow down startups. The boilerplate includes ready-to-use UI components powered by TailwindCSS and daisyUI, ensuring your app looks professional from day one. Developers can rely on its strong backend foundation with Postgres, Drizzle ORM, file storage, and admin dashboards. Overall, ShipAhead helps entrepreneurs save up to 390 development hours, letting them launch faster, test ideas, and get profitable sooner.
    Starting Price: $99 one time
  • 26
    Raptor Write

    Raptor Write

    Raptor Write is a free AI-powered writing tool created by the Future Fiction Academy that helps writers brainstorm, outline, and draft stories with minimal friction. It features a clean, distraction-free interface designed to let authors focus on ideas rather than tool complexity. All projects are stored locally in the user’s browser, giving users more control over their work. The tool connects via OpenRouter, enabling users to plug in different AI models and experiment with output styles. While it’s lightweight and easy to use, it doesn’t include some of the deeper structure tools found in more heavyweight platforms. Still, it offers a gentle, no-cost entry point for writers curious about exploring AI in their creative workflows.
    Starting Price: Free
  • 27
    GPT-5.1

    OpenAI

    GPT-5.1 is the latest update in the GPT-5 series, designed to make ChatGPT dramatically smarter and more conversational. The release introduces two distinct model variants: GPT-5.1 Instant, which is described as the most-used model and is now warmer, better at following instructions, and more intelligent; and GPT-5.1 Thinking, which is the advanced reasoning engine that’s been tuned to be easier to understand, faster on straightforward tasks, and more persistent on complex ones. Users' queries are now routed automatically to the variant best-suited to the task. The update emphasizes not just improved raw intelligence but also enhanced communication style; the models are tuned to be more natural, enjoyable to talk to, and better aligned with user intents. The system card addendum notes that GPT-5.1 Instant uses “adaptive reasoning” that lets it decide when to think more deeply before responding, while GPT-5.1 Thinking adapts its thinking time accurately to the question at hand.
  • 28
    Gemini 3 Pro Image
    Gemini 3 Pro Image is a high-capability, multimodal image-generation and editing system that enables users to create, transform, and refine visuals through natural-language prompts or by combining multiple input images. It supports consistent character and object appearance across edits, precise local transformations (such as background blur, object removal, style transfers, or pose changes), and native world-knowledge understanding to ensure context-aware outcomes. It supports multi-image fusion, merging several photo inputs into a cohesive new image, and emphasizes design workflow features such as template-based outputs, brand-asset consistency, and repeated character/person-style appearances across scenes. It includes digital watermarking to tag AI-generated imagery and is available through the Gemini API, Google AI Studio, and Gemini Enterprise Agent Platform.
  • 29
    LFM2

    Liquid AI

    LFM2 is a next-generation series of on-device foundation models built to deliver the fastest generative-AI experience across a wide range of endpoints. It employs a new hybrid architecture that achieves up to 2x faster decode and prefill performance than comparable models, and up to 3x improvements in training efficiency compared to the previous generation. These models strike an optimal balance of quality, latency, and memory for deployment on embedded systems, allowing real-time, on-device AI across smartphones, laptops, vehicles, wearables, and other endpoints, enabling millisecond inference, device resilience, and full data sovereignty. Available in three dense checkpoints (0.35B, 0.7B, and 1.2B parameters), LFM2 outperforms similarly sized models on benchmarks covering knowledge recall, mathematics, multilingual instruction-following, and conversational dialogue.
  • 30
    GPT-5.2 Thinking
    GPT-5.2 Thinking is the highest-capability configuration in OpenAI’s GPT-5.2 model family, engineered for deep, expert-level reasoning, complex task execution, and advanced problem solving across long contexts and professional domains. Built on the foundational GPT-5.2 architecture with improvements in grounding, stability, and reasoning quality, this variant applies more compute and reasoning effort to generate responses that are more accurate, structured, and contextually rich when handling highly intricate workflows, multi-step analysis, and domain-specific challenges. GPT-5.2 Thinking excels at tasks that require sustained logical coherence, such as detailed research synthesis, advanced coding and debugging, complex data interpretation, strategic planning, and sophisticated technical writing, and it outperforms lighter variants on benchmarks that test professional skills and deep comprehension.
  • 31
    GPT-5.2 Instant
    GPT-5.2 Instant is the fast, capable variant of OpenAI’s GPT-5.2 model family designed for everyday work and learning with clear improvements in information-seeking questions, how-tos and walkthroughs, technical writing, and translation compared to prior versions. It builds on the warmer conversational tone introduced in GPT-5.1 Instant and produces clearer explanations that surface key information upfront, making it easier for users to get concise, accurate answers quickly. GPT-5.2 Instant delivers speed and responsiveness for typical tasks like answering queries, generating summaries, assisting with research, and helping with writing and editing, while incorporating broader enhancements from the GPT-5.2 series in reasoning, long-context handling, and factual grounding. As part of the GPT-5.2 lineup, it shares the same foundational improvements that boost overall reliability and performance across a wide range of everyday activities.
  • 32
    GPT-5.2 Pro
    GPT-5.2 Pro is the highest-capability variant of OpenAI’s latest GPT-5.2 model family, built to deliver professional-grade reasoning, complex task performance, and enhanced accuracy for demanding knowledge work, creative problem-solving, and enterprise-level applications. It builds on the foundational improvements of GPT-5.2, including stronger general intelligence, superior long-context understanding, better factual grounding, and improved tool use, while using more compute and deeper processing to produce more thoughtful, reliable, and context-rich responses for users with intricate, multi-step requirements. GPT-5.2 Pro is designed to handle challenging workflows such as advanced coding and debugging, deep data analysis, research synthesis, extensive document comprehension, and complex project planning with greater precision and fewer errors than lighter variants.
  • 33
    SpawnHQ

    SpawnHQ

    SpawnHQ is a software-as-a-service platform that lets users deploy, configure, and manage autonomous AI agents in minutes without writing code or setting up infrastructure. It offers a marketplace of pre-built, skill-based agents trained on your brand context that run continuously on managed compute and integrate with tools like Discord, web chat widgets, Twitter, SEO services, and CRMs. Users pick a skill (such as a support bot for customer questions, an SEO agent for monitoring rankings and drafting content, an outbound agent for lead discovery and outreach, or social and content engines), configure integrations and brand context, and deploy agents that act on natural language commands and run 24/7 on autopilot, executing tasks such as research, CRM updates, content generation, and automated responses. It handles managed compute, AI model routing (Claude, GPT, Gemini), scheduling, logs, reporting, and guardrails so agents can think and act independently.
    Starting Price: $59 per month
  • 34
    Nano Banana
    Nano Banana is Gemini’s fast, accessible image-creation model designed for quick, playful, and casual creativity. It lets users blend photos, maintain character consistency, and make small local edits with ease. The tool is perfect for transforming selfies, reimagining pictures with fun themes, or combining two images into one. With its ability to handle stylistic changes, it can turn photos into figurine-style designs, retro portraits, or aesthetic makeovers using simple prompts. Nano Banana makes creative experimentation easy and enjoyable, requiring no advanced skills or complex controls. It’s the ideal starting point for users who want simple, fast, and imaginative image editing inside the Gemini app.
  • 35
    ChatKit

    OpenAI

    ChatKit is a conversational AI toolkit that lets developers embed and manage chat agents across apps and websites. It provides capabilities such as chatting over external documents, text-to-speech, prompt templates, and shortcut triggers. Users can operate ChatKit either using their own OpenAI API key (paying according to OpenAI’s token pricing) or via ChatKit’s credit system (which requires a ChatKit license). ChatKit supports integrations with diverse model backends (including OpenAI, Azure OpenAI, Google Gemini, Ollama) and routing frameworks (e.g., OpenRouter). Feature offerings include cloud sync, team collaboration, web access, launcher widgets, shortcuts, and structured conversation flows over documents. In sum, ChatKit simplifies deploying intelligent chat agents without building the full chat infrastructure from scratch.
  • 36
    GPT-5.1 Instant
    GPT-5.1 Instant is a high-performance AI model designed for everyday users that combines speed, responsiveness, and improved conversational warmth. The model uses adaptive reasoning to instantly select how much computation is required for a task, allowing it to deliver fast answers without sacrificing understanding. It emphasizes stronger instruction-following, enabling users to give precise directions and expect consistent compliance. The model also introduces richer personality controls so chat tone can be set to Default, Friendly, Professional, Candid, Quirky, or Efficient, with experiments in deeper voice modulation. Its core value is to make interactions feel more natural and less robotic while preserving high intelligence across writing, coding, analysis, and reasoning. Requests made from the base interface are routed automatically, with the system choosing whether GPT-5.1 Instant or the deeper “Thinking” model is applied.
  • 37
    GPT-5.1 Thinking
    GPT-5.1 Thinking is the advanced reasoning model variant in the GPT-5.1 series, designed to more precisely allocate “thinking time” based on prompt complexity, responding faster to simpler requests and spending more effort on difficult problems. On a representative task distribution, it is roughly twice as fast on the fastest tasks and twice as slow on the slowest compared with its predecessor. Its responses are crafted to be clearer, with less jargon and fewer undefined terms, making deep analytical work more accessible and understandable. The model dynamically adjusts its reasoning depth, achieving a better balance between speed and thoroughness, particularly when dealing with technical concepts or multi-step questions. By combining high reasoning capacity with improved clarity, GPT-5.1 Thinking offers a powerful tool for tackling complex tasks, such as detailed analysis, coding, research, or technical explanations, while reducing unnecessary latency for routine queries.
  • 38
    GPT-5.2

    OpenAI

    GPT-5.2 is the newest evolution in the GPT-5 series, engineered to deliver even greater intelligence, adaptability, and conversational depth. This release introduces enhanced model variants that refine how ChatGPT reasons, communicates, and responds to complex user intent. GPT-5.2 Instant remains the primary, high-usage model—now faster, more context-aware, and more precise in following instructions. GPT-5.2 Thinking takes advanced reasoning further, offering clearer step-by-step logic, improved consistency on multi-stage problems, and more efficient handling of long or intricate tasks. The system automatically routes each query to the most suitable variant, ensuring optimal performance without requiring user selection. Beyond raw intelligence gains, GPT-5.2 emphasizes more natural dialogue flow, stronger intent alignment, and a smoother, more humanlike communication style.
  • 39
    Grok 4.1 Thinking
    Grok 4.1 Thinking is xAI’s advanced reasoning-focused AI model designed for deeper analysis, reflection, and structured problem-solving. It uses explicit thinking tokens to reason through complex prompts before delivering a response, resulting in more accurate and context-aware outputs. The model excels in tasks that require multi-step logic, nuanced understanding, and thoughtful explanations. Grok 4.1 Thinking demonstrates a strong, coherent personality while maintaining analytical rigor and reliability. It has achieved the top overall ranking on the LMArena Text Leaderboard, reflecting strong human preference in blind evaluations. The model also shows leading performance in emotional intelligence and creative reasoning benchmarks. Grok 4.1 Thinking is built for users who value clarity, depth, and defensible reasoning in AI interactions.
  • 40
    Nano Banana 2
    Nano Banana 2 is Google DeepMind’s latest image generation model, combining the advanced capabilities of Nano Banana Pro with the high-speed performance of Gemini Flash. It delivers improved world knowledge, enabling more accurate subject rendering and data-driven visuals grounded in real-time information. The model enhances precision text rendering and translation, making it ideal for marketing assets, infographics, and localized content. Users benefit from stronger instruction following, ensuring complex prompts are captured accurately. Nano Banana 2 supports subject consistency across multiple characters and objects within a single workflow. It offers production-ready output with customizable aspect ratios and resolutions up to 4K. Available across Gemini, Search, AI Studio, Google Cloud, and more, Nano Banana 2 brings high-quality visual generation at lightning-fast speed.
  • 41
    Fluent

    Epic Bits

    Fluent is a native AI assistant for macOS that lets you use any AI model across any app without switching tools. It brings real-time app context into your AI workflows, allowing you to write, edit, and chat directly where you work. Fluent supports over 500 AI models, including OpenAI, Gemini, Anthropic, Grok, OpenRouter, and local models for full privacy. The app preserves original formatting while helping users rewrite content, compare ideas, and follow up seamlessly. Fluent works inside popular apps like browsers, email clients, note-taking tools, calendars, and document editors. Custom actions and keyboard shortcuts help users stay focused and maintain productivity flow. Designed for Apple Silicon and Intel Macs, Fluent delivers fast, private, and powerful AI assistance with a one-time lifetime license.
    Starting Price: $49
  • 42
    nanobot

    nanobot

    nanobot is an open source, ultra-lightweight personal AI assistant framework designed to deliver the core agent loop and autonomous AI capabilities in a minimal, readable codebase of roughly 3,400–4,000 lines of Python, which is ~99% smaller than comparable large agent frameworks. It’s intentionally simple and modular, making it easy to understand, extend, and experiment with for research or custom projects. nanobot supports persistent memory, scheduled tasks, built-in tools, and integration with multiple large language models (via OpenRouter or other providers), and can run locally or be deployed quickly with CLI commands. It also offers optional real-time web search and multi-platform chat interfaces (e.g., Telegram, Discord, WhatsApp, Feishu) so you can interact with the agent from different environments. Its minimal footprint enables fast startup, low resource use, and a clean architecture that developers can adapt without heavy abstractions.
  • 43
    Gemini 3.1 Flash Image
    Gemini 3.1 Flash Image is Google DeepMind’s latest image generation model, combining advanced Pro-level capabilities with lightning-fast performance. It delivers enhanced world knowledge, enabling more accurate subject rendering and data-informed visuals grounded in real-time information. The model improves precision text rendering and in-image translation, making it well-suited for marketing assets, infographics, and localized creative content. Stronger instruction following ensures complex prompts are executed with clarity and accuracy. Gemini 3.1 Flash Image maintains subject consistency across multiple characters and objects within a single workflow. It supports production-ready outputs with customizable aspect ratios and resolutions up to 4K. Available across Gemini, Search, AI Studio, Google Cloud, and more, it brings high-quality visual generation at Flash-level speed.
  • 44
    GPT-5.3 Instant
    GPT-5.3 Instant is an updated version of ChatGPT’s most-used model, designed to make everyday conversations more fluid, helpful, and accurate. The release focuses on improving tone, relevance, and conversational flow based directly on user feedback. It reduces unnecessary refusals and cuts back on overly cautious disclaimers, delivering clearer and more direct answers when appropriate. The model also improves how it integrates web results, providing better-contextualized information rather than long lists of loosely connected links. Accuracy has been strengthened, with measurable reductions in hallucinations across both high-stakes domains and everyday queries. GPT-5.3 Instant enhances creative writing capabilities, producing more textured, emotionally resonant prose. It is available to all ChatGPT users, and to developers via the API under ‘gpt-5.3-chat-latest’, with legacy versions scheduled for retirement.
  • 45
    GPT-5.4 Pro
    GPT-5.4 Pro is an advanced AI model developed by OpenAI to deliver high-performance capabilities for professional and complex tasks. It combines improvements in reasoning, coding, and agent-based workflows into a single unified system. The model is designed to work efficiently across professional tools such as spreadsheets, presentations, documents, and development environments. GPT-5.4 Pro also includes native computer-use capabilities, enabling AI agents to interact with software, websites, and operating systems to complete tasks. With support for up to one million tokens of context, it can manage long workflows and large datasets more effectively than previous models. The model also improves tool usage, allowing it to search for and select the right tools during multi-step processes. By delivering more accurate outputs with fewer tokens, GPT-5.4 Pro helps professionals complete complex work faster and more efficiently.
  • 46
    GPT-5.4 Thinking
    GPT-5.4 Thinking is an advanced reasoning-focused AI model available within ChatGPT, designed to help users complete complex professional tasks more effectively. It combines improvements in reasoning, coding, and agent-based workflows to provide more accurate and reliable outputs. The model can present an upfront outline of its reasoning process, allowing users to adjust instructions while it is generating a response. This capability helps produce results that better align with user goals without requiring multiple follow-up prompts. GPT-5.4 Thinking also improves deep web research, enabling it to locate and synthesize information from multiple sources more efficiently. With stronger context management, it can handle longer conversations and complex problem-solving tasks with greater coherence. These capabilities make GPT-5.4 Thinking well suited for professional knowledge work and advanced analytical tasks.
  • 47
    GPT-5.4 mini
    GPT-5.4 mini is a fast and efficient AI model designed for high-performance tasks such as coding, reasoning, and multimodal understanding. It delivers strong capabilities similar to larger models while maintaining lower latency and cost. The model is optimized for responsive applications where speed is critical, including coding assistants and real-time workflows. GPT-5.4 mini supports advanced features such as tool use, function calling, and image interpretation. It performs well on complex tasks while running significantly faster than previous mini models. The model is also suitable for subagent systems, where it handles smaller tasks within larger AI workflows. By combining speed, efficiency, and strong performance, GPT-5.4 mini enables scalable AI applications across various use cases.
  • 48
    GPT-5.4 nano
    GPT-5.4 nano is a lightweight and highly efficient AI model designed for fast, cost-effective task execution. It is optimized for simple and high-volume tasks such as classification, data extraction, and basic coding support. The model delivers quick responses with minimal latency, making it ideal for real-time and large-scale applications. GPT-5.4 nano improves significantly over previous nano models in both performance and efficiency. It supports essential capabilities like tool use and structured data processing. The model is commonly used as a supporting component within larger AI systems. By focusing on speed and affordability, GPT-5.4 nano enables scalable automation across various workflows.
  • 49
    TaskMaster AI

    TaskMaster AI

    TaskMaster AI is an AI-powered project management tool designed specifically for coordinating and guiding AI agents through complex workflows by breaking down large objectives into structured, manageable tasks with clear dependencies. It functions as a programmable “project manager” for AI-driven development, enabling users to define requirements, automatically generate task lists, and organize execution in a way that preserves context across long, multi-step processes. It supports the creation of Product Requirement Documents (PRDs) that can be parsed into actionable tasks and subtasks, allowing agents to work sequentially and coherently without losing track of previous steps. It also integrates with multiple AI providers and models, enabling flexible configuration of main, research, and fallback agents to optimize performance and reliability.
  • 50
    Tune AI

    NimbleBox

    Leverage the power of custom models to build your competitive advantage. With our enterprise Gen AI stack, go beyond your imagination and offload manual tasks to powerful assistants instantly – the sky is the limit. For enterprises where data security is paramount, fine-tune and deploy generative AI models on your own cloud, securely.