25 Integrations with Vercel AI Gateway

View a list of Vercel AI Gateway integrations and software that integrates with Vercel AI Gateway below. Compare the best Vercel AI Gateway integrations as well as features, ratings, user reviews, and pricing of software that integrates with Vercel AI Gateway. Here are the current Vercel AI Gateway integrations in 2026:

  • 1
    Vercel

    Vercel

    Vercel

    Vercel is an AI-powered cloud platform that helps developers build, deploy, and scale high-performance web experiences with speed and security. It provides a unified set of tools, templates, and infrastructure designed to streamline development workflows from idea to global deployment. With support for modern frameworks like Next.js, Svelte, Vite, and Nuxt, teams can ship fast, responsive applications without managing complex backend operations. Vercel’s AI Cloud includes an AI Gateway, SDKs, workflow automation tools, and fluid compute, enabling developers to integrate large language models and advanced AI features effortlessly. The platform emphasizes instant global distribution, enabling deployments to become available worldwide immediately after a git push. Backed by strong security and performance optimizations, Vercel helps companies deliver personalized, reliable digital experiences at massive scale.
  • 2
    Gemini 3.5 Flash
    Gemini 3.5 Flash is Google’s latest frontier AI model designed to combine advanced intelligence, high-speed performance, and agentic workflow execution for developers, enterprises, and everyday users. Built as part of the Gemini 3.5 family, the model excels at coding, long-horizon reasoning, multimodal understanding, and complex multi-step automation tasks while delivering significantly faster output speeds than many competing frontier models. Gemini 3.5 Flash powers AI agents capable of planning, executing, and managing workflows such as application development, codebase maintenance, data analysis, and financial document preparation through the Antigravity harness. The model also supports rich multimodal experiences by generating interactive graphics, dynamic web interfaces, animations, and advanced visual content. Gemini 3.5 Flash is integrated across Google products including the Gemini app, Google Search AI Mode, Google Antigravity, Google AI Studio, Android Studio, and more.
    Starting Price: $1.50 per 1M tokens (input)
  • 3
    Claude Opus 4.7
    Claude Opus 4.7 is the latest Anthropic AI model release designed to significantly improve performance in advanced software engineering and complex problem-solving tasks. It builds upon the previous Opus 4.6 model by delivering stronger results on difficult coding challenges and long-running workflows. The model is known for its ability to follow instructions precisely and verify its own outputs for greater reliability. It also introduces enhanced multimodal capabilities, particularly in processing high-resolution images with improved accuracy. Opus 4.7 supports more detailed visual tasks such as analyzing dense screenshots and extracting data from complex diagrams. In professional settings, it produces higher-quality outputs including documents, presentations, and user interfaces. The model includes updated safety features that detect and block high-risk cybersecurity-related requests.
    Starting Price: $5 per million tokens (input)
  • 4
    Grok Build
    Grok Build is an AI-powered command-line development environment designed to help developers build, manage, and automate software projects more efficiently. The platform provides a fast and flicker-free CLI experience that supports planning, coding, reviewing, and coordinating tasks across multiple AI-powered agents. Grok Build can adapt to different workflows and user preferences through customizable skills and interface enhancements. Developers can use the platform to architect complex projects with plan viewers, subagents, and parallel task execution capabilities. The system also includes marketplaces that allow teams to share workflows, capabilities, and productivity tools across projects. Grok Build supports interactive coding assistance, interface refinement suggestions, and contextual prompts that help streamline development processes.
  • 5
    GLM-5.1

    GLM-5.1

    Zhipu AI

    GLM-5.1 is the latest iteration of Z.ai’s GLM series, designed as a frontier-level, agent-oriented AI model optimized for coding, reasoning, and long-horizon workflows. It builds on the GLM-5 architecture, which uses a Mixture-of-Experts (MoE) design to deliver high performance while keeping inference costs efficient, and is part of a broader push toward open-weight, developer-accessible models. A core focus of GLM-5.1 is enabling agentic behavior, meaning it can plan, execute, and iterate across multi-step tasks rather than simply responding to single prompts. It is specifically designed to handle complex workflows such as debugging code, navigating repositories, and executing chained operations with sustained context. Compared to earlier models, GLM-5.1 improves reliability in long interactions, maintaining coherence across extended sessions and reducing breakdowns in multi-step reasoning.
    Starting Price: Free
  • 6
    Qwen3.6-Max-Preview
    Qwen3.6-Max-Preview is a next-generation frontier language model designed to push the limits of intelligence, instruction following, and real-world agent capabilities within the Qwen ecosystem. Building on the Qwen3 series, this preview release introduces stronger world knowledge, sharper instruction alignment, and significant improvements in agentic coding performance, enabling the model to better handle complex, multi-step tasks and software engineering workflows. It is engineered for advanced reasoning and execution scenarios, where the model not only generates responses but also interacts with tools, processes long contexts, and supports structured problem-solving across domains such as coding, research, and enterprise workflows. The architecture continues the Qwen focus on large-scale, high-efficiency models capable of handling extensive context windows and delivering consistent performance across multilingual and knowledge-intensive tasks.
    Starting Price: Free
  • 7
    Kimi K2.6

    Kimi K2.6

    Moonshot AI

    Kimi K2.6 is a next-generation agentic AI model developed by Moonshot AI, designed to push forward real-world execution, coding, and multi-step reasoning beyond earlier K2 and K2.5 versions. It builds on a Mixture-of-Experts architecture and the multimodal, agent-first foundation of the Kimi series, combining language understanding, coding, and tool use into a single system capable of planning and executing complex workflows. It introduces deeper reasoning capabilities and significantly improved agent planning, allowing it to break down tasks, coordinate tools, and handle multi-file or multi-step problems with greater accuracy and efficiency. It supports advanced tool calling with high reliability, enabling integration with external systems such as web search or APIs, and includes built-in validation mechanisms to ensure correct execution formats.
    Starting Price: Free
  • 8
    Qwen3.7-Max
    Qwen3.7-Max is Qwen’s latest proprietary model designed for the agent era, built to be a versatile agent foundation that is equally capable of writing and debugging code, automating office workflows, and sustaining autonomous browser sessions over long horizons. It reaches frontier-level coding performance, with stronger results across software engineering, terminal tasks, GUI grounding, web browsing, and agentic tool use. Qwen3.7-Max is designed to reduce the gap between model intelligence and real agent execution by supporting planning, long-context reasoning, reliable function calling, and multi-step task completion across complex workflows. It also strengthens multimodal and document-oriented work through Qwen Studio, which supports chatbot interaction, image and video understanding, image generation, document processing, presentation generation, coding assistance, deep research, and web development.
    Starting Price: Free
  • 9
    Next.js

    Next.js

    Vercel

    Next.js gives you the best developer experience with all the features you need for production: hybrid static & server rendering, TypeScript support, smart bundling, route pre-fetching, and more. The interactive course with quizzes will guide you through everything you need to know to use Next.js. Next.js has built-in support for internationalized (i18n) routing since v10.0.0. You can provide a list of locales, the default locale, and domain-specific locales and Next.js will automatically handle the routing.
    Starting Price: Free
  • 10
    Flags SDK
    Flags SDK is an open source library designed for integrating feature flags into Next.js and SvelteKit applications. It allows developers to manage and implement feature flags, providing flexibility with integrations for any flag provider. The SDK supports AB testing, controlled rollouts, and experimentation, helping developers ship features faster and more efficiently. It features a simple, declarative API to define and use flags, making integration effortless. With Flags SDK, developers can release new features strategically, test them in production without breaking apps, and experiment with confidence. The toolkit ensures the website remains fast while following best practices for feature flag usage.
    Starting Price: Free
  • 11
    Vercel AI SDK
    The Vercel AI SDK is a free, open source TypeScript toolkit from the creators of Next.js that gives developers unified, high-level primitives to build AI-powered features quickly across any model provider by changing a single line of code. It abstracts common complexities like streaming responses, multi-turn tool execution, error handling and recovery, and model switching while remaining framework-agnostic so builders can go from idea to working application in minutes. With a unified provider API, developers can generate typed objects, compose generative UIs, and deliver instant, streamed AI responses without reinventing plumbing, and the SDK includes documentation, cookbooks, a playground, and community-driven extensibility to accelerate development. It handles the hard parts under the hood while exposing enough control to get under the hood when needed, making integration with multiple LLMs seamless.
    Starting Price: Free
  • 12
    DeepSeek-V4

    DeepSeek-V4

    DeepSeek

    DeepSeek-V4 is a next-generation open-source language model designed for high-performance reasoning, coding, and long-context intelligence. It introduces a powerful architecture with up to one million token context length, enabling seamless handling of large datasets and complex multi-step workflows. The model comes in two variants: DeepSeek-V4-Pro for maximum performance and DeepSeek-V4-Flash for efficiency and speed. DeepSeek-V4-Pro features 1.6 trillion total parameters with 49 billion activated, delivering near state-of-the-art performance comparable to leading closed-source models. It excels in agentic coding, mathematical reasoning, and world knowledge tasks. The model integrates advanced attention mechanisms, including token-wise compression and sparse attention, significantly reducing compute and memory costs. It is also optimized for AI agents, supporting tool use and multi-step workflows.
    Starting Price: Free
  • 13
    Trinity-Large-Thinking
    Trinity Large Thinking is a frontier open source reasoning model developed by Arcee AI, designed specifically for complex, multi-step problem solving and autonomous agent workflows that require long-horizon planning and tool use. Built on a sparse Mixture-of-Experts architecture with roughly 400 billion total parameters but only about 13 billion active per token, the model achieves high efficiency while maintaining strong reasoning performance across tasks such as mathematical problem solving, code generation, and multi-step analysis. It introduces extended chain-of-thought reasoning capabilities, allowing the model to generate intermediate “thinking traces” before producing final answers, which improves accuracy and reliability in complex scenarios. Trinity Large Thinking supports a very large context window of up to 262K tokens, enabling it to process long documents, maintain state across extended interactions, and operate effectively in continuous agent loops.
    Starting Price: Free
  • 14
    GPT-5.5

    GPT-5.5

    OpenAI

    GPT-5.5 is an advanced AI model designed to handle complex, real-world tasks with greater autonomy and efficiency. It quickly understands user intent and can execute multi-step workflows such as coding, research, data analysis, and document creation with minimal guidance. Instead of requiring step-by-step instructions, GPT-5.5 plans tasks, uses tools, evaluates outputs, and continues working until completion. It excels in knowledge work, software development, and analytical problem-solving, helping users move from idea to execution faster. The model is built to operate across tools and environments, making it highly effective for modern digital workflows. With strong reasoning and persistence, GPT-5.5 enables individuals and teams to complete demanding work more efficiently and accurately.
    Starting Price: $5 per 1M tokens (input)
  • 15
    GPT-5.5 Pro
    GPT-5.5 Pro is an advanced AI model designed to handle complex, real-world work with greater autonomy and efficiency. It understands user intent quickly and can execute multi-step tasks such as coding, research, data analysis, and document creation with minimal guidance. The model is built to plan, use tools, and refine its outputs until tasks are complete. It excels in knowledge work, software development, and analytical problem-solving. With strong reasoning and persistence, GPT-5.5 Pro can manage long-running workflows across tools and systems. It delivers high-quality results while maintaining speed and efficiency. Overall, it enables individuals and teams to complete demanding tasks faster and more accurately.
    Starting Price: $30 per 1M tokens (input)
  • 16
    DeepSeek-V4-Pro
    DeepSeek-V4-Pro is a large-scale Mixture-of-Experts (MoE) language model designed for advanced reasoning, coding, and long-context understanding. It features 1.6 trillion total parameters with 49 billion activated parameters, enabling high performance while maintaining efficiency. The model supports an exceptionally large context window of up to one million tokens, allowing it to process extensive documents and workflows. It uses a hybrid attention architecture to optimize long-context performance and reduce computational cost. DeepSeek-V4-Pro is trained on over 32 trillion tokens, improving its knowledge and reasoning capabilities. It also includes advanced optimization techniques for stability and faster convergence during training. The model supports multiple reasoning modes, allowing users to balance speed and accuracy based on their needs. Overall, it provides a powerful open-source solution for complex AI tasks and large-scale applications.
    Starting Price: Free
  • 17
    DeepSeek-V4-Flash
    DeepSeek-V4-Flash is a high-efficiency Mixture-of-Experts (MoE) language model designed for fast, scalable reasoning and text generation. It features 284 billion total parameters with 13 billion activated parameters, delivering strong performance while optimizing computational cost. The model supports an extensive context window of up to one million tokens, enabling it to process large documents and complex workflows with ease. Its hybrid attention architecture enhances long-context efficiency by reducing memory and compute requirements. Trained on over 32 trillion tokens, DeepSeek-V4-Flash demonstrates solid capabilities across knowledge, reasoning, and coding tasks. It is designed for scenarios where speed and efficiency are critical, offering a balance between performance and resource usage. The model also supports multiple reasoning modes, allowing users to adjust between faster outputs and deeper analysis.
    Starting Price: Free
  • 18
    Recraft

    Recraft

    Recraft

    Recraft is an AI-powered image generation platform designed to create high-quality visuals with strong design aesthetics. It enables users to generate photorealistic images, vectors, and design assets from simple prompts. The platform stands out for its ability to produce vector graphics directly, making it useful for professional design work. Recraft focuses on delivering visually consistent and stylistically refined outputs without requiring extensive training. Users can easily create and reuse custom styles by uploading reference images. It also includes tools for editing, upscaling, and refining images within a single platform. The system is built to support creative workflows for branding, marketing, and visual content creation. Overall, Recraft helps designers and creators produce polished visuals quickly and efficiently.
    Starting Price: $10/month
  • 19
    Gemma 4

    Gemma 4

    Google

    Gemma 4 is an AI model introduced by Google and built on the Gemini architecture to deliver improved performance and flexibility. The model is designed to run efficiently on a single GPU or TPU, making it more accessible to developers and researchers. Gemma 4 enhances capabilities in natural language understanding and text generation, supporting a wide range of AI-driven applications. Its architecture allows it to handle complex tasks while maintaining efficient resource usage. Developers can use the model to build applications that rely on advanced language processing and automation. The design emphasizes scalability so that it can support both smaller projects and larger AI systems. By combining efficiency with powerful language capabilities, Gemma 4 helps advance the development of modern AI solutions.
    Starting Price: Free
  • 20
    Qwen3.6-Plus
    Qwen3.6-Plus is an advanced AI model developed by Alibaba Cloud, designed to power real-world intelligent agents and complex workflows. It introduces significant improvements in agentic coding, enabling developers to handle everything from frontend development to large-scale codebase management. The model features a massive 1 million token context window, allowing it to process and reason over long and complex inputs. It integrates reasoning, memory, and execution capabilities to deliver highly accurate and reliable results. Qwen3.6-Plus also enhances multimodal capabilities, enabling it to understand and analyze images, videos, and documents. The platform is optimized for real-world applications, including automation, planning, and tool-based workflows. Overall, it provides a powerful foundation for building next-generation AI agents and intelligent systems.
  • 21
    MiMo-V2.5-Pro

    MiMo-V2.5-Pro

    Xiaomi Technology

    Xiaomi MiMo-V2.5-Pro is an advanced open-source AI model designed to handle complex, long-horizon tasks with strong agentic capabilities. It features a Mixture-of-Experts architecture with over one trillion parameters and a large context window of up to one million tokens. The model is built to perform sophisticated reasoning, coding, and problem-solving across extended workflows. It demonstrates high performance on benchmark tests related to software engineering, reasoning, and general intelligence. MiMo-V2.5-Pro can autonomously complete complex projects, such as building full software systems or optimizing engineering designs. It uses hybrid attention mechanisms to balance efficiency and performance across long contexts. The model is also optimized for token efficiency, reducing computational cost while maintaining strong results. By combining scalability, efficiency, and advanced reasoning, MiMo-V2.5-Pro represents a major step forward in open-source AI models.
  • 22
    MiMo-V2.5

    MiMo-V2.5

    Xiaomi Technology

    Xiaomi MiMo-V2.5 is an advanced open-source AI model designed to combine strong agentic capabilities with native multimodal understanding. It can process and reason across text, images, and audio within a single unified system. The model uses a sparse Mixture-of-Experts architecture with hundreds of billions of parameters for efficient performance. It supports an extended context window of up to one million tokens, enabling long and complex workflows. MiMo-V2.5 is built to handle tasks such as coding, reasoning, and multimodal analysis with high accuracy. It incorporates dedicated visual and audio encoders to enhance perception and cross-modal reasoning. The model demonstrates strong benchmark performance across coding, reasoning, and multimodal tasks. By combining multimodality, efficiency, and agentic intelligence, MiMo-V2.5 advances the capabilities of open-source AI systems.
  • 23
    Seedance 2.0

    Seedance 2.0

    ByteDance

    Seedance 2.0 is ByteDance’s advanced AI video generation platform built to turn creative inputs into cinematic-quality videos. It supports text prompts, images, audio, and video, blending them into polished visuals with smooth transitions and native sound. The platform uses sophisticated multimodal and motion synthesis to preserve visual consistency and character identity across multiple scenes. Users can combine up to twelve reference assets in a single project, enabling complex storytelling without manual editing. Seedance 2.0 automatically plans camera movement and pacing, giving creators director-level control with minimal effort. The system is capable of producing high-resolution video output, including 1080p and above. Its rapid popularity highlights its ability to generate engaging animated and narrative-driven content from simple inputs.
  • 24
    Grok 4.3
    Grok 4.3 is the latest iteration of xAI’s Grok model, designed to deliver improved reasoning, real-time information access, and advanced task automation. It builds on earlier Grok 4 models by enhancing performance in complex problem-solving, coding, and analytical workflows. The model is integrated with real-time web and X (formerly Twitter) data, allowing it to provide up-to-date insights and answers. Grok 4.3 supports multimodal capabilities, enabling it to work with text, images, and other data types. It operates within the SuperGrok Heavy tier, offering access to more powerful compute and advanced features. The model is designed to handle long-context tasks and multi-step reasoning with greater accuracy. It also supports tool use and integrations, enabling it to interact with external systems and automate workflows. Overall, Grok 4.3 is positioned as a high-performance AI assistant for real-time, data-driven tasks.
  • 25
    ChatGPT Images 2.0
    ChatGPT Images 2.0 is a next-generation AI image generation system developed by OpenAI to create high-quality visuals from text prompts. It introduces advanced visual reasoning, allowing the model to “think” through prompts before generating images. The system significantly improves text rendering, making it possible to include accurate and readable text inside images. It supports multilingual content, enabling users to generate visuals with text in multiple languages. ChatGPT Images 2.0 can produce multiple consistent images from a single prompt, maintaining characters and objects across variations. The model also offers higher resolution outputs and better control over layout and composition. It is designed to move beyond simple image generation into practical design use cases like presentations, marketing visuals, and UI mockups. By combining reasoning with image creation, it delivers more accurate and usable visual results.
  • Previous
  • You're on page 1
  • Next