42 Integrations with omp
View a list of omp integrations and software that integrates with omp below. Compare the best omp integrations as well as features, ratings, user reviews, and pricing of software that integrates with omp. Here are the current omp integrations in 2026:
-
1
OpenRouter
OpenRouter
OpenRouter is a unified interface for LLMs. OpenRouter scouts for the lowest prices and best latencies/throughputs across dozens of providers, and lets you choose how to prioritize them. No need to change your code when switching between models or providers. You can even let users choose and pay for their own. Evals are flawed; instead, compare models by how often they're used for different purposes. Chat with multiple at once in the chatroom. Model usage can be paid by users, developers, or both, and may shift in availability. You can also fetch models, prices, and limits via API. OpenRouter routes requests to the best available providers for your model, given your preferences. By default, requests are load-balanced across the top providers to maximize uptime, but you can customize how this works using the provider object in the request body. Prioritize providers that have not seen significant outages in the last 10 seconds.Starting Price: Free -
2
Git
Git
Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency. Git is easy to learn and has a tiny footprint with lightning fast performance. It outclasses SCM tools like Subversion, CVS, Perforce, and ClearCase with features like cheap local branching, convenient staging areas, and multiple workflows. You can query/set/replace/unset options with this command. The name is actually the section and the key separated by a dot, and the value will be escaped.Starting Price: Free -
3
Perplexity
Perplexity AI
Perplexity is an AI-powered search and answer engine designed to provide accurate, real-time information. It combines natural language processing with web search to deliver concise and reliable answers. Users can ask questions conversationally and receive responses backed by cited sources. The platform focuses on transparency by showing where information comes from. It supports research, learning, and decision-making across various topics. Perplexity also offers follow-up questions to deepen understanding. Overall, it is a modern alternative to traditional search engines.Starting Price: Free -
4
OpenAI
OpenAI
OpenAI’s mission is to ensure that artificial general intelligence (AGI)—by which we mean highly autonomous systems that outperform humans at most economically valuable work—benefits all of humanity. We will attempt to directly build safe and beneficial AGI, but will also consider our mission fulfilled if our work aids others to achieve this outcome. Apply our API to any language task — semantic search, summarization, sentiment analysis, content generation, translation, and more — with only a few examples or by specifying your task in English. One simple integration gives you access to our constantly-improving AI technology. Explore how you integrate with the API with these sample completions. -
5
GitHub Copilot
GitHub
GitHub Copilot is an AI-powered development assistant designed to accelerate software workflows from the editor to the enterprise. It works directly inside popular IDEs, terminals, and GitHub itself to help developers write, understand, and improve code faster. Copilot supports multiple leading large language models, allowing users to optimize for speed, accuracy, or cost. Developers can use Copilot to complete code, explain concepts, propose edits, and validate files in real time. It also enables agent-based workflows where Copilot can autonomously handle issues, write code, and create pull requests. With seamless integration across tools, Copilot keeps developers focused without breaking their flow. GitHub Copilot is built to scale from individual developers to large organizations with enterprise-grade controls.Starting Price: $10 per month -
6
Claude
Anthropic
Claude is a next-generation AI assistant developed by Anthropic to help individuals and teams solve complex problems with safety, accuracy, and reliability at its core. It is designed to support a wide range of tasks, including writing, editing, coding, data analysis, and research. Claude allows users to create and iterate on documents, websites, graphics, and code directly within chat using collaborative tools like Artifacts. The platform supports file uploads, image analysis, and data visualization to enhance productivity and understanding. Claude is available across web, iOS, and Android, making it accessible wherever work happens. With built-in web search and extended reasoning capabilities, Claude helps users find information and think through challenging problems more effectively. Anthropic emphasizes security, privacy, and responsible AI development to ensure Claude can be trusted in professional and personal workflows.Starting Price: Free -
7
Gemini
Google
Gemini is Google’s advanced AI assistant designed to help users think, create, learn, and complete tasks with a new level of intelligence. Powered by Google’s most capable models, including Gemini 3, it enables users to ask complex questions, generate content, analyze information, and explore ideas through natural conversation. Gemini can create images, videos, summaries, study plans, and first drafts while also providing feedback on uploaded files and written work. The platform is grounded in Google Search, allowing it to deliver accurate, up-to-date information and support deep follow-up questions. Gemini connects seamlessly with Google apps like Gmail, Docs, Calendar, Maps, YouTube, and Photos to help users complete tasks without switching tools. Features such as Gemini Live, Deep Research, and Gems enhance brainstorming, research, and personalized workflows. Available through flexible free and paid plans, Gemini supports everyday users, students, and professionals across devices.Starting Price: Free -
8
Mistral AI
Mistral AI
Mistral AI is a pioneering artificial intelligence startup specializing in open-source generative AI. The company offers a range of customizable, enterprise-grade AI solutions deployable across various platforms, including on-premises, cloud, edge, and devices. Flagship products include "Le Chat," a multilingual AI assistant designed to enhance productivity in both personal and professional contexts, and "La Plateforme," a developer platform that enables the creation and deployment of AI-powered applications. Committed to transparency and innovation, Mistral AI positions itself as a leading independent AI lab, contributing significantly to open-source AI and policy development.Starting Price: Free -
9
OpenAI Codex
OpenAI
Codex is an AI-powered coding agent from OpenAI designed to help developers build, manage, and ship software more efficiently across the entire development lifecycle. It acts as an intelligent pair programmer that can understand codebases, generate features, and deliver production-ready pull requests. Codex can safely execute commands in sandboxed environments while assisting with debugging, refactoring, and testing. A key advancement is its computer use capability, allowing it to operate your computer by seeing, clicking, and typing across applications. This enables Codex to interact with tools that don’t have APIs, making it useful for tasks like frontend testing and app navigation. The platform also includes an in-app browser and integrations with various developer tools for a more unified workflow. Codex supports automation by handling ongoing tasks such as monitoring, issue triage, and follow-ups.Starting Price: $20/month -
10
Cursor
Cursor
Cursor is an advanced AI-powered IDE designed to make developers exponentially more productive. Built with deep codebase understanding and intelligent automation, it combines natural language interaction with precise, context-aware editing tools. Its Agent feature acts as a human-AI coding partner capable of planning and executing entire development workflows, while the Tab model delivers remarkably accurate autocompletion and targeted suggestions. Cursor seamlessly integrates across environments—from GitHub and Slack to the command line—ensuring AI assistance is available wherever you code. Supporting leading models like GPT-5, Claude Sonnet, Gemini Pro, and Grok Code, it gives developers full control over autonomy and model selection. Fast, intuitive, and built for serious builders, Cursor is redefining what an IDE can be.Starting Price: $20 per month -
11
Grok
xAI
Grok is an advanced AI assistant developed by xAI, designed to provide real-time insights, intelligent responses, and conversational support. It is deeply integrated with the X (formerly Twitter) platform, allowing users to access up-to-date information and trending discussions. Grok is built to answer complex questions with a mix of reasoning, humor, and personality. It can assist with tasks such as research, content creation, and general problem-solving. The platform leverages large language models to deliver accurate and context-aware responses. Grok stands out for its ability to access live data, making it highly relevant for current events. Overall, it offers a dynamic and engaging AI experience for everyday users.Starting Price: Free -
12
Kilo Code
Kilo Code
Kilo Code is a powerful open-source coding agent designed to help developers build, ship, and iterate faster across every stage of the software development workflow. It offers multiple modes—including Ask, Architect, Code, Debug, and Orchestrator—so developers can switch seamlessly between tasks with tailored AI support. The platform includes features such as hallucination-free code, automatic failure recovery, and deep context awareness to ensure accuracy and reliability. Developers can run parallel agents, enjoy fast autocomplete, and even deploy applications with a single click. With access to 500+ models and integration across terminals, VS Code, and JetBrains editors, Kilo provides unmatched flexibility. As the #1 agent on OpenRouter with over 750,000 users, it has quickly become a preferred choice for modern AI-assisted development.Starting Price: $15/user/month -
13
Google Antigravity
Google
Google Antigravity is an agentic development platform that reimagines the traditional IDE for the AI-first era. Designed for developers of all levels, it enables seamless collaboration between humans and intelligent agents across the editor, terminal, and browser. The platform allows developers to issue natural language commands, monitor autonomous coding workflows, and review generated artifacts—all from a unified interface. Antigravity introduces cross-surface agent synchronization, ensuring consistency and context sharing across multiple workspaces. Its mission control view lets users manage and refine multiple agents simultaneously, making complex development tasks faster, smarter, and more intuitive. Whether you’re building enterprise-scale systems or experimenting creatively, Google Antigravity elevates the development experience into a new era of agent-driven productivity.Starting Price: Free -
14
Python
Python
The core of extensible programming is defining functions. Python allows mandatory and optional arguments, keyword arguments, and even arbitrary argument lists. Whether you're new to programming or an experienced developer, it's easy to learn and use Python. Python can be easy to pick up whether you're a first-time programmer or you're experienced with other languages. The following pages are a useful first step to get on your way to writing programs with Python! The community hosts conferences and meetups to collaborate on code, and much more. Python's documentation will help you along the way, and the mailing lists will keep you in touch. The Python Package Index (PyPI) hosts thousands of third-party modules for Python. Both Python's standard library and the community-contributed modules allow for endless possibilities.Starting Price: Free -
15
Qwen
Alibaba
Qwen is a powerful, free AI assistant built on the advanced Qwen model series, designed to help anyone with creativity, research, problem-solving, and everyday tasks. While Qwen Chat is the main interface for most users, Qwen itself powers a broad range of intelligent capabilities including image generation, deep research, website creation, advanced reasoning, and context-aware search. Its multimodal intelligence enables Qwen to understand and process text, images, audio, and video simultaneously for richer insights. Qwen is available on web, desktop, and mobile, ensuring seamless access across all devices. For developers, the Qwen API provides OpenAI-compatible endpoints, making integration simple and allowing Qwen’s intelligence to power apps, services, and automation. Whether you're chatting through Qwen Chat or building with the Qwen API, Qwen delivers fast, flexible, and highly capable AI support.Starting Price: Free -
16
OpenCode
Anomaly Innovations
OpenCode is the AI coding agent purpose-built for the terminal. It delivers a responsive, themeable terminal UI that feels native while streamlining your workflow. With LSP auto-loading, it ensures the right language servers are always available for accurate, context-aware coding support. Developers can spin up multiple AI agents in parallel sessions on the same project, maximizing productivity. Shareable links make it easy to reference, debug, or collaborate across sessions. Supporting Claude Pro and 75+ LLM providers via Models.dev, OpenCode gives you full freedom to choose your coding companion.Starting Price: Free -
17
Anthropic
Anthropic
Anthropic is an AI research and technology company focused on building safe, reliable, and advanced artificial intelligence systems. It is best known for developing the Claude family of AI models, designed for reasoning, coding, and enterprise applications. The company emphasizes AI safety and alignment, ensuring that its systems behave predictably and responsibly. Anthropic’s models are used across industries for tasks such as content creation, analysis, and automation. It develops both consumer-facing tools and enterprise AI solutions through APIs and integrations. The company collaborates with major cloud providers to make its models widely accessible. Anthropic also invests heavily in research to improve AI transparency and controllability. Overall, it aims to create powerful AI systems that are both useful and trustworthy. -
18
Zed
Zed Industries
Zed is a next-generation code editor designed for high-performance collaboration with humans and AI. Written from scratch in Rust to efficiently leverage multiple CPU cores and your GPU. Integrate upcoming LLMs into your workflow to generate, transform, and analyze code. Chat with teammates, write notes together, and share your screen and project. Multibuffers compose excerpts from across the codebase in one editable surface. Evaluate code inline via Jupyter runtimes and collaboratively edit notebooks. Support for many languages via Tree-sitter, WebAssembly, and the Language Server Protocol. Fast native terminal tightly integrates with Zed's language-aware task runner and AI capabilities. First-class modal editing via Vim bindings, including features like text objects and marks. Zed is built by a global community of thousands of developers. Boost your Zed experience by choosing from hundreds of extensions that broaden language support, offer different themes, and more.Starting Price: Free -
19
Hugging Face
Hugging Face
Hugging Face is a leading platform for AI and machine learning, offering a vast hub for models, datasets, and tools for natural language processing (NLP) and beyond. The platform supports a wide range of applications, from text, image, and audio to 3D data analysis. Hugging Face fosters collaboration among researchers, developers, and companies by providing open-source tools like Transformers, Diffusers, and Tokenizers. It enables users to build, share, and access pre-trained models, accelerating AI development for a variety of industries.Starting Price: $9 per month -
20
Go
Golang
With a strong ecosystem of tools and APIs on major cloud providers, it is easier than ever to build services with Go. With popular open source packages and a robust standard library, use Go to create fast and elegant CLIs. With enhanced memory performance and support for several IDEs, Go powers fast and scalable web applications. With fast build times, lean syntax, an automatic formatter and doc generator, Go is built to support both DevOps and SRE. Everything there is to know about Go. Get started on a new project or brush up for your existing Go code. An interactive introduction to Go in three sections. Each section concludes with a few exercises so you can practice what you've learned. The Playground allows anyone with a web browser to write Go code that we immediately compile, link, and run on our servers.Starting Price: Free -
21
Ollama
Ollama
Ollama is an innovative platform that focuses on providing AI-powered tools and services, designed to make it easier for users to interact with and build AI-driven applications. Run AI models locally. By offering a range of solutions, including natural language processing models and customizable AI features, Ollama empowers developers, businesses, and organizations to integrate advanced machine learning technologies into their workflows. With an emphasis on usability and accessibility, Ollama strives to simplify the process of working with AI, making it an appealing option for those looking to harness the potential of artificial intelligence in their projects.Starting Price: Free -
22
Kimi
Moonshot AI
Kimi is an intelligent assistant with a large "memory" that can read a 200,000-word novel in one go and surf the Internet. Kimi can understand and process long documents, helping you quickly summarize analysis reports, financial reports, etc., saving time in reading and organizing. When preparing for exams or researching new fields, Kimi can help you understand and summarize large amounts of textbooks or professional papers. If you work in programming or technology, Kimi can help you reproduce code or provide technical solutions based on your code or pseudocode from your paper. Kimi has significant advantages in Chinese and can handle multi-language documents, helping you communicate and understand more efficiently in international work. Kimi Chat can also play your favorite game characters, have interesting conversations with you, and provide entertainment and relaxation.Starting Price: Free -
23
Fireworks AI
Fireworks AI
Fireworks partners with the world's leading generative AI researchers to serve the best models, at the fastest speeds. Independently benchmarked to have the top speed of all inference providers. Use powerful models curated by Fireworks or our in-house trained multi-modal and function-calling models. Fireworks is the 2nd most used open-source model provider and also generates over 1M images/day. Our OpenAI-compatible API makes it easy to start building with Fireworks. Get dedicated deployments for your models to ensure uptime and speed. Fireworks is proudly compliant with HIPAA and SOC2 and offers secure VPC and VPN connectivity. Meet your needs with data privacy - own your data and your models. Serverless models are hosted by Fireworks, there's no need to configure hardware or deploy models. Fireworks.ai is a lightning-fast inference platform that helps you serve generative AI models.Starting Price: $0.20 per 1M tokens -
24
GitLab Duo
GitLab
Ship more secure software faster with AI throughout the entire software development lifecycle GitLab Duo is an AI-powered assistant integrated into the GitLab platform, designed to enhance software development productivity and collaboration. It provides intelligent code suggestions, helps refactor and debug code, and streamlines workflows by automating repetitive tasks. GitLab Duo also supports natural language queries for project management and development insights, enabling developers to interact with their repositories more intuitively. Integrated with popular IDEs like JetBrains and Neovim, GitLab Duo ensures seamless assistance within developers' preferred environments. By combining AI-driven efficiency with GitLab’s robust DevSecOps features, GitLab Duo empowers teams to deliver high-quality software faster and more collaboratively.Starting Price: $19/user/month -
25
Vercel AI Gateway
Vercel
Vercel AI Gateway is a unified AI infrastructure platform that allows developers to access, manage, and route requests across hundreds of AI models and providers through a single API interface. Built as part of the Vercel AI ecosystem, the platform supports text, image, and video generation models from providers such as OpenAI, Anthropic, xAI, and others while simplifying authentication, billing, observability, and failover management. Developers can use one API key and centralized dashboard to integrate multiple AI providers into applications without managing separate provider accounts or infrastructure. The platform also includes built-in routing, automatic failovers, usage tracking, unified billing, and compatibility with SDKs such as the Vercel AI SDK, enabling faster development and more resilient AI-powered applications. -
26
LiteLLM
LiteLLM
LiteLLM is a versatile platform designed to streamline interactions with over 100 Large Language Models (LLMs) through a unified interface. It offers both a Proxy Server (LLM Gateway) and a Python SDK, enabling developers to integrate various LLMs seamlessly into their applications. The Proxy Server facilitates centralized management, allowing for load balancing, cost tracking across projects, and consistent input/output formatting compatible with OpenAI standards. This setup supports multiple providers. It ensures robust observability by generating unique call IDs for each request, aiding in precise tracking and logging across systems. Developers can leverage pre-defined callbacks to log data using various tools. For enterprise users, LiteLLM offers advanced features like Single Sign-On (SSO), user management, and professional support through dedicated channels like Discord and Slack.Starting Price: Free -
27
Gemini CLI
Google
Gemini CLI is a free, open-source AI agent that integrates Gemini’s powerful AI capabilities directly into developers’ command line terminals. It offers fast, lightweight access to Gemini 3 Pro, enabling developers to generate code, solve problems, and manage tasks using natural language prompts. The CLI supports up to 60 model requests per minute and 1,000 requests per day at no cost, with additional paid options for professionals requiring higher usage. Gemini CLI includes advanced features like Google Search grounding for real-time web context, prompt customization, and automation within scripts. It is fully extensible and open source, welcoming community contributions via GitHub. Designed to enhance workflow efficiency, Gemini CLI brings AI-powered coding assistance to the terminal environment.Starting Price: Free -
28
Xiaomi MiMo
Xiaomi Technology
The Xiaomi MiMo API open platform is a developer-oriented interface for accessing and integrating Xiaomi’s MiMo family of AI models, including reasoning and language models such as MiMo-V2-Flash, into applications and services through standardized APIs and cloud endpoints, enabling developers to build AI-enabled features like conversational agents, reasoning workflows, code assistance, and search-augmented tasks without managing model infrastructure themselves. It offers REST-style API access with authentication, request signing, and structured responses so software can send prompts and receive generated text or processed outputs programmatically, and it supports common operations like text generation, prompt handling, and inference over MiMo models. By providing documentation and onboarding tools, the open platform lets teams integrate Xiaomi’s latest open source large language models, which leverage Mixture-of-Experts (MoE) architectures.Starting Price: Free -
29
Alibaba AI Coding Plan
Alibaba Cloud
Alibaba Cloud’s AI Scene Coding campaign introduces a cloud-based development environment designed to help developers write, test, and deploy software faster using advanced AI coding models. It provides access to powerful models such as Qwen3-Coder-Plus and integrates with popular developer tools, including Cline, Claude Code, Qwen Code, and OpenClaw, allowing engineers to use their preferred coding interfaces while leveraging Alibaba Cloud’s AI infrastructure. It is built to streamline software development by combining large language models with cloud computing resources so developers can generate code, analyze projects, and automate development workflows from a unified environment. These AI models are capable of understanding prompts, writing code, debugging programs, and assisting with complex development tasks, allowing applications to be built in minutes rather than through traditional manual coding cycles.Starting Price: $3 per month -
30
ZenMux
ZenMux
ZenMux is an enterprise-grade AI gateway that provides a unified interface for accessing and orchestrating multiple leading large language models through a single account and API. Instead of managing separate providers, keys, and integrations, users can connect to top models from companies like OpenAI, Anthropic, Google, and others through one consistent system, fully compatible with existing protocols such as OpenAI and Gemini Enterprise Agent Platform. It eliminates the complexity of multi-provider setups by offering intelligent routing that automatically selects the most suitable model for each task based on cost, performance, and reliability. ZenMux emphasizes direct access to official providers and authorized cloud partners, ensuring that all outputs come from authentic, high-quality sources without proxies or degraded versions. One of its defining features is a built-in AI model insurance, which detects issues.Starting Price: $20 per month -
31
MiniMax Code
MiniMax
MiniMax Code brings the agent experience to Mac and Windows, where users can pick a workspace, describe what they need, and let the agent read, analyze, batch-process, and act on local files or remote tasks. Instead of manually managing every step, users define the goal and MiniMax Code builds the right agent team, soloing simple tasks and teaming up on complex work. The agent remembers habits, preferences, projects, and repeated workflows through persistent memory, generating skills over time so users do not have to explain the same context again. It is designed to work where people already chat, handling local files, remote work, schedules, teams, memories, and skills directly from the conversation. The product supports advanced coding and agentic workflows, including multi-file edits, test-validated repairs, long-horizon tool chains, planning, document summarization, creative writing, research, full-stack development, reports, presentations, web development, and everyday Q&A.Starting Price: $20 per month -
32
Wafer
Wafer
Wafer delivers the fastest open source LLMs for enterprise through serverless and dedicated inference built for production AI workloads. Its serverless inference gives teams access to top open models with no infrastructure, no deployment overhead, and fast APIs, including GLM-5.2-Fast for low-latency inference with EAGLE speculative decoding and a per-stream throughput SLA, GLM-5.2 as a flagship model with stronger coding and reasoning capabilities, and more. Wafer’s technology uses agents that optimize inference across the stack, identifying and enhancing bottlenecks in orchestration, algorithms, serving engines, GPU kernels, and diverse hardware. It profiles the stack to see whether latency or throughput comes from scheduling, decoding, kernels, memory pressure, or hardware fit, then tries many paths and ships the measured winner. Instead of relying on a single switch or heuristic, Wafer searches model, engine, kernel, and hardware combinations.Starting Price: Free -
33
JavaScript
JavaScript
JavaScript is a scripting language and programming language for the web that enables developers to build dynamic elements on the web. Over 97% of the websites in the world use client-side JavaScript. JavaScript is one of the most important scripting languages on the web. Strings in JavaScript are contained within a pair of either single quotation marks '' or double quotation marks "". Both quotes represent Strings but be sure to choose one and STICK WITH IT. If you start with a single quote, you need to end with a single quote. There are pros and cons to using both IE single quotes tend to make it easier to write HTML within Javascript as you don’t have to escape the line with a double quote. Let’s say you’re trying to use quotation marks inside a string. You’ll need to use opposite quotation marks inside and outside of JavaScript single or double quotes.Starting Price: Free -
34
Venice.ai
Venice.ai
Venice is a permissionless alternative to the popular AI apps. It is an alternative needed because those apps violate your privacy and censor the AI’s responses. Venice utilizes leading open source AI technology to deliver uncensored, unbiased machine intelligence, and we do it while preserving your privacy. Search the world’s information, have rich conversations, analyze documents, and create images and art at the push of a button. Venice is super simple, with no downloads. There are no installations of anything. For basic use, no account is necessary and the service is free, you can use it right now. The Venice front-end is a clean web app that should feel familiar to anyone who has used generative AI. Your conversation history is stored only in your browser. Venice does not store or log prompt or model responses on our servers. Several different AI models for different results and styles. Ability to save and organize past conversations for future reference.Starting Price: $49 per year -
35
Kimi Code
Kimi
Kimi Code is a developer-centric AI coding agent included as part of the Kimi Membership, designed to boost productivity by automating software development tasks and seamlessly integrating into popular workflows. It offers high-performance CLI tools and supports integration with terminal environments and IDEs like VS Code, allowing developers to read and edit code, answer questions about codebases, generate features, fix bugs, refactor, and verify changes through a natural-language interface. With a dedicated console showing real-time logs, request quotas, and pace controls, the platform lets users configure API keys for use in tools such as Kimi CLI, Claude Code, and Roo Code, enabling faster coding with AI assistance within commits and existing workflows. In VS Code, Kimi Code features a native chat panel with slash commands, file and folder references, diff views, and integration with external tools for context-aware coding support.Starting Price: $15 per month -
36
GLM Coding Plan
Z.ai
Z.ai DevPack (GLM Coding Plan) is a subscription-based AI coding platform designed to integrate high-performance language models into existing development tools, enabling a faster, more intelligent, and stable coding workflow. It provides access to advanced models such as GLM-4.7 and GLM-5, which can be used across popular AI coding environments like Claude Code, Cline, OpenCode, and other tools that support OpenAI-compatible APIs. The system allows developers to use natural language programming to describe requirements and automatically generate code, debug issues, and execute tasks, while also offering real-time, context-aware code completion to improve productivity. It includes intelligent debugging and repair capabilities, enabling models to analyze errors, suggest fixes, and maintain smooth execution throughout development. DevPack is designed with a structured interface that AI agents can understand, allowing seamless interaction between tools and models. -
37
Baidu Qianfan
Baidu
One-stop enterprise-level large model platform, providing advanced generation AI production and application process development toolchain. Provides data labels, model training and evaluation, reasoning services, and application-integrated comprehensive functional services. Training and reasoning performance greatly improved. Perfect authentication and flow control safety mechanism, self-proclaimed content review and sensitive word filtering, multi-safety mechanism escort enterprise application. Extensive and mature practice landed, building the next generation of smart applications. Online quick test service effect, convenient smart cloud reasoning service. One-stop model customization, full process visualization operation. Large model of knowledge enhancement, unified paradigm to support multi-category downstream tasks. An advanced parallel strategy that supports large model training, compression, and deployment. -
38
Together AI
Together AI
Together AI provides an AI-native cloud platform built to accelerate training, fine-tuning, and inference on high-performance GPU clusters. Engineered for massive scale, the platform supports workloads that process trillions of tokens without performance drops. Together AI delivers industry-leading cost efficiency by optimizing hardware, scheduling, and inference techniques, lowering total cost of ownership for demanding AI workloads. With deep research expertise, the company brings cutting-edge models, hardware, and runtime innovations—like ATLAS runtime-learning accelerators—directly into production environments. Its full-stack ecosystem includes a model library, inference APIs, fine-tuning capabilities, pre-training support, and instant GPU clusters. Designed for AI-native teams, Together AI helps organizations build and deploy advanced applications faster and more affordably.Starting Price: $0.0001 per 1k tokens -
39
Groq
Groq
GroqCloud is a high-performance AI inference platform built specifically for developers who need speed, scale, and predictable costs. It delivers ultra-fast responses for leading generative AI models across text, audio, and vision workloads. Powered by Groq’s purpose-built LPU (Language Processing Unit), the platform is designed for inference from the ground up, not adapted from training hardware. GroqCloud supports popular LLMs, speech-to-text, text-to-speech, and image-to-text models through industry-standard APIs. Developers can start for free and scale seamlessly as usage grows, with clear usage-based pricing. The platform is available in public, private, or co-cloud deployments to match different security and performance needs. GroqCloud combines consistent low latency with enterprise-grade reliability. -
40
LM Studio
LM Studio
Use models through the in-app Chat UI or an OpenAI-compatible local server. Minimum requirements: M1/M2/M3 Mac, or a Windows PC with a processor that supports AVX2. Linux is available in beta. One of the main reasons for using a local LLM is privacy, and LM Studio is designed for that. Your data remains private and local to your machine. You can use LLMs you load within LM Studio via an API server running on localhost. -
41
vLLM
vLLM
vLLM is a high-performance library designed to facilitate efficient inference and serving of Large Language Models (LLMs). Originally developed in the Sky Computing Lab at UC Berkeley, vLLM has evolved into a community-driven project with contributions from both academia and industry. It offers state-of-the-art serving throughput by efficiently managing attention key and value memory through its PagedAttention mechanism. It supports continuous batching of incoming requests and utilizes optimized CUDA kernels, including integration with FlashAttention and FlashInfer, to enhance model execution speed. Additionally, vLLM provides quantization support for GPTQ, AWQ, INT4, INT8, and FP8, as well as speculative decoding capabilities. Users benefit from seamless integration with popular Hugging Face models, support for various decoding algorithms such as parallel sampling and beam search, and compatibility with NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs, and more. -
42
Cerebras
Cerebras
We’ve built the fastest AI accelerator, based on the largest processor in the industry, and made it easy to use. With Cerebras, blazing fast training, ultra low latency inference, and record-breaking time-to-solution enable you to achieve your most ambitious AI goals. How ambitious? We make it not just possible, but easy to continuously train language models with billions or even trillions of parameters – with near-perfect scaling from a single CS-2 system to massive Cerebras Wafer-Scale Clusters such as Andromeda, one of the largest AI supercomputers ever built.
- Previous
- You're on page 1
- Next