Business Software for NVIDIA DGX Cloud Serverless Inference

Top Software that integrates with NVIDIA DGX Cloud Serverless Inference as of September 2025

Compare business software, products, and services to find the best solution for your business or organization. The matching products and solutions are listed below.

  • 1
    Google Cloud Platform
    Google Cloud is a cloud-based service that allows you to create anything from simple websites to complex applications for businesses of all sizes. New customers get $300 in free credits to run, test, and deploy workloads. All customers can use 25+ products for free, up to monthly usage limits. Use Google's core infrastructure, data analytics & machine learning. Secure and fully featured for all enterprises. Tap into big data to find answers faster and build better products. Grow from prototype to production to planet-scale, without having to think about capacity, reliability or performance. From virtual machines with proven price/performance advantages to a fully managed app development platform. Scalable, resilient, high performance object storage and databases for your applications. State-of-the-art software-defined networking products on Google’s private fiber network. Fully managed data warehousing, batch and stream processing, data exploration, Hadoop/Spark, and messaging.
    Starting Price: Free ($300 in free credits)
  • 2
    Amazon Web Services (AWS)
    Whether you're looking for compute power, database storage, content delivery, or other functionality, AWS has the services to help you build sophisticated applications with increased flexibility, scalability, and reliability. Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform, offering over 175 fully featured services from data centers globally. Millions of customers, including the fastest-growing startups, largest enterprises, and leading government agencies, are using AWS to lower costs, become more agile, and innovate faster. AWS has significantly more services, and more features within those services, than any other cloud provider, from infrastructure technologies like compute, storage, and databases to emerging technologies such as machine learning and artificial intelligence, data lakes and analytics, and the Internet of Things. This makes it faster, easier, and more cost-effective to move your existing applications to the cloud.
  • 3
    Microsoft Azure
    Microsoft's Azure is a cloud computing platform that allows for rapid and secure application development, testing and management. Azure. Invent with purpose. Turn ideas into solutions with more than 100 services to build, deploy, and manage applications—in the cloud, on-premises, and at the edge—using the tools and frameworks of your choice. Continuous innovation from Microsoft supports your development today, and your product visions for tomorrow. With a commitment to open source, and support for all languages and frameworks, build how you want, and deploy where you want to. On-premises, in the cloud, and at the edge—we’ll meet you where you are. Integrate and manage your environments with services designed for hybrid cloud. Get security from the ground up, backed by a team of experts, and proactive compliance trusted by enterprises, governments, and startups. The cloud you can trust, with the numbers to prove it.
  • 4
    Splunk Cloud Platform
    Turn data into answers with Splunk deployed and managed securely, reliably and scalably as a service. With your IT backend managed by our Splunk experts, you can focus on acting on your data. Splunk-provisioned and managed infrastructure delivers a turnkey, cloud-based data analytics solution. Go live in as little as two days. Managed software upgrades ensure you always have the latest functionality. Tap into the value of your data in days with fewer requirements to turn data into action. Splunk Cloud meets the FedRAMP security standards, and helps U.S. federal agencies and their partners drive confident decisions and decisive actions at mission speeds. Drive productivity and contextual insights with Splunk’s mobile apps, augmented reality and natural language capabilities. Extend the utility of your Splunk solutions to any location with a simple phrase or the tap of a finger. From infrastructure management to data compliance, Splunk Cloud is built to scale.
  • 5
    Helm
    Helm is a software synthesizer; you use it to create electronic music on your computer. It runs on GNU/Linux, macOS, and Windows, either as a standalone synthesizer or as an LV2, VST, VST3, or AU plugin, and comes in both 32-bit and 64-bit versions. Helm is free as in freedom: you control this software, it doesn't control you. You are free to run Helm anywhere without the pains of DRM, and you can study and change the source code and redistribute exact or modified copies of Helm. In terms of money, Helm is pay what you want, so you are free to pay nothing. Any sound that comes out of Helm belongs to the person who played it; you are the copyright holder to any sound you create with Helm. You can turn some modules on and off using the small power buttons in their top-left corners. The SUB module is one of the three sound producers in Helm; it controls a single oscillator that by default plays one octave below the currently played note.
  • 6
    Nebius
    Training-ready platform with NVIDIA® H100 Tensor Core GPUs, competitive pricing, and dedicated support. Built for large-scale ML workloads: get the most out of multi-host training on thousands of H100 GPUs connected in a full mesh over the latest InfiniBand network at up to 3.2 Tb/s per host. Best value for money: save at least 50% on your GPU compute compared to major public cloud providers*, and save even more with reserved capacity and larger GPU volumes. Onboarding assistance: a dedicated support engineer is guaranteed to ensure seamless platform adoption, with help optimizing your infrastructure and deploying Kubernetes. Fully managed Kubernetes: simplify the deployment, scaling, and management of ML frameworks on Kubernetes, and use Managed Kubernetes for multi-node GPU training; a minimal GPU-request sketch using the Kubernetes Python client follows below. Marketplace with ML frameworks: explore the Marketplace of ML-focused libraries, applications, frameworks, and tools to streamline your model training. Easy to use: all new users get a 1-month trial period.
    Starting Price: $2.66/hour
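    As a rough illustration of the managed Kubernetes workflow described above, the sketch below uses the official Kubernetes Python client to request GPUs for a training pod. This is a minimal sketch under stated assumptions: the namespace, container image, and GPU count are illustrative placeholders rather than Nebius-specific values.

```python
# Minimal sketch: requesting GPUs for a training pod on a managed
# Kubernetes cluster via the official `kubernetes` Python client.
# The namespace, container image tag, and GPU count are illustrative
# placeholders, not Nebius-specific values.
from kubernetes import client, config

config.load_kube_config()  # assumes a kubeconfig for the managed cluster

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="h100-training-worker"),
    spec=client.V1PodSpec(
        restart_policy="Never",
        containers=[
            client.V1Container(
                name="trainer",
                image="nvcr.io/nvidia/pytorch:24.05-py3",  # hypothetical image tag
                command=["python", "train.py"],
                resources=client.V1ResourceRequirements(
                    limits={"nvidia.com/gpu": "8"}  # request a full 8-GPU host
                ),
            )
        ],
    ),
)

client.CoreV1Api().create_namespaced_pod(namespace="default", body=pod)
```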
  • 7
    NVIDIA NIM
    Explore the latest optimized AI models, connect AI agents to data with NVIDIA NeMo, and deploy anywhere with NVIDIA NIM microservices. NVIDIA NIM is a set of easy-to-use inference microservices that facilitate the deployment of foundation models across any cloud or data center, ensuring data security and streamlined AI integration. Additionally, NVIDIA AI provides access to the Deep Learning Institute (DLI), offering technical training to gain in-demand skills, hands-on experience, and expert knowledge in AI, data science, and accelerated computing. AI models generate responses and outputs based on complex algorithms and machine learning techniques, and those responses or outputs may be inaccurate, harmful, biased, or indecent. By testing this model, you assume the risk of any harm caused by any response or output of the model. Please do not upload any confidential information or personal data unless expressly permitted. Your use is logged for security purposes.
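    As a hedged illustration of how a deployed NIM microservice is typically queried, the sketch below posts a chat completion request to a self-hosted endpoint; NIM microservices generally expose an OpenAI-compatible HTTP API. The base URL, port, and model identifier here are assumptions, not values taken from the listing above.

```python
# Minimal sketch: querying a self-hosted NVIDIA NIM microservice through its
# OpenAI-compatible chat completions route. The base URL, port, and model
# identifier are assumptions for a local deployment, not guaranteed values.
import requests

NIM_URL = "http://localhost:8000/v1/chat/completions"  # assumed local endpoint

payload = {
    "model": "meta/llama-3.1-8b-instruct",  # hypothetical model identifier
    "messages": [
        {"role": "user", "content": "Summarize what a NIM microservice does."}
    ],
    "max_tokens": 128,
}

response = requests.post(NIM_URL, json=payload, timeout=60)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```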
  • 8
    Oracle Cloud Infrastructure
    Oracle Cloud Infrastructure supports traditional workloads and delivers modern cloud development tools. It is architected to detect and defend against modern threats, so you can innovate more. Combine low cost with high performance to lower your TCO. Oracle Cloud is a Generation 2 enterprise cloud that delivers powerful compute and networking performance and includes a comprehensive portfolio of infrastructure and platform cloud services. Built from the ground up to meet the needs of mission-critical applications, Oracle Cloud supports all legacy workloads while delivering modern cloud development tools, enabling enterprises to bring their past forward as they build their future. Our Generation 2 Cloud is the only one built to run Oracle Autonomous Database, the industry's first and only self-driving database. Oracle Cloud offers a comprehensive cloud computing portfolio, from application development and business analytics to data management, integration, security, AI & blockchain.
  • 9
    Yotta
    Yotta's Alloy takes asset-rich environments into the future with connected asset management. Alloy is the only connected asset management solution able to seamlessly connect people, systems, and assets. It is scalable across multiple asset types and integrates with all systems through powerful APIs, covering any of the physical and organisational structures and facilities you need to operate your enterprise. As data continues to grow, we can help you manage your data so you can make better, more informed decisions, and we make sure your people have the information to make the right decisions at the right time. Alloy is a flexible, adaptable, fully mobile asset management platform that provides operational insights about your grounds management strategy.
  • 10
    CoreWeave

    CoreWeave is a cloud infrastructure provider specializing in GPU-based compute solutions tailored for AI workloads. The platform offers scalable, high-performance GPU clusters that optimize the training and inference of AI models, making it ideal for industries like machine learning, visual effects (VFX), and high-performance computing (HPC). CoreWeave provides flexible storage, networking, and managed services to support AI-driven businesses, with a focus on reliability, cost efficiency, and enterprise-grade security. The platform is used by AI labs, research organizations, and businesses to accelerate their AI innovations.
  • 11
    NVIDIA DGX Cloud
    NVIDIA DGX Cloud offers a fully managed, end-to-end AI platform that leverages the power of NVIDIA’s advanced hardware and cloud computing services. This platform allows businesses and organizations to scale AI workloads seamlessly, providing tools for machine learning, deep learning, and high-performance computing (HPC). DGX Cloud integrates seamlessly with leading cloud providers, delivering the performance and flexibility required to handle the most demanding AI applications. This service is ideal for businesses looking to enhance their AI capabilities without the need to manage physical infrastructure.
  • 12
    NVIDIA AI Foundations
    Impacting virtually every industry, generative AI unlocks a new frontier of opportunities for knowledge and creative workers to solve today's most important challenges. NVIDIA is powering generative AI through an impressive suite of cloud services, pre-trained foundation models, as well as cutting-edge frameworks, optimized inference engines, and APIs to bring intelligence to your enterprise applications. NVIDIA AI Foundations is a set of cloud services that advance enterprise-level generative AI and enable customization across use cases in areas such as text (NVIDIA NeMo™), visual content (NVIDIA Picasso), and biology (NVIDIA BioNeMo™). Unleash the full potential with NeMo, Picasso, and BioNeMo cloud services, powered by NVIDIA DGX™ Cloud, the AI supercomputer. Example use cases include marketing copy, storyline creation, and global translation in many languages, as well as summarization for news, email, meeting minutes, and information synthesis.
  • 13
    NVIDIA Cloud Functions
    NVIDIA Cloud Functions (NVCF) is a serverless API designed to deploy and manage AI workloads on GPUs, providing security, scalability, and reliability. It supports HTTP polling, HTTP streaming, and gRPC protocols for accessing workloads. Cloud Functions is primarily suited for shorter-running, preemptable workloads such as inferencing and fine-tuning. It supports two function types: "Container" and "Helm Chart", allowing users to define functions based on their specific requirements. Workloads are ephemeral and preemptable, so users should ensure they save their work appropriately. Models, containers, helm charts, and additional resources are hosted and pulled from the NGC Private Registry. To get started, users can follow the functions quickstart guide, which provides an end-to-end workflow for creating and deploying a container-based function using the fastapi_echo_sample container.
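    To make the invocation flow above concrete, here is a hedged sketch of calling a deployed Cloud Function over HTTP and polling while the request is pending. The endpoint paths, the NVCF-REQID header, and the function ID are assumptions based on the publicly documented NVCF REST API and should be checked against the current documentation.

```python
# Rough sketch: invoking an NVCF function over HTTP and polling for the
# result. The endpoint paths, the NVCF-REQID header, and the function ID
# are assumptions based on public NVCF documentation; verify before use.
import os
import time

import requests

API_KEY = os.environ["NVCF_API_KEY"]  # assumed environment variable
FUNCTION_ID = "00000000-0000-0000-0000-000000000000"  # placeholder function ID

INVOKE_URL = f"https://api.nvcf.nvidia.com/v2/nvcf/pexec/functions/{FUNCTION_ID}"
STATUS_URL = "https://api.nvcf.nvidia.com/v2/nvcf/pexec/status/"

headers = {"Authorization": f"Bearer {API_KEY}", "Accept": "application/json"}

# Send whatever payload the deployed container expects; an echo-style
# payload is used here purely for illustration.
resp = requests.post(INVOKE_URL, headers=headers, json={"message": "hello"}, timeout=60)

# A 202 response means the workload is still running; poll the status
# endpoint using the request ID returned in the NVCF-REQID header.
while resp.status_code == 202:
    request_id = resp.headers["NVCF-REQID"]
    time.sleep(1)
    resp = requests.get(STATUS_URL + request_id, headers=headers, timeout=60)

resp.raise_for_status()
print(resp.json())
```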
  • 14
    Llama
    Llama (Large Language Model Meta AI) is a state-of-the-art foundational large language model designed to help researchers advance their work in the subfield of AI focused on large language models. Smaller, more performant models such as Llama enable others in the research community who don’t have access to large amounts of infrastructure to study these models, further democratizing access in this important, fast-changing field. Training smaller foundation models like Llama is desirable in the large language model space because it requires far less computing power and resources to test new approaches, validate others’ work, and explore new use cases. Foundation models train on a large set of unlabeled data, which makes them ideal for fine-tuning for a variety of tasks; a minimal loading sketch follows below. We are making Llama available at several sizes (7B, 13B, 33B, and 65B parameters) and also sharing a Llama model card that details how we built the model in keeping with our approach to Responsible AI practices.
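    Because the entry above emphasizes fine-tuning foundation models for downstream tasks, the sketch below shows one common way to load a Llama-family checkpoint for text generation with the Hugging Face transformers library. The checkpoint name is a placeholder, access to the gated weights is assumed, and this is only one of several ways to run or fine-tune Llama.

```python
# Minimal sketch: loading a Llama-family checkpoint for generation with
# Hugging Face transformers. The checkpoint name is a placeholder and
# assumes access to the gated weights; this is not the only way to run
# or fine-tune Llama.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "meta-llama/Llama-2-7b-hf"  # placeholder; substitute your model

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto")

inputs = tokenizer("Foundation models are useful because", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```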