Showing 261 open source projects for "deploy"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Patroni

    Patroni

    A template for PostgreSQL high availability with Etcd, Consul, etc.

    Patroni is a template for you to create your own customized, high-availability solution using Python and - for maximum accessibility - a distributed configuration store like ZooKeeper, etcd, Consul or Kubernetes. Database engineers, DBAs, DevOps engineers, and SREs who are looking to quickly deploy HA PostgreSQL in the datacenter-or anywhere else-will hopefully find it useful. We call Patroni a "template" because it is far from being a one-size-fits-all or plug-and-play replication system. It will have its own caveats. Use wisely. Currently supported PostgreSQL versions 9.3 to 14. Patroni originated as a fork of Governor, the project from Compose. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Cognita

    Cognita

    Open source RAG framework for building scalable modular AI apps

    Cognita is an open source framework designed to help developers build, organize, and deploy Retrieval-Augmented Generation (RAG) applications in a structured and production-ready way. It addresses the gap between quick experimentation in notebooks and the complexity of deploying scalable AI systems by introducing a modular and API-driven architecture. Cognita provides reusable components such as parsers, data loaders, embedders, retrievers, and query controllers, allowing teams to customize each stage of the RAG pipeline independently. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    web2py

    web2py

    Free and open source full-stack enterprise framework

    ...Everything you need in one package including fast multi-threaded web server, SQL database and web-based interface. No third-party dependencies but works with third-party tools. Create, modify, deploy and manage applications from anywhere using your browser. One web2py instance can run multiple web sites using different databases. Try the interactive demo. Start with some quick examples, then read the manual and the Sphinx docs, watch videos, and join a user group for discussion. Take advantage of the layouts, plugins, appliances, and recipes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Mistral Inference

    Mistral Inference

    Official inference library for Mistral models

    Open and portable generative AI for devs and businesses. We release open-weight models for everyone to customize and deploy where they want it. Our super-efficient model Mistral Nemo is available under Apache 2.0, while Mistral Large 2 is available through both a free non-commercial license, and a commercial license.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    TFX

    TFX

    TFX is an end-to-end platform for deploying production ML pipelines

    TensorFlow Extended (TFX) is a Google-production-scale machine learning platform based on TensorFlow. It provides a configuration framework to express ML pipelines consisting of TFX components. TFX pipelines can be orchestrated using Apache Airflow and Kubeflow Pipelines. Both the components themselves and the integrations with orchestration systems can be extended. TFX components interact with an ML Metadata backend that keeps a record of component runs, input and output artifacts, and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    CoPaw

    CoPaw

    Your Personal AI Assistant; easy to install, deploy on local or coud

    CoPaw is a personal AI assistant designed to run on your own machine or in the cloud, giving you full control over memory, models, and data. Built by the AgentScope team, it connects to multiple chat platforms—including DingTalk, Feishu, QQ, Discord, iMessage, and more—through a single unified assistant. CoPaw supports both cloud-based LLM providers and fully local models such as llama.cpp, MLX, and Ollama, allowing you to operate without API keys if preferred. It includes a browser-based...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    Agent Framework

    Agent Framework

    Framework for building, orchestrating, and deploying AI agents

    Microsoft Agent Framework is an open source framework designed to help developers build, orchestrate, and deploy AI agents and multi-agent systems. It provides a unified programming model that supports both Python and .NET implementations, allowing developers to create agent-driven applications in multiple programming environments. It includes tools and abstractions for constructing simple conversational agents as well as complex workflows where multiple agents collaborate to complete tasks. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    FAY

    FAY

    Framework for building AI-powered interactive digital humans and agent

    Fay is an open source framework designed to build and deploy interactive digital humans powered by large language models. It acts as a middleware layer that connects digital character technologies with conversational AI systems and business applications. Fay supports various types of digital humans, including 2.5D and 3D avatars, and can be integrated with applications running on mobile devices, PCs, web platforms, and embedded systems.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    LitGPT

    LitGPT

    20+ high-performance LLMs with recipes to pretrain, finetune at scale

    LitGPT is a collection of over 20 high-performance large language models (LLMs) accompanied by recipes to pretrain, finetune, and deploy them at scale. It provides implementations without abstractions, making it beginner-friendly while offering advanced features like flash attention and support for various precision levels. LitGPT is designed to run efficiently across multiple GPUs or TPUs, catering to both small-scale and large-scale deployments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    Prompt flow

    Prompt flow

    Build high-quality LLM apps

    Prompt flow is a suite of development tools designed to streamline the end-to-end development cycle of LLM-based AI applications, from ideation, prototyping, testing, and evaluation to production deployment and monitoring. It makes prompt engineering much easier and enables you to build LLM apps with production quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Wanwu AI Agent Platform

    Wanwu AI Agent Platform

    Enterprise AI agent platform for workflows, models, and RAG apps

    Wanwu is an enterprise-grade AI agent development platform designed to help organizations build and deploy intelligent applications at scale. It provides a multi-tenant environment that enables teams to create AI agents, orchestrate workflows, and implement retrieval-augmented generation systems within a unified framework. Wanwu integrates large language models with business process automation, allowing developers to design complex, production-ready AI solutions tailored to enterprise needs. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    GPUStack

    GPUStack

    Performance-optimized AI inference on your GPUs

    ...The platform supports GPUs from a wide range of vendors and can run on laptops, workstations, and servers across operating systems such as macOS, Windows, and Linux. It also enables developers to deploy models from common repositories like Hugging Face and access them through APIs similar to cloud-based AI services.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    TrendRadar

    TrendRadar

    AI-driven public opinion trend monitor with multi-platform aggregation

    ...The tool supports multiple alert modes—such as daily summaries, incremental change monitoring, and current rankings—and can push notifications through messaging platforms like Telegram, Slack, WeChat, DingTalk, and email. Users can deploy it quickly via Python and GitHub Actions, and it also supports RSS feeds and Docker deployment for flexible integration. By combining AI judgment with broad data ingestion, TrendRadar empowers individuals and teams to track public opinion, news cycles, and emerging topics without manual scanning of dozens of websites.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Writer Framework

    Writer Framework

    No-code in the front, Python in the back. An open-source framework

    ...Its architecture enforces a clear separation of concerns between frontend and backend, which improves maintainability and scalability as applications grow in complexity. The system is designed to support rapid prototyping, enabling developers to iterate on UI and backend logic independently and deploy changes quickly.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    TensorRT LLM

    TensorRT LLM

    TensorRT LLM provides users with an easy-to-use Python API

    TensorRT-LLM is an open-source high-performance inference library specifically designed to optimize and accelerate large language model deployment on NVIDIA GPUs. It provides a Python-based API built on top of PyTorch that allows developers to define, customize, and deploy LLMs efficiently across a variety of hardware configurations, from single GPUs to large multi-node clusters. The library focuses on maximizing throughput and minimizing latency through advanced techniques such as quantization, custom attention kernels, and optimized memory management strategies. It includes support for cutting-edge inference methods like speculative decoding and inflight batching, enabling real-time and large-scale AI applications. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Google Kubernetes Engine (GKE) Samples

    Google Kubernetes Engine (GKE) Samples

    Sample applications for Google Kubernetes Engine (GKE)

    Google Kubernetes Engine (GKE) Samples repository is a comprehensive collection of sample applications and reference implementations designed to demonstrate how to build, deploy, and manage workloads on Google Kubernetes Engine (GKE). It serves as a practical companion to official GKE tutorials, providing real, runnable code that illustrates how containerized applications are packaged, deployed, and scaled within Kubernetes clusters. The repository is organized into multiple categories such as AI and machine learning, autoscaling, networking, observability, security, and cost optimization, allowing developers to explore specific use cases and architectural patterns. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    A.I.G

    A.I.G

    Full-stack AI Red Teaming platform

    ...It brings together AI infrastructure vulnerability scanning, MCP server risk analysis, and jailbreak evaluation into a unified workflow so that enterprises and individuals can identify critical security issues without relying on external services. Users can deploy it via Docker or scripts to get a modern web UI that guides them through tasks like scanning third-party frameworks for known CVEs and experimenting with prompt security against attack vectors. The tool provides both a visual interface and a comprehensive API, making integration with internal security systems or CI/CD pipelines practical for ongoing risk management.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Weblate

    Weblate

    Web based localization tool with tight version control integration

    Weblate is a copylefted libre software web-based continuous localization system, used by over 2500 libre projects and companies in more than 165 countries. Copylefted libre software, used by over 2,500 libre software projects and companies in over 165 countries. Hosted service and standalone tool with tight version control integration. Simple and clean user interface, propagation of translations across components, quality checks and automatic linking to source files. There is infrastructure...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Triton Inference Server

    Triton Inference Server

    The Triton Inference Server provides an optimized cloud

    Triton Inference Server is an open-source inference serving software that streamlines AI inferencing. Triton enables teams to deploy any AI model from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more. Triton supports inference across cloud, data center, edge, and embedded devices on NVIDIA GPUs, x86 and ARM CPU, or AWS Inferentia. Triton delivers optimized performance for many query types, including real-time, batched, ensembles, and audio/video streaming. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    DeepPavlov

    DeepPavlov

    A library for deep learning end-to-end dialog systems and chatbots

    DeepPavlov makes it easy for beginners and experts to create dialogue systems. The best place to start is with user-friendly tutorials. They provide quick and convenient introduction on how to use DeepPavlov with complete, end-to-end examples. No installation needed. Guides explain the concepts and components of DeepPavlov. Follow step-by-step instructions to install, configure and extend DeepPavlov framework for your use case. DeepPavlov is an open-source framework for chatbots and virtual...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    agentic-stack

    agentic-stack

    One brain, many harnesses. Portable .agent/ folder

    agentic-stack is a framework or toolkit designed to build, orchestrate, and deploy AI agents in a structured and scalable way. It likely provides components for managing agent workflows, communication, and task execution across different systems. The project emphasizes modularity, enabling developers to assemble custom pipelines using various AI models, tools, and APIs. It may include abstractions for memory, planning, and tool usage, reflecting modern agentic AI design patterns. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    nbdev

    nbdev

    Create delightful software with Jupyter Notebooks

    nbdev is a notebook-driven development platform (by fast.ai/AnswerDotAI) enabling you to write code, tests, documentation, and deploy software, all from Jupyter Notebooks. It provides a unified literate programming workflow where you can tag notebook cells for export to Python modules, auto-generate documentation via Quarto (and host it on GitHub Pages), run tests embedded in notebooks, manage clean notebooks with Git-friendly metadata hooks, and seamlessly publish packages to PyPI/conda, all while keeping source and documentation in sync.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    smolagents

    smolagents

    Agents write python code to call tools and orchestrate other agents

    ...We provide our definition in this page, where you’ll also find tips for when to use them or not (spoilers: you’ll often be better off without agents). smolagents is a lightweight framework for building AI agents using large language models (LLMs). It simplifies the development of AI-driven applications by providing tools to create, train, and deploy language model-based agents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    AgentScope

    AgentScope

    Build and run agents you can see, understand and trust

    AgentScope is a production-ready agent framework designed to help developers build, deploy, and scale intelligent agentic applications. It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    stt

    stt

    Voice Recognition to Text Tool

    ...It leverages open-source speech models such as Faster-Whisper to recognize and transcribe human speech into plain text, structured JSON objects, or subtitle files with time codes, making it suitable for both personal and professional transcription tasks. The project is designed to be easy to deploy: you can run a local Python server that exposes an HTTP API for uploading audio/video files and retrieving transcriptions in different formats. It supports GPU acceleration if available, enabling faster processing on compatible hardware but still offers reliable performance on CPUs alone.
    Downloads: 0 This Week
    Last Update:
    See Project