Showing 852 open source projects for "ai framework"

  • 1
    Vision Agents

    Open Vision Agents by Stream. Build voice and vision agents quickly

    Vision Agents is an open-source Python framework for building real-time voice and video AI agents. It is designed for applications that need to watch, listen, understand, and respond with very low latency. The framework can combine vision models, speech models, LLMs, and real-time transport providers into one agent workflow. It supports use cases such as live coaching, telehealth, customer support, security monitoring, interactive video assistants, and voice-controlled tools. ...
    Downloads: 0 This Week
  • 2
    Jina-Serve

    Build multimodal AI applications with cloud-native stack

    Jina Serve is an open-source framework designed for building, deploying, and scaling AI services and machine learning pipelines in production environments. The framework allows developers to create microservices that expose machine learning models through APIs that communicate using protocols such as HTTP, gRPC, and WebSockets. It is built with a cloud-native architecture that supports deployment on local machines, containerized environments, or large orchestration platforms such as Kubernetes. ...
    Downloads: 0 This Week
  • 3
    Quint Code

    Structured reasoning framework for Claude Code, Gemini, and Cursor

    Quint Code is a structured reasoning and decision-support framework aimed at making AI-assisted software engineering and decision workflows more rigorous and auditable. It implements the First Principles Framework (FPF) to guide users and AI tools through hypothesis generation, logical verification, evidence gathering, and documented decision making, reducing reliance on ad hoc or “vibe” coding.
    Downloads: 0 This Week
  • 4
    Hunyuan3D-1

    A Unified Framework for Text-to-3D and Image-to-3D Generation

    Hunyuan3D-1 is an earlier release in Tencent Hunyuan's 3D generation line, a unified framework for text-to-3D and image-to-3D tasks. It combines shape generation and texture synthesis, enabling users to create 3D assets from images or text conditions. While less advanced than version 2.1, it laid the foundations for the later PBR, higher-resolution, and open-source enhancements. ...
    Downloads: 0 This Week
  • 5
    Agent Orchestrator

    Agentic orchestrator for parallel coding agents

    ...It supports a wide range of agents, runtimes, and tools, making it flexible and framework-agnostic. Through a centralized dashboard, developers can monitor progress, review outputs, and intervene only when human judgment is required. Agent Orchestrator transforms AI-assisted development into a scalable, autonomous system for continuous code improvement.
    Downloads: 2 This Week
  • 6
    Nanocoder

    A beautiful local-first coding agent running in your terminal

    Nanocoder is an open-source, local-first coding assistant that runs in the command line and allows developers to use AI models to assist with programming tasks directly from their terminal environment. The tool is designed as a privacy-focused alternative to proprietary AI coding assistants, allowing users to run local models or connect to external APIs while keeping full control over their data and development workflow. Built with TypeScript and distributed as a CLI application, nanocoder...
    Downloads: 2 This Week
  • 7
    LTX-Video

    Official repository for LTX-Video

    LTX-Video is an open-source video generation model from Lightricks, built on a diffusion transformer (DiT) architecture. It generates video from text and image prompts rather than serving as a general-purpose video editing or compositing toolkit. ...
    Downloads: 18 This Week
  • 8
    yourself-skill

    Instead of distilling others, it is better to distil yourself

    yourself-skill is an AI skill framework focused on self-reflection and personalization, enabling agents to adapt their behavior based on user context and interaction history. It encourages systems to maintain awareness of user preferences, goals, and communication styles. The project emphasizes building more human-aligned interactions by incorporating memory and contextual reasoning.
    Downloads: 0 This Week
  • 9
    ComfyUI-LTXVideo

    LTX-Video Support for ComfyUI

    ComfyUI-LTXVideo is a set of custom nodes that connects ComfyUI's node-based generative workflow environment to the LTX-Video video generation model. Instead of writing code, users assemble nodes that represent inputs, generation steps, and outputs, letting them prototype and automate video generation pipelines visually. ...
    Downloads: 16 This Week
  • 10
    rLLM

    Democratizing Reinforcement Learning for LLMs

    rLLM is an open-source framework for post-training language agents via reinforcement learning, that is, using reinforcement signals to fine-tune or adapt large language models (LLMs) into customizable agents for real-world tasks. With rLLM, developers can define custom “agents” and “environments” and then train those agents through reinforcement learning workflows, possibly surpassing what plain supervised fine-tuning provides. The project is designed to...
    Downloads: 0 This Week
  • 11
    Eino

    LLM application development framework for Go with agents and flows

    Eino is an LLM application development framework written in Go that helps developers build applications powered by large language models. Eino provides a structured environment for creating AI systems using reusable components such as chat models, retrievers, tools, embeddings, and prompt templates. It draws architectural inspiration from frameworks like LangChain and other modern AI development toolkits while remaining aligned with Go programming conventions. ...
    Downloads: 1 This Week
  • 12
    Mercury Agent

    Soul-driven AI agent with permission-hardened tools, token budgets

    Mercury Agent is a framework for building autonomous AI agents capable of executing complex workflows with minimal human intervention. It focuses on orchestrating tasks across multiple tools and services, enabling agents to perform end-to-end operations. The system includes mechanisms for planning, execution, and feedback, allowing agents to refine their actions iteratively.
    Downloads: 1 This Week
  • 13
    CogVideo

    Text and image to video generation: CogVideoX and CogVideo

    ...The project includes tools for inference, fine-tuning, and optimization, making it suitable for both research and production use. It supports efficient deployment on a range of GPUs, including consumer hardware with quantization techniques. Overall, CogVideo provides a powerful framework for generating high-quality AI videos and experimenting with cutting-edge multimodal AI systems.
    Downloads: 9 This Week
  • 14
    MONAI

    AI Toolkit for Healthcare Imaging

    The MONAI framework is the open-source foundation being created by Project MONAI. MONAI is a freely available, community-supported, PyTorch-based framework for deep learning in healthcare imaging. It provides domain-optimized foundational capabilities for developing healthcare imaging training workflows in a native PyTorch paradigm. Project MONAI also includes MONAI Label, an intelligent open source image labeling and learning tool that helps researchers and clinicians collaborate, create annotated datasets, and build AI models in a standardized MONAI paradigm. ...
    Downloads: 1 This Week
  • 15
    Motia

    Multi-Language Backend Framework that unifies APIs

    Motia is a large-scale, multi-language backend framework designed to unify and simplify core backend concerns — from APIs and background jobs to distributed workflows, streams, and advanced AI agent integration — through a single coherent primitive with built-in observability and state management. It provides a unified development philosophy that seamlessly bridges different backend patterns (queues, event streams, scheduled jobs) while offering integrated tools for telemetry, logging, metrics, and tracing to help teams maintain high visibility into production workloads. ...
    Downloads: 1 This Week
  • 16
    AgentField

    Build and run AI agents like microservices

    AgentField is an open-source control plane designed to run AI agents as production-grade backend services, applying cloud-native principles similar to Kubernetes to the world of autonomous software. Instead of treating agents as isolated scripts or prototypes, the system elevates them to first-class infrastructure components that can be deployed, orchestrated, and managed at scale across distributed environments.
    Downloads: 2 This Week
  • 17
    Wan2.2

    Wan2.2: Open and Advanced Large-Scale Video Generative Model

    Wan2.2 is a major upgrade to the Wan series of open and advanced large-scale video generative models, incorporating cutting-edge innovations to boost video generation quality and efficiency. It introduces a Mixture-of-Experts (MoE) architecture that splits the denoising process across specialized expert models, increasing total model capacity without raising computational costs. Wan2.2 integrates meticulously curated cinematic aesthetic data, enabling precise control over lighting,...
    Downloads: 99 This Week
  • 18
    InfiAgent

    Build your own Cowork, AI Scientist and other SoTA Agents

    infiAgent is an open-source AI agent framework for building powerful, long-running autonomous agents capable of tackling complex tasks without collapsing under growing context or tool invocation histories. Designed as a “Multi-Level Agent” (MLA) system, it externalizes persistent state to the file system so that agents can operate over unlimited runtime without the need for token-intensive context compression, enabling workflows such as research paper drafting, experiments, coding, and document generation to run reliably. ...
    Downloads: 0 This Week
  • 19
    Superduper

    Superduper: Integrate AI models and machine learning workflows

    Superduper is a Python-based framework for building end-to-end AI-data workflows and applications on your own data, integrating with major databases. It supports the latest technologies and techniques, including LLMs, vector search, RAG, and multimodality, as well as classical AI and ML paradigms. Developers can use Superduper by building compositional and declarative objects that outsource the details of deployment, orchestration, versioning, and more to the Superduper engine. ...
    Downloads: 0 This Week
  • 20
    Pixeltable

    Data Infrastructure providing an approach to multimodal AI workloads

    Pixeltable is an open-source Python data infrastructure framework designed to support the development of multimodal AI applications. The system provides a declarative interface for managing the entire lifecycle of AI data pipelines, including storage, transformation, indexing, retrieval, and orchestration of datasets. Unlike traditional architectures that require multiple tools such as databases, vector stores, and workflow orchestrators, Pixeltable unifies these functions within a table-based abstraction. ...
    Downloads: 0 This Week
  • 21
    Agent SOP

    Natural language workflows for AI agents

    Agent SOP is a framework that implements structured operational procedures (SOPs) for autonomous agents so that they can carry out complex multi-step tasks reliably and in a defined order. Instead of relying solely on broad language model reasoning, this project enforces explicit step sequences with checkpoints, conditional transitions, and rollback logic, making agent workflows more predictable and auditable. It defines reusable SOP templates that agents can instantiate with...
    Downloads: 0 This Week
  • 22
    aie-book

    Resources for AI engineers

    ...It explores how to work with large language models, retrieval systems, and agent-based architectures, providing a practical perspective on how AI is actually used in industry. The repository serves as both a learning resource and a reference, combining explanations, examples, and structured insights into a cohesive framework. It is especially useful for engineers transitioning into AI-focused roles who need to understand the full lifecycle of AI applications.
    Downloads: 0 This Week
  • 23
    K8M

    Mini Kubernetes AI Dashboard

    K8M is an AI-driven mini Kubernetes dashboard designed to simplify cluster management. It offers a lightweight console tool with integrated large language model capabilities for enhanced operational efficiency.
    Downloads: 0 This Week
  • 24
    HyperAgent

    AI Browser Automation

    HyperAgent is an open-source browser automation framework that combines large language models with modern browser scripting tools to create intelligent web automation agents. Built on top of Playwright, the framework allows developers to automate complex browser interactions using natural language commands rather than fragile selectors or hard-coded scripts. Instead of manually writing logic for clicking elements, extracting data, or navigating web pages, developers can instruct the agent in plain language and allow the AI layer to interpret and execute the task. ...
    Downloads: 1 This Week
  • 25
    ET

    Unity3D Client And C# Server Framework

    ET is a game development framework combining a Unity3D client and a C# server architecture, intended to streamline common game modules, facilitate distributed server setups, and support AI-assisted development workflows. It emphasizes modular component design, networking, and automated tooling (e.g. test case generation, code scaffolding) to lighten the burden of writing game logic and server infrastructure.
    Downloads: 1 This Week