62 projects for "combine" with 2 filters applied:

  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Pipecat

    Pipecat

    Framework for building real-time voice and multimodal AI agents

    ...Developers can create a wide range of interactive systems including voice assistants, customer service agents, interactive storytelling applications, and multimodal interfaces that combine voice, video, images, and text. Its modular architecture allows components to be composed into pipelines that process audio, text, and video streams in real time.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    mergekit

    mergekit

    Tools for merging pretrained large language models

    mergekit is an open-source toolkit designed to combine multiple pretrained language models into a single unified model through parameter merging techniques. The framework enables developers to merge model checkpoints so that the resulting model inherits capabilities from several source models without requiring additional training. This approach allows researchers to combine specialized models into a more versatile system capable of performing multiple tasks. mergekit implements a variety of merging algorithms and strategies that control how model parameters are blended together during the merging process. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    TurboQuant+

    TurboQuant+

    Implementation of TurboQuant (ICLR 2026)

    ...It builds upon the concept of reducing model precision to accelerate inference while attempting to maintain or recover accuracy through refined techniques. The project explores additional enhancements such as improved calibration, adaptive quantization, and potentially hybrid precision approaches that combine multiple levels of compression. It is designed to be used in conjunction with modern machine learning workflows, particularly those involving large models that require optimization for deployment. TurboQuant Plus focuses on experimentation and performance tuning, allowing developers to test different configurations and evaluate trade-offs. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    Qwen3-TTS

    Qwen3-TTS

    Qwen3-TTS is an open-source series of TTS models

    ...Because it’s part of the broader Qwen ecosystem, it benefits from the model’s understanding of linguistic nuances, enabling more accurate pronunciation, prosody, and contextual delivery than many traditional TTS systems. Developers can customize voice output parameters like speed, pitch, and volume, and combine the TTS stack with other AI components.
    Downloads: 13 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    AI-Trader

    AI-Trader

    100% Fully-Automated Agent-Native Trading

    AI-Trader is an open-source AI-powered quantitative trading framework designed to combine financial analysis, machine learning, and autonomous trading workflows into a unified research platform. The project integrates large language models, financial indicators, market analysis pipelines, and automated decision-making systems to support strategy generation and market prediction tasks. It is built to help researchers and developers experiment with AI-assisted trading strategies using historical and real-time financial data. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Grok CLI

    Grok CLI

    An open-source AI agent that brings the power of Grok

    ...The CLI supports streaming responses, so outputs appear in real time as the Grok model generates them, making interactions feel responsive and fluid in terminal contexts. Grok CLI is designed to integrate with existing terminal habits—aliases, pipes, editors, and tooling—so you can combine AI assistance with native command-line workflows like grep, awk, and git. It also includes authentication support, configuration management, and caching options so frequent queries are efficient.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 7
    HybridClaw

    HybridClaw

    The enterprise operating layer for open agents

    HybridClaw is an emerging open-source framework focused on enabling hybrid AI agent systems that combine local execution, tool integration, and multi-agent orchestration into a cohesive development environment. It is designed to work alongside modern agent ecosystems such as OpenClaw, Claude Code, and similar agentic coding tools, providing a flexible infrastructure for managing agent behaviors, workflows, and capabilities. The project emphasizes modularity, allowing developers to define and compose “skills” or capabilities that agents can invoke dynamically, enabling more adaptive and context-aware automation. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    HunyuanCustom

    HunyuanCustom

    Multimodal-Driven Architecture for Customized Video Generation

    HunyuanCustom is a multimodal video customization framework by Tencent Hunyuan, aimed at generating customized videos featuring particular subjects (people, characters) under flexible conditions, while maintaining subject/identity consistency. It supports conditioning via image, audio, video, and text, and can perform subject replacement in videos, generate avatars speaking given audio, or combine multiple subject images. The architecture builds on HunyuanVideo, with added modules for identity reinforcement and modality-specific condition injection. Text-image fusion module based on LLaVA for improved multimodal understanding. Applicable to single- and multi-subject scenarios, video editing/replacement, singing avatars etc.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    NVIDIA PhysicsNeMo

    NVIDIA PhysicsNeMo

    Open-source deep-learning framework for building and training

    ...The framework focuses on the emerging field of physics-informed machine learning, where neural networks are used alongside physical equations to model complex scientific systems. PhysicsNeMo provides modular Python components that allow developers to create scalable training and inference pipelines for models that combine data-driven learning with physics-based constraints. It is built on top of the PyTorch ecosystem and integrates with GPU-accelerated computing environments to handle computationally demanding simulations and datasets. The framework supports a wide range of scientific applications, including computational fluid dynamics, climate modeling, weather prediction, and engineering simulations.
    Downloads: 2 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 10
    TypedAI

    TypedAI

    TypeScript AI platform with AI chat, Autonomous agents

    ...Written in TypeScript, the platform emphasizes strong typing and structured development patterns to improve reliability when building AI-driven systems. TypedAI includes tools for building chat interfaces, managing LLM interactions, and orchestrating multi-step workflows that combine AI reasoning with external tools. The platform also includes specialized software engineering agents that can assist with tasks such as code reviews or repository analysis. Developers can integrate multiple model providers and tools into the platform to create flexible agent pipelines.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Reader 3

    Reader 3

    Quick illustration of how one can easily read books together with LLMs

    This project is a minimalist, self-hosted EPUB reader designed to help users browse and read EPUB books one chapter at a time through a lightweight local server, making it especially easy to extract or work with chapters in external tools like large language models. It was created primarily as a simple demonstration of how to combine local book reading with LLM workflows without heavy dependencies or complicated setup, and it runs with just a small Python script and a basic HTTP server. The interface focuses on clarity and ease of use, offering straightforward navigation of book chapters rather than full-featured e-reading capabilities. While it lacks advanced features like built-in annotations or rich media support, its simplicity is intentional, enabling users to quickly load EPUBs, view them in a browser, and even repurpose text for downstream tasks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    LiveKit Agents

    LiveKit Agents

    Framework for building realtime multimodal voice AI agents apps

    ...These agents can join LiveKit rooms as participants and interact with users or systems through speech, text, and other modalities. LiveKit Agents provides libraries and tooling that allow developers to combine speech-to-text, large language models, and text-to-speech services to build interactive AI experiences. It is designed to run server-side and can integrate with various AI model providers and realtime APIs to support different application requirements. LiveKit Agents also includes tools for scheduling and managing agent tasks, making it easier to connect users to automated assistants in live communication scenarios.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    RuoYi AI

    RuoYi AI

    Enterprise AI platform for building, deploying, and managing apps

    RuoYi AI is a full-stack enterprise-oriented AI development platform designed to help developers rapidly build, deploy, and manage intelligent applications using modern large language models and AI ecosystems. It provides a unified framework for integrating multiple AI models from different providers, allowing teams to switch or combine models through a consistent interface without vendor lock-in. RuoYi AI includes built-in support for retrieval-augmented generation, enabling organizations to create secure, private knowledge bases with high-accuracy search and reasoning capabilities. It also offers visual workflow orchestration tools that allow users to design complex AI pipelines, automate tasks, and coordinate multi-agent systems for advanced decision-making scenarios. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    Open Agents

    Open Agents

    An open source template for building cloud agents

    ...The project also includes examples and templates that demonstrate how to build and deploy agents for real-world applications. By prioritizing composability, it allows developers to combine simple components into more complex agent systems. Overall, open-agents serves as a playground for building and experimenting with next-generation AI agent architectures.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Youtu-Agent

    Youtu-Agent

    A simple yet powerful agent framework that delivers with models

    ...The framework supports automated generation of agent components, enabling the system to synthesize prompts, tool interfaces, and workflow configurations automatically. Youtu-Agent also incorporates hybrid learning strategies that combine experience accumulation with reinforcement learning to improve agent performance over time. These learning mechanisms allow agents to refine their reasoning, coding, and search capabilities as they interact with environments and tasks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Autonomous Agents

    Autonomous Agents

    Autonomous Agents (LLMs) research papers. Updated Daily

    ...One of the central concepts explored in the repository is the integration of different sensory modalities using advanced machine learning techniques such as Feature-wise Linear Modulation and graph-based attention mechanisms. These methods allow agents to combine visual and geometric information while maintaining awareness of the spatial relationships between agents and objects.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    One-Person Company

    One-Person Company

    One-Person Company AI Tools Series

    ...The repository includes workflows, strategies, and system designs that demonstrate how multiple AI agents can collaborate to run different aspects of a business in a coordinated manner. It emphasizes modularity, allowing users to combine different tools and services into a cohesive pipeline that automates end-to-end business processes. The project also highlights the importance of integrating APIs, automation tools, and AI models to create self-sustaining systems that can generate revenue with minimal oversight.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    SuperPrompt

    SuperPrompt

    Experimental prompt framework exploring reasoning structures in AI

    SuperPrompt is an experimental open source project focused on designing complex prompts intended to help researchers and developers better understand how AI agents reason and respond. It explores structured prompt engineering techniques that combine symbolic expressions, logical constructs, and conceptual frameworks to guide large language models toward deeper reasoning processes. Its main concept revolves around a highly structured prompt format that includes tagged sections for reasoning, analysis, conceptual expansion, and recursive thinking patterns. These sections act as a kind of meta-instruction system intended to influence how an AI model approaches problem solving and conceptual exploration. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    LangChain

    LangChain

    ⚡ Building applications with LLMs through composability ⚡

    Large language models (LLMs) are emerging as a transformative technology, enabling developers to build applications that they previously could not. But using these LLMs in isolation is often not enough to create a truly powerful app - the real power comes when you can combine them with other sources of computation or knowledge. This library is aimed at assisting in the development of those types of applications.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    TaskingAI

    TaskingAI

    Open platform for building, deploying, and managing LLM agents

    ...It follows a Backend as a Service approach, allowing developers to separate AI logic from frontend product development while maintaining a structured and scalable workflow. TaskingAI integrates hundreds of language models from multiple providers into a unified system, enabling developers to switch models or combine capabilities without major reconfiguration. It includes a modular architecture that supports components such as assistants, tools, retrieval systems, and conversation management, all accessible through a consistent interface. TaskingAI also provides a built-in user interface for managing projects, testing workflows, and configuring AI agents without needing to rely entirely on code. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    MediaPipe Solutions

    MediaPipe Solutions

    Cross-platform, customizable ML solutions

    MediaPipe is an open-source framework developed by Google for building cross-platform machine learning pipelines that process audio, video, and other streaming data in real time. The system provides developers with tools and reusable components that allow them to combine multiple machine learning models with preprocessing and postprocessing logic into efficient perception pipelines. These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web browsers, and embedded edge devices. MediaPipe is widely used in computer vision and multimedia applications such as hand tracking, face detection, pose estimation, object recognition, and gesture analysis. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    comfyui-mixlab-nodes

    comfyui-mixlab-nodes

    Workflow and speech recognition app

    ...The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints, Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, clip interrogation.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Embabel Agent Framework

    Embabel Agent Framework

    Agent framework for the JVM. Pronounced Em-BAY-bel

    Embabel Agent Framework is a JVM-based framework for building advanced AI agent systems that combine structured programming with large language model interactions to execute complex, goal-oriented workflows. The framework introduces a planning-driven approach where agents dynamically determine sequences of actions required to achieve objectives, rather than relying on fixed pipelines or predefined flows. It models agent behavior through concepts such as goals, actions, and conditions, allowing systems to adapt in real time based on changing inputs and outcomes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Swarms

    Swarms

    Enterprise multi-agent orchestration framework for scalable AI apps

    ...It emphasizes production readiness, offering modular architecture, high availability, and observability features suitable for large-scale deployments. It supports integration with multiple model providers and existing ecosystems, allowing developers to combine different AI tools and frameworks within a unified system. Swarms also includes mechanisms for agent lifecycle management, memory handling, and dynamic composition, making it adaptable to evolving workloads. Additionally, it focuses on developer productivity through APIs, CLI tools, and templates that simplify building and deploying agent-based applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    TypeAgent Python

    TypeAgent Python

    Structured RAG: ingest, index, query

    ...Instead of relying solely on free-form prompts, the architecture emphasizes converting natural language interactions into structured representations that can be processed by deterministic software components. This design allows the system to combine the flexibility of language models with the reliability of traditional programming logic. The repository is intended primarily as a research prototype and sample code rather than a production-ready framework, allowing developers to experiment with building AI agents that maintain structured memory and perform tasks through defined actions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB