Showing 8 open source projects for "gemini"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 1
    Portia SDK Python

    Portia SDK Python

    Portia Labs Python SDK for building agentic workflows

    ...It supports tool-backed agents capable of real-world interactions—like web browsing, API access, and human-in-the-loop clarifications—while maintaining transparency and auditability through structured plans and execution hooks. Designed for production environments, the SDK integrates with local or cloud LLMs (e.g. OpenAI, Anthropic, Mistral, Gemini) and supports extensive tool registries, session handling, and memory management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Self-Operating Computer

    Self-Operating Computer

    A framework to enable multimodal models to operate a computer

    The Self-Operating Computer Framework is an innovative system that enables multimodal models to autonomously operate a computer by interpreting the screen and executing mouse and keyboard actions to achieve specified objectives. This framework is compatible with various multimodal models and currently integrates with GPT-4o, o1, Gemini Pro Vision, Claude 3, and LLaVa. Notably, it was the first known project to implement a multimodal model capable of viewing and controlling a computer screen. The framework supports features like Optical Character Recognition (OCR) and Set-of-Mark (SoM) prompting to enhance visual grounding capabilities. It is designed to be compatible with macOS, Windows, and Linux (with X server installed), and is released under the MIT license.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    Agent Stack

    Agent Stack

    Deploy and share agents with open infrastructure

    ...The platform supports agents built in frameworks like LangChain, CrewAI, etc., enabling them to be hosted, managed and shared through a unified interface. It also offers multi-model, multi-provider support (OpenAI, Anthropic, Gemini, IBM WatsonX, Ollama etc.), letting users compare performance and cost across models. For developers and organizations building AI-agent products or automations, Agent Stack gives a scaffold that handles the “plumbing”, so they can focus on logic and domain.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Agent S

    Agent S

    Agent S: an open agentic framework that uses computers like a human

    ...Agent S combines powerful foundation models (such as GPT-5) with grounding models like UI-TARS to translate visual inputs into precise executable actions. It supports flexible deployment via CLI, SDK, or cloud, and integrates with multiple model providers including OpenAI, Anthropic, Gemini, Azure, and Hugging Face endpoints. With optional local code execution, reflection mechanisms, and compositional planning, Agent S provides a scalable and research-driven framework for building advanced computer-use agents.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 5
    RAGapp

    RAGapp

    The easiest way to use Agentic RAG in any enterprise

    The easiest way to use Agentic RAG in any enterprise. As simple to configure as OpenAI's custom GPTs, but deployable in your own cloud infrastructure using Docker. Built using LlamaIndex. Just the RAGapp container doesn't come with any authentication layer by design. This is the task of an API Gateway routing the traffic to RAGapp. This step heavily depends on your cloud provider and the services you use. For a pure Docker Compose environment, you can look at our RAGapp with management UI...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Mastra

    Mastra

    The TypeScript AI agent framework

    ...At its core, Mastra provides abstractions for agents, workflows, tools, memory, retrieval, and model routing, so developers can focus on specifying behavior rather than wiring infrastructure from scratch. Model routing lets you connect to dozens of providers (OpenAI, Anthropic, Gemini, and others) through a single standardized interface, while agents orchestrate LLM calls and tools to solve open-ended tasks with internal reasoning loops. When explicit control is needed, Mastra’s workflow engine uses a graph-style API (.then(), .branch(), .parallel()) to orchestrate multi-step processes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Browser Use

    Browser Use

    Make websites accessible for AI agents

    ...It enables developers and AI systems to perform complex online tasks such as form filling, data extraction, and navigation through natural language instructions. Built with Python and compatible with modern LLMs, it integrates seamlessly with tools like ChatBrowserUse, Google Gemini, and Anthropic models. The platform supports both open-source deployment and a fully hosted cloud version for enhanced scalability and performance. Its cloud offering includes advanced capabilities like stealth browsing, CAPTCHA solving, and proxy rotation for reliable automation. Overall, Browser Use transforms web interaction into an intelligent, programmable workflow driven by AI agents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    PC-Gui

    PC-Gui

    Lightweight PC-Gui framework for AI, typewriter stream Gemini-like

    PC-GUI: A lightweight desktop GUI framework for AI, natively supporting live typewriter-style streaming output like Gemini! 🎉 💡 Core philosophy: Rapid development · Minimal footprint · Native performance. We empower you to build premium desktop tools that users are willing to pay for. PC-GUI helps you meet strong market demands by building compact, powerful, commercial-grade applications with a simple and stable tech stack. We adopt a "backend-first approach" to desktop development: a stable Go backend (net/http) powers a standard web frontend (HTML/CSS/JS), coupled with encrypted SQLite storage for an extremely lightweight and high-performance design. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB