Showing 483 open source projects for "process"

  • 1
    Skills For Real Engineers

    Skills for Real Engineers. Straight from my .claude directory

    ...Skills can be installed individually and integrated into agent environments, making them highly composable. Overall, the project transforms AI from a reactive assistant into a process-driven engineering collaborator.
    Downloads: 3 This Week
  • 2
    Wanwu AI Agent Platform

    Enterprise AI agent platform for workflows, models, and RAG apps

    ...It provides a multi-tenant environment that enables teams to create AI agents, orchestrate workflows, and implement retrieval-augmented generation systems within a unified framework. Wanwu integrates large language models with business process automation, allowing developers to design complex, production-ready AI solutions tailored to enterprise needs. It includes comprehensive model lifecycle management capabilities, enabling users to configure, monitor, and manage different models efficiently. Wanwu also supports knowledge base construction, allowing organizations to incorporate structured and unstructured data into their AI applications. ...
    Downloads: 3 This Week
  • 3
    openv0

    AI tool for generating modular UI components with live previews

    ...It enables users to generate, iterate, and refine UI components with a live preview system, making it easier to visualize results during development. openv0 is built around a modular architecture, allowing different parts of the generation process to be extended or replaced through plugins. A key aspect of openv0 is its multipass pipeline, where each stage of component generation operates independently, enabling more complex and flexible transformations. It integrates with multiple frontend frameworks and UI libraries, leveraging existing open source ecosystems to build reusable assets. openv0 also supports various icon libraries and emphasizes extensibility for adding new tools, frameworks, and integrations. ...
    Downloads: 3 This Week
  • 4
    LLM Scraper

    Extract structured data from webpages using LLM-powered scraping

    ...Developers can specify the data structure using tools such as Zod or JSON Schema, enabling the model to extract relevant information directly into typed objects. LLM Scraper integrates browser automation through Playwright, allowing it to load webpages and process their content before sending it to a language model for interpretation. Multiple content processing modes are supported, including raw HTML, cleaned HTML, Markdown, extracted text, screenshots, and custom inputs, making it adaptable to a wide range of scraping scenarios. LLM Scraper also provides streaming output and code generation capabilities that help developers build reusable scraping workflows.
    Downloads: 4 This Week
  • 5
    Pipecat

    Framework for building real-time voice and multimodal AI agents

    ...Developers can create a wide range of interactive systems including voice assistants, customer service agents, interactive storytelling applications, and multimodal interfaces that combine voice, video, images, and text. Its modular architecture allows components to be composed into pipelines that process audio, text, and video streams in real time.
    Downloads: 3 This Week
  • 6
    Netflix Maestro

    Netflix’s Workflow Orchestrator

    ...The system acts as a general-purpose workflow orchestrator that manages the execution, scheduling, monitoring, and recovery of large pipelines used for analytics and AI operations. It was designed to support the demanding internal infrastructure of Netflix, where thousands of workflows must process massive volumes of data reliably and efficiently every day. The platform enables engineers and data scientists to define workflows using structured configuration files and execute tasks across diverse compute environments, including scripts, containers, and notebook environments. Maestro provides built-in mechanisms for retry logic, task scheduling, dependency management, and error handling, which are essential when orchestrating production-scale pipelines.
    Downloads: 3 This Week
  • 7
    mistral.rs

    Fast, flexible LLM inference

    ...The project includes hardware-aware tooling that can benchmark a system and choose sensible quantization and device-mapping strategies, helping users get strong performance without manual tuning. It also supports serving multiple models from the same server process, enabling routing or quick switching between models depending on workload needs. For user-facing testing, mistral.rs can provide a built-in web UI, and it also offers a dedicated lightweight web chat interface that supports richer interaction patterns.
    Downloads: 3 This Week
  • 8
    DeepSeek VL2

    Mixture-of-Experts Vision-Language Models for Advanced Multimodal

    ...or “Generate a caption appropriate to context”). The model supports both image understanding (vision tasks) and multimodal reasoning, and is likely used as a component in agent systems to process visual inputs as context for downstream tasks. The repository includes evaluation results (e.g. image/text alignment scores, common VL benchmarks), configuration files, and model weights (where permitted). While the internal architecture details are not fully documented publicly, the repo suggests that VL2 introduces enhancements over prior vision-language models (e.g. better scaling, cross-modal attention, more robust alignment) to improve grounding and multimodal understanding.
    Downloads: 4 This Week
  • 9
    Postiz

    The ultimate social media scheduling tool, with a bunch of AI

    ...Easily manage multiple client accounts for increased productivity and better results. Schedule, analyze, and engage with your audience. Cross-post your social media posts into multiple channels. Improve your content creation process with an AI agent that performs all tasks for you. Use a Canva-like tool to create stunning visuals for your social media posts and generate pictures with AI. Manage your social media channels with ease. Collaborate with your team and delegate tasks. Expose your brand to a wider audience by connecting with influencers and brands. ...
    Downloads: 4 This Week
  • 10
    GoCV

    Go package for computer vision using OpenCV 4 and beyond

    ...Our mission is to make the Go language a “first-class” client compatible with the latest developments in the OpenCV ecosystem. Computer Vision (CV) is the ability of computers to process visual information and perform tasks normally performed by humans. CV software typically processes video images, then uses the data to extract information in order to do something useful. Because memory for images in GoCV is allocated through C-based code, the Go garbage collector will not clean up all resources associated with a Mat. ...
    Downloads: 3 This Week
  • 11
    Harmonist

    Portable AI agent orchestration with mechanical protocol enforcement

    ...The project uses Python, has no runtime dependencies beyond the standard library, and is positioned as a drop-in agent coordination pack. Its purpose is to bring structure, review discipline, and repeatable process control to AI-assisted development.
    Downloads: 2 This Week
  • 12
    gstack

    Use Garry Tan's exact Claude Code setup: 15 opinionated tools

    ...The system includes a set of curated tools that simulate roles like CEO, engineering manager, designer, and QA engineer, allowing developers to orchestrate complex development cycles more efficiently. It emphasizes structured thinking and process discipline, encouraging users to follow consistent workflows rather than ad hoc development practices. gstack integrates browsing, planning, reviewing, and shipping functionalities into a cohesive system, making it particularly useful for teams or individuals building products with AI assistance.
    Downloads: 2 This Week
  • 13
    LLM Vision

    Visual intelligence for your home.

    ...Instead of relying only on traditional object detection pipelines, it allows users to send prompts about visual content and receive contextual descriptions or answers about what is happening in camera footage. The system can process events from surveillance platforms such as Frigate and convert them into meaningful summaries, notifications, or structured data for automation workflows. It also maintains a timeline of analyzed camera events that can be displayed in dashboards or queried through the assistant interface.
    Downloads: 2 This Week
  • 14
    Neuron AI

    The PHP Agentic Framework to build production-ready AI driven apps

    Neuron AI is a PHP agentic framework for building production-ready AI applications that connect models, memory, vector databases, and tools into working agents. It is designed for developers who want to create systems such as RAG pipelines, multi-agent workflows, and business process automations without having to hand-build every integration from scratch. The framework provides an Agent class that can be extended to inherit core capabilities like memory, tools, function calling, and retrieval-augmented generation. Its design is modular, so developers can swap model providers with minimal changes to their application code, which makes it practical for teams that need flexibility across vendors. ...
    Downloads: 2 This Week
  • 15
    OpenGame

    Open Agentic Coding for Games

    OpenGame is an open-source project aimed at providing a flexible framework or toolkit for building games, likely focusing on accessibility, modularity, and collaborative development. It appears designed to simplify the process of creating interactive experiences by offering reusable components and structured architecture for game logic, rendering, and input handling. The project likely supports experimentation with different gameplay mechanics and encourages customization through open-source contributions. Its design philosophy emphasizes ease of use while still allowing developers to scale complexity as needed. ...
    Downloads: 1 This Week
  • 16
    MCP Golang

    Write Model Context Protocol servers in a few lines of Go code

    mcp-golang is an unofficial Go implementation of the Model Context Protocol (MCP), allowing developers to write MCP servers and clients with minimal code. It aims to simplify the development process by providing a straightforward API for integrating MCP functionalities into Go applications. Comprehensive documentation is available to assist developers in getting started.
    Downloads: 0 This Week
  • 17
    MiniMind

    Train a 26M-parameter GPT from scratch in just 2h

    minimind is a framework that enables users to train a 26-million-parameter GPT (Generative Pre-trained Transformer) model from scratch in approximately two hours. It provides a streamlined process for data preparation, model training, and evaluation, making it accessible for individuals and organizations to develop their own language models without extensive computational resources.
    Downloads: 0 This Week
  • 18
    AgentUniverse

    agentUniverse is an LLM multi-agent framework

    AgentUniverse is a multi-agent AI framework that enables coordination between multiple intelligent agents for complex task execution and automation.
    Downloads: 1 This Week
  • 19
    gensim

    Topic Modelling for Humans

    Gensim is a Python library for topic modeling, document indexing, and similarity retrieval with large corpora. The target audience is the natural language processing (NLP) and information retrieval (IR) community.
    Downloads: 0 This Week
  • 20
    Claude Code Video Vision

    Give Claude the ability to watch and understand videos

    Claude Video Vision is a plugin designed for Claude Code that enables large language models to process and understand video content by transforming it into multimodal inputs the model can reason over. Instead of attempting to directly interpret raw video streams, the system extracts key frames using tools like ffmpeg and processes audio through transcription engines, converting both visual and auditory signals into structured inputs for the model.
    Downloads: 3 This Week
  • 21
    WrenAI

    Open-source SQL AI Agent for Text-to-SQL. Make Text2SQL Easy

    ...Wren AI has implemented a semantic engine architecture to give the LLM the context of your business; you can easily establish a logical presentation layer on your data schema that helps the LLM learn more about your business context. With Wren AI, you can describe metadata, schema, terminology, data relationships, and the logic behind calculations and aggregations with “Modeling Definition Language” to generate accurate SQL queries with semantic context. When you start a new conversation in Wren AI, your question is used to find the most relevant tables. From these, the LLM generates three relevant questions for the user to choose from. ...
    Downloads: 3 This Week
  • 22
    Pedalboard

    A Python library for audio

    ...Internally at Spotify, pedalboard is used for data augmentation to improve machine learning models and to help power features like Spotify’s AI DJ and AI Voice Translation. pedalboard also helps in the process of content creation, making it possible to add effects to audio without using a Digital Audio Workstation.
    Downloads: 3 This Week
  • 23
    ShortGPT

    AI framework for automated short video creation and editing tools

    ...It provides a structured system that handles multiple stages of the content creation workflow, including script generation, asset sourcing, voiceover synthesis, and video editing. ShortGPT uses large language models to generate scripts and prompts that guide the automated editing and production process. ShortGPT includes specialized content engines that manage different workflows, such as generating short videos, producing longer videos, and translating existing videos into other languages. It can automatically assemble videos by combining generated scripts, sourced media assets, captions, and synthesized voice narration. A modular editing system based on structured markup and JSON allows editing steps to be broken into manageable components that can be interpreted by language models.
    Downloads: 2 This Week
  • 24
    hls4ml

    Machine learning on FPGAs using HLS

    ...This approach allows machine learning algorithms to run directly on specialized hardware, making them suitable for applications that require extremely fast response times and minimal power consumption. The framework was originally developed for high-energy physics experiments where real-time decision systems must process large volumes of data with strict latency constraints. Over time, it has expanded to support a variety of scientific and industrial applications including signal processing, embedded systems, and biomedical monitoring.
    Downloads: 2 This Week
  • 25
    Hollama

    A minimal LLM chat app that runs entirely in your browser

    ...Because the application runs as a static web interface, it does not require complex backend infrastructure and can be easily deployed or self-hosted. Hollama supports both text-based and multimodal interactions, allowing users to work with models that process images as well as text. The interface includes features for editing prompts, retrying responses, copying generated code snippets, and storing conversation history locally within the browser. Mathematical expressions can be rendered using KaTeX, and Markdown formatting allows code blocks and structured outputs to appear clearly within conversations.
    Downloads: 2 This Week