Showing 483 open source projects for "process"

  • 1
    GPU Hot

    Real-time NVIDIA GPU dashboard

    GPU Hot is an open-source, lightweight monitoring dashboard designed to provide real-time visibility into NVIDIA GPU performance across single machines or entire clusters. The project offers a self-hosted web interface that streams hardware metrics directly from GPU servers, enabling developers, ML engineers, and system administrators to observe GPU utilization and system behavior in real time through a browser. The dashboard collects and displays a wide range of performance metrics...
    Downloads: 1 This Week
  • 2
    LLM-Aided OCR Project

    Enhances Tesseract OCR output using LLMs (local or API)

    ...The system first extracts raw text using OCR engines and then applies language models to analyze and correct recognition errors based on context. This AI-assisted correction process helps reconstruct missing characters, fix formatting mistakes, and produce more coherent text outputs. The project is particularly useful for digitizing historical documents, research papers, and scanned materials where traditional OCR often struggles. It also includes tools for processing batches of images or documents, enabling automated document digitization workflows.
    Downloads: 1 This Week
  • 3
    CowAgent

    AI assistant based on large models that can actively think and plan

    CowAgent, based on the chatgpt-on-wechat project, is an open-source AI agent framework that integrates large language models into the WeChat ecosystem to create intelligent conversational assistants. It enables automated message handling by connecting WeChat accounts with AI models that can generate contextual replies, process voice messages, and produce images directly inside chats. The platform has evolved beyond a simple chatbot into a more autonomous agent capable of planning complex tasks, maintaining long-term memory, and invoking external tools to complete workflows. It supports multi-turn conversations with per-user context tracking, allowing more natural and persistent interactions across private and group chats. ...
    Downloads: 1 This Week
  • 4
    Excalibur

    Excalibur is a highly opinionated agent harness

    Excalibur is an experimental or utility-oriented project that appears to focus on enabling structured execution, control, or enhancement of workflows within AI or development environments. The system likely provides tools for managing tasks, orchestrating processes, or enhancing decision-making capabilities in automated systems. Its design suggests an emphasis on control and precision, allowing users to define how tasks are executed and monitored. It may include abstractions for handling...
    Downloads: 0 This Week
  • 5
    NemoClaw

    NVIDIA plugin for secure installation of OpenClaw

    NVIDIA NemoClaw is an open-source tool designed to simplify the deployment and management of always-on AI assistants using the OpenClaw ecosystem. It installs and configures the NVIDIA OpenShell runtime, which provides a secure environment for running autonomous AI agents. NemoClaw enables users to launch sandboxed agent environments that control network access, file permissions, and inference requests through policy-based security. The platform integrates with AI models such as NVIDIA...
    Downloads: 2 This Week
  • 6
    Matcha-TTS

    A fast TTS architecture with conditional flow matching

    Matcha-TTS is a non-autoregressive neural text-to-speech architecture that uses conditional flow matching to generate speech quickly while maintaining natural quality. It models speech as an ODE-based generative process, and conditional flow matching lets it reach high-quality audio in only a few synthesis steps, which greatly reduces latency compared to score-matching diffusion approaches. The model is fully probabilistic, so it can generate diverse realizations of the same text while still sounding stable and intelligible. The repository provides an end-to-end TTS pipeline: a PyTorch/Lightning training stack, configuration files, pre-trained checkpoints, a command-line interface, and a Gradio app for interactive testing. ...
    Downloads: 2 This Week
  • 7
    Qwen-Agent

    Agent framework and applications built upon Qwen>=3.0

    Qwen-Agent is a framework for building applications and agents using Qwen models (version 3.0+). It provides components for instruction following, tool usage (function calling), planning, memory, RAG (retrieval-augmented generation), a code interpreter, and more. It ships with example applications (Browser Assistant, Code Interpreter, Custom Assistant) and supports GUI front-ends, backends, and server setups. Agent workflows can maintain context and memory to perform multi-turn or more complex logic over time....
    Downloads: 2 This Week
  • 8
    ChatterBot

    Machine learning, conversational dialog engine for creating chat bots

    ...This makes it easy for developers to create chat bots and automate conversations with users. For more details about the ideas and concepts behind ChatterBot, see the process flow diagram. The language-independent design of ChatterBot allows it to be trained to speak any language. Additionally, the machine-learning nature of ChatterBot allows an agent instance to improve its own knowledge of possible responses as it interacts with humans and other sources of informative data. An untrained instance of ChatterBot starts off with no knowledge of how to communicate. ...
    Downloads: 2 This Week
  • 9
    Ditto

    The simplest self-building coding agent

    ...Users describe the app they want, and the system attempts to plan and create routes, templates, static assets, and supporting files. It uses an LLM loop with basic tools to automate part of the coding process. The project is intentionally lightweight and experimental, making it easier to understand than larger agentic coding platforms. Its modular structure separates generated Flask components into cleaner directories for routes, templates, and static files. It is best suited for prototyping, learning, and exploring how natural-language app generation can work in a small local project.
    Downloads: 0 This Week
  • 10
    Train LLM From Scratch

    A straightforward method for training your LLM

    ...It is based on the architecture described in Attention Is All You Need and is designed to make the training pipeline understandable rather than hidden behind a large framework. The repository walks through the process from downloading data to generating text with a trained model. It supports training smaller or larger models, including million- and billion-parameter configurations depending on available hardware. A major goal is accessibility, since the author frames it as possible to train models using a single GPU. It is most useful for learners, researchers, and developers who want practical exposure to LLM internals.
    Downloads: 0 This Week
  • 11
    NVIDIA AI Blueprint

    Suite of reference architectures for building GPU-accelerated vision

    NVIDIA AI Blueprint is an AI blueprint for building GPU-accelerated video intelligence applications and vision agents. It combines accelerated vision microservices, vision language models, large language models, embeddings, and NVIDIA NIM microservices to process both stored and streaming video. The project is organized around real-time video intelligence, downstream analytics, and agentic offline processing. It supports workflows such as natural-language video search, visual question answering, long-video summarization, clip retrieval, verified alerts, and incident analysis. It is designed for technical users who need deployable reference architectures for smart spaces, warehouse automation, SOP validation, monitoring, and operational video analytics. ...
    Downloads: 0 This Week
  • 12
    Alpamayo 1

    Bridging Reasoning and Action Prediction

    ...The model is designed as a foundational component rather than a complete driving stack, allowing developers to build custom autonomous vehicle applications on top of it. It incorporates vision-language-action modeling, enabling it to process sensor data and contextual information simultaneously. Alpamayo supports tasks such as trajectory prediction, auto-labeling, and reasoning-based decision making. The system is optimized for high-performance GPU environments and is intended primarily for experimentation and benchmarking. Overall, it represents an advanced step toward integrating reasoning into autonomous driving pipelines.
    Downloads: 0 This Week
  • 13
    GEO Content Writer

    Backlog-row-first content production system for teams

    ...The tool is particularly useful for businesses targeting local markets or region-specific audiences. It integrates into broader SEO pipelines, allowing content generation to be part of a continuous optimization process. Overall, GEO Content Writer enables scalable, AI-driven content creation tailored for modern search ecosystems.
    Downloads: 0 This Week
  • 14
    MedgeClaw

    Open-source AI research assistant for biomedicine

    ...The system connects conversational interfaces with computational environments, allowing users to initiate research tasks through messaging platforms while the backend executes analyses using tools like R and Python. It includes a real-time dashboard that displays progress, generated code, and outputs, providing transparency throughout the research process. MedgeClaw also supports reproducibility by generating structured reports and maintaining consistent environments through containerization. Its architecture combines conversational AI, automated pipelines, and scientific tooling into a unified workflow.
    Downloads: 0 This Week
  • 15
    vibecode-cli

    The official vibecode.dev CLI built for agents

    ...It supports a wide variety of programming languages and file types, enabling developers to work on diverse projects within a unified interface. Vibecode CLI also handles process execution, logging, and threading, ensuring smooth operation even for more complex tasks. Its design emphasizes minimal setup and ease of use, allowing developers to quickly integrate it into their workflow.
    Downloads: 0 This Week
  • 16
    Biomni

    Biomni: a general-purpose biomedical AI agent

    Biomni is a general-purpose biomedical AI agent designed to autonomously perform complex research tasks across a wide range of scientific domains, combining language model reasoning with structured planning and execution. It integrates retrieval-augmented generation with code-based execution, allowing it to access external knowledge, process data, and generate testable hypotheses in scientific workflows. The system is built to support researchers by automating repetitive and time-consuming tasks such as literature review, data analysis, and experimental design. Biomni operates within a comprehensive environment that includes tools, APIs, and datasets, enabling it to execute multi-step research processes rather than just generating text responses. ...
    Downloads: 0 This Week
  • 17
    Magic Resume

    Free online AI resume editor

    Magic Resume is a modern, open-source AI-powered resume builder designed to simplify the process of creating professional resumes through an interactive, visually rich web interface. Built with modern frontend technologies such as TanStack Start, TypeScript, and Tailwind CSS, it provides a smooth and responsive user experience enhanced by animation frameworks that make editing intuitive and engaging. The platform offers real-time preview capabilities, allowing users to instantly visualize changes as they build their resume, which significantly improves usability and iteration speed. ...
    Downloads: 0 This Week
  • 18
    TNT

    A lightweight library for PyTorch training tools and utilities

    TNT is a lightweight training framework developed by Meta that simplifies the process of building and managing machine learning training loops using PyTorch. The project focuses on providing a flexible yet structured environment for implementing training pipelines without the complexity of large deep learning frameworks. It introduces modular abstractions that allow developers to organize training logic into reusable components such as trainers, evaluators, and callbacks.
    Downloads: 0 This Week
  • 19
    All-in-RAG

    Big Model Application Development Practice 1

    ...Alongside theoretical explanations, the repository includes hands-on exercises and example projects that demonstrate how to build production-ready RAG systems. These projects guide developers through the process of integrating vector databases, embedding models, and large language models into a unified application.
    Downloads: 0 This Week
  • 20
    TruLens

    Evaluation and Tracking for LLM Experiments

    TruLens is an open-source Python library designed to systematically evaluate and track Large Language Model (LLM) applications. It provides fine-grained instrumentation, feedback functions, and a user interface to compare and iterate on app versions, facilitating rapid development and improvement of LLM-based applications. It includes programmatic tools that assess the quality of inputs, outputs, and intermediate results from LLM applications, enabling scalable evaluation. Fine-grained, stack-agnostic...
    Downloads: 0 This Week
  • 21
    VisualDL

    Deep Learning Visualization Toolkit

    VisualDL, a visualization analysis tool for PaddlePaddle, provides a variety of charts to show the trends of parameters and visualizes model structures, data samples, histograms of tensors, PR curves, ROC curves, and high-dimensional data distributions. It enables users to understand the training process and the model structure more clearly and intuitively so as to optimize models efficiently. VisualDL provides various visualization functions, including tracking metrics in real time, visualizing the model structure, displaying data samples, visualizing the relationship between hyperparameters and model metrics, presenting changes in the distributions of tensors, showing PR curves, projecting high-dimensional data to a lower-dimensional space, and more. ...
    Downloads: 0 This Week
  • 22
    Step-Video-T2V

    State-of-the-art (SoTA) text-to-video pre-trained model

    Step-Video-T2V is a state-of-the-art text-to-video foundation model developed to generate videos from natural-language prompts; its 30B-parameter architecture is designed to produce coherent, temporally extended video sequences — up to around 204 frames — based on input text. Under the hood it uses a compressed latent representation (a Video-VAE) to reduce spatial and temporal redundancy, and a denoising diffusion (or similar) process over that latent space to generate smooth, plausible motion and visuals. The model handles bilingual input (e.g. English and Chinese) thanks to dual encoders, and supports end-to-end text-to-video generation without requiring external assets. Its training and generation pipeline includes techniques like flow-matching, full 3D attention for temporal consistency, and fine-tuning approaches (e.g. video-based DPO) to improve fidelity and reduce artifacts. ...
    Downloads: 2 This Week
  • 23
    HY-World 2.0

    A Multi-Modal World Model for Reconstructing, Generating, Simulation

    ...The system also improves reconstruction from multi-view images and video by upgrading its feed-forward 3D prediction components and its memory-aware view generation process. Another major part of the project is WorldLens, a rendering platform designed for interactive exploration with an engine-agnostic architecture, automatic image-based lighting, collision detection, and support for character interaction.
    Downloads: 1 This Week
  • 24
    Open SWE

    Open source async coding agent that plans, codes, and opens PRs

    ...Open SWE is capable of creating commits and automatically opening pull requests once implementation is complete, effectively closing the loop on development tasks. It also supports interactive feedback during execution, allowing users to guide or adjust the process mid-task. Despite its advanced capabilities, the project has been officially marked as deprecated.
    Downloads: 1 This Week
  • 25
    kg-gen

    Knowledge Graph Generation from Any Text

    ...The framework addresses common problems in automatic knowledge graph construction, particularly sparsity and duplication of entities, by applying a clustering and entity-resolution process that merges semantically similar nodes. This allows the generated graphs to be denser, more coherent, and easier to use for downstream tasks such as retrieval-augmented generation, semantic search, and reasoning systems.
    Downloads: 1 This Week