Showing 246 open source projects for "process"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    HY-World 2.0

    HY-World 2.0

    A Multi-Modal World Model for Reconstructing, Generating, Simulation

    ...The system also improves reconstruction from multi-view images and video by upgrading its feed-forward 3D prediction components and its memory-aware view generation process. Another major part of the project is WorldLens, a rendering platform designed for interactive exploration with an engine-agnostic architecture, automatic image-based lighting, collision detection, and support for character interaction.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    tiny-llm

    tiny-llm

    A course of learning LLM inference serving on Apple Silicon

    tiny-llm is an educational open-source project designed to teach system engineers how large language model inference and serving systems work by building them from scratch. The project is structured as a guided course that walks developers through the process of implementing the core components required to run a modern language model, including attention mechanisms, token generation, and optimization techniques. Rather than relying on high-level machine learning frameworks, the codebase uses mostly low-level array and matrix manipulation APIs so that developers can understand exactly how model inference works internally. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Paper2Slides

    Paper2Slides

    From Paper to Presentation in One Click

    ...It is designed to replace the repetitive work of turning dense technical documents into presentation-friendly structure by extracting key points, figures, and data into a coherent visual narrative. The system supports multiple input formats, so you can process PDFs and common office documents rather than being locked to a single file type. It uses an extraction approach intended to capture critical insights comprehensively, including important visuals and data points that often get missed in naive summarization. A major focus is traceability: generated slide content is designed to remain linked back to the source material so you can verify accuracy and reduce information drift. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    VGGSfM

    VGGSfM

    VGGSfM: Visual Geometry Grounded Deep Structure From Motion

    ...It leverages tools like PyCOLMAP, poselib, LightGlue, and PyTorch3D for feature matching, pose estimation, and visualization. With minimal configuration, users can process single scenes or full video sequences, apply motion masks to exclude moving objects, and train neural radiance or splatting models directly from reconstructed outputs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 5
    Pedalboard

    Pedalboard

    A Python library for audio

    ...Internally at Spotify, pedalboard is used for data augmentation to improve machine learning models and to help power features like Spotify’s AI DJ and AI Voice Translation. pedalboard also helps in the process of content creation, making it possible to add effects to audio without using a Digital Audio Workstation.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    ModelScope

    ModelScope

    Bring the notion of Model-as-a-Service to life

    ModelScope is built upon the notion of “Model-as-a-Service” (MaaS). It seeks to bring together most advanced machine learning models from the AI community, and streamlines the process of leveraging AI models in real-world applications. The core ModelScope library open-sourced in this repository provides the interfaces and implementations that allow developers to perform model inference, training and evaluation. In particular, with rich layers of API abstraction, the ModelScope library offers unified experience to explore state-of-the-art models spanning across domains such as CV, NLP, Speech, Multi-Modality, and Scientific-computation. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    autoresearch for AMD

    autoresearch for AMD

    AI agents running research on single-GPU nanochat training

    ...The system is built around a minimal structure that includes a data preparation module, a training script that can be modified, and a program specification that guides the agent’s decision-making process. During each iteration, the agent edits the training code, runs an experiment within a fixed time budget, evaluates performance metrics, and decides whether to retain or discard the changes. This loop allows the system to explore a wide range of architectural and hyperparameter configurations without human intervention. The framework emphasizes simplicity and reproducibility, ensuring that experiments are comparable and results are traceable over time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    autoresearch-macos

    autoresearch-macos

    AI agents running research on single-GPU nanochat training

    ...The project typically includes components such as data preparation scripts, a training loop, and an instruction file that guides the agent’s behavior. By automating experimentation and optimization, it allows continuous improvement without manual intervention, effectively turning research into a self-improving process.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    FastAgency

    FastAgency

    The fastest way to bring multi-agent workflows to production

    FastAgency is a framework that simplifies the creation and deployment of AI-driven automation agents. It provides a structured environment for developing AI assistants capable of handling various business and technical tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 10
    Step-Video-T2V

    Step-Video-T2V

    State-of-the-art (SoTA) text-to-video pre-trained model

    Step-Video-T2V is a state-of-the-art text-to-video foundation model developed to generate videos from natural-language prompts; its 30B-parameter architecture is designed to produce coherent, temporally extended video sequences — up to around 204 frames — based on input text. Under the hood it uses a compressed latent representation (a Video-VAE) to reduce spatial and temporal redundancy, and a denoising diffusion (or similar) process over that latent space to generate smooth, plausible motion and visuals. The model handles bilingual input (e.g. English and Chinese) thanks to dual encoders, and supports end-to-end text-to-video generation without requiring external assets. Its training and generation pipeline includes techniques like flow-matching, full 3D attention for temporal consistency, and fine-tuning approaches (e.g. video-based DPO) to improve fidelity and reduce artifacts. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Open Gauss

    Open Gauss

    Project-scoped Lean workflow orchestrator from Math, Inc.

    ...The database organizes data using the relational model, storing structured information in tables composed of rows and columns while supporting standard SQL for querying and management. One of its defining strengths is its optimization for multi-core and distributed environments, allowing it to efficiently process high volumes of concurrent transactions with minimal latency. OpenGauss also incorporates AI-based optimization techniques, such as intelligent query planning, performance prediction, and automated tuning, which help reduce operational complexity and improve efficiency.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    DeepAnalyze

    DeepAnalyze

    Autonomous LLM agent for end-to-end data science workflows

    ...DeepAnalyze is capable of conducting open-ended data research across multiple data formats such as structured tables, semi-structured files, and unstructured text, enabling flexible and comprehensive analysis workflows. It integrates execution-based reasoning by generating and running code as part of its analysis process, allowing it to iteratively refine results and produce more accurate outputs. DeepAnalyze provides multiple interaction interfaces, including a web-based UI, a command-line interface, and a Jupyter-style notebook environment for interactive workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    LLM-Aided OCR Project

    LLM-Aided OCR Project

    Enhances Tesseract OCR output using LLMs (local or API)

    ...The system first extracts raw text using OCR engines and then applies language models to analyze and correct recognition errors based on context. This AI-assisted correction process helps reconstruct missing characters, fix formatting mistakes, and produce more coherent text outputs. The project is particularly useful for digitizing historical documents, research papers, and scanned materials where traditional OCR often struggles. It also includes tools for processing batches of images or documents, enabling automated document digitization workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    CowAgent

    CowAgent

    AI assistant based on large models that can actively think and plan

    CowAgent, based on the chatgpt-on-wechat project, is an open-source AI agent framework that integrates large language models into the WeChat ecosystem to create intelligent conversational assistants. It enables automated message handling by connecting WeChat accounts with AI models that can generate contextual replies, process voice messages, and produce images directly inside chats. The platform has evolved beyond a simple chatbot into a more autonomous agent capable of planning complex tasks, maintaining long-term memory, and invoking external tools to complete workflows. It supports multi-turn conversations with per-user context tracking, allowing more natural and persistent interactions across private and group chats. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    NVIDIA Earth2Studio

    NVIDIA Earth2Studio

    Open-source deep-learning framework

    ...It provides a unified API that lets researchers, data scientists, and engineers build complex forecasting and analysis pipelines by combining modular prognostic and diagnostic AI models with a diverse range of real-world data sources such as global forecast systems, reanalysis datasets, and satellite feeds. The toolkit makes it easy to run deterministic and ensemble forecasts, swap models interchangeably, and process large geophysical datasets with Xarray structures, enabling experimentation with state-of-the-art deep learning models for climate and atmospheric prediction. Users can extend Earth2Studio with optional model packs, advanced data interfaces, statistical operators, and backend integrations that support flexible workflows from simple tests to large-scale operational inference.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Nerfstudio

    Nerfstudio

    A collaboration friendly studio for NeRFs

    Nerfstudio provides a simple API that allows for a simplified end-to-end process of creating, training, and testing NeRFs. The library supports a more interpretable implementation of NeRFs by modularizing each component. With more modular NeRFs, we hope to create a more user-friendly experience in exploring the technology. This is a contributor-friendly repo with the goal of building a community where users can more easily build upon each other’s contributions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Qwen-Agent

    Qwen-Agent

    Agent framework and applications built upon Qwen>=3.0

    Qwen-Agent is a framework for building applications / agents using Qwen models (version 3.0+). It provides components for instruction following, tool usage (function calling), planning, memory, RAG (retrieval augmented generation), code interpreter, etc. It ships with example applications (Browser Assistant, Code Interpreter, Custom Assistant), supports GUI front-ends, backends, server setups. Agent workflow can maintain context / memory to perform multi-turn or more complex logic over time....
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Train LLM From Scratch

    Train LLM From Scratch

    A straightforward method for training your LLM

    ...It is based on the architecture described in Attention Is All You Need and is designed to make the training pipeline understandable rather than hidden behind a large framework. The repository walks through the process from downloading data to generating text with a trained model. It supports training smaller or larger models, including million- and billion-parameter configurations depending on available hardware. A major goal is accessibility, since the author frames it as possible to train models using a single GPU. It is most useful for learners, researchers, and developers who want practical exposure to LLM internals.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    NVIDIA AI Blueprint

    NVIDIA AI Blueprint

    Suite of reference architectures for building GPU-accelerated vision

    NVIDIA AI Blueprint is an AI blueprint for building GPU-accelerated video intelligence applications and vision agents. It combines accelerated vision microservices, vision language models, large language models, embeddings, and NVIDIA NIM microservices to process both stored and streaming video. The project is organized around real-time video intelligence, downstream analytics, and agentic offline processing. It supports workflows such as natural-language video search, visual question answering, long-video summarization, clip retrieval, verified alerts, and incident analysis. It is designed for technical users who need deployable reference architectures for smart spaces, warehouse automation, SOP validation, monitoring, and operational video analytics. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Alpamayo 1

    Alpamayo 1

    Bridging Reasoning and Action Prediction

    ...The model is designed as a foundational component rather than a complete driving stack, allowing developers to build custom autonomous vehicle applications on top of it. It incorporates vision-language-action modeling, enabling it to process sensor data and contextual information simultaneously. Alpamayo supports tasks such as trajectory prediction, auto-labeling, and reasoning-based decision making. The system is optimized for high-performance GPU environments and is intended primarily for experimentation and benchmarking. Overall, it represents an advanced step toward integrating reasoning into autonomous driving pipelines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GEO Content Writer

    GEO Content Writer

    Backlog-row-first content production system for teams

    ...The tool is particularly useful for businesses targeting local markets or region-specific audiences. It integrates into broader SEO pipelines, allowing content generation to be part of a continuous optimization process. Overall, GEO Content Writer enables scalable, AI-driven content creation tailored for modern search ecosystems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    MedgeClaw

    MedgeClaw

    Open-source AI research assistant for biomedicine

    ...The system connects conversational interfaces with computational environments, allowing users to initiate research tasks through messaging platforms while the backend executes analyses using tools like R and Python. It includes a real-time dashboard that displays progress, generated code, and outputs, providing transparency throughout the research process. MedgeClaw also supports reproducibility by generating structured reports and maintaining consistent environments through containerization. Its architecture combines conversational AI, automated pipelines, and scientific tooling into a unified workflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Biomni

    Biomni

    Biomni: a general-purpose biomedical AI agent

    Biomni is a general-purpose biomedical AI agent designed to autonomously perform complex research tasks across a wide range of scientific domains, combining language model reasoning with structured planning and execution. It integrates retrieval-augmented generation with code-based execution, allowing it to access external knowledge, process data, and generate testable hypotheses in scientific workflows. The system is built to support researchers by automating repetitive and time-consuming tasks such as literature review, data analysis, and experimental design. Biomni operates within a comprehensive environment that includes tools, APIs, and datasets, enabling it to execute multi-step research processes rather than just generating text responses. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    TNT

    TNT

    A lightweight library for PyTorch training tools and utilities

    TNT is a lightweight training framework developed by Meta that simplifies the process of building and managing machine learning training loops using PyTorch. The project focuses on providing a flexible yet structured environment for implementing training pipelines without the complexity of large deep learning frameworks. It introduces modular abstractions that allow developers to organize training logic into reusable components such as trainers, evaluators, and callbacks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    All-in-RAG

    All-in-RAG

    Big Model Application Development Practice 1

    ...Alongside theoretical explanations, the repository includes hands-on exercises and example projects that demonstrate how to build production-ready RAG systems. These projects guide developers through the process of integrating vector databases, embedding models, and large language models into a unified application.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo