Showing 484 open source projects for "process"

  • 1
    hCaptcha Challenger

    Gracefully face hCaptcha challenges with multimodal LLMs

    hCaptcha Challenger is an open-source automation framework designed to solve hCaptcha verification challenges using computer vision models and multimodal reasoning techniques. The project integrates machine learning models capable of analyzing visual captcha tasks and identifying the correct responses required to pass the verification process. Instead of relying on third-party captcha-solving services or browser scripts, the system operates independently by using pretrained neural networks that can classify images, detect objects, and interpret spatial relationships. The framework includes support for multiple types of captcha challenges such as object selection, drag-and-drop puzzles, and image labeling tasks. ...
    Downloads: 1 This Week
  • 2
    Paper2Slides

    From Paper to Presentation in One Click

    ...It is designed to replace the repetitive work of turning dense technical documents into presentation-friendly structure by extracting key points, figures, and data into a coherent visual narrative. The system supports multiple input formats, so you can process PDFs and common office documents rather than being locked to a single file type. It uses an extraction approach intended to capture critical insights comprehensively, including important visuals and data points that often get missed in naive summarization. A major focus is traceability: generated slide content is designed to remain linked back to the source material so you can verify accuracy and reduce information drift. ...
    Downloads: 1 This Week
  • 3
    Step-Video-T2V

    State-of-the-art (SoTA) text-to-video pre-trained model

    Step-Video-T2V is a state-of-the-art text-to-video foundation model developed to generate videos from natural-language prompts; its 30B-parameter architecture is designed to produce coherent, temporally extended video sequences — up to around 204 frames — based on input text. Under the hood it uses a compressed latent representation (a Video-VAE) to reduce spatial and temporal redundancy, and a denoising diffusion (or similar) process over that latent space to generate smooth, plausible motion and visuals. The model handles bilingual input (e.g. English and Chinese) thanks to dual encoders, and supports end-to-end text-to-video generation without requiring external assets. Its training and generation pipeline includes techniques like flow-matching, full 3D attention for temporal consistency, and fine-tuning approaches (e.g. video-based DPO) to improve fidelity and reduce artifacts. ...
    Downloads: 2 This Week
  • 4
    VGGSfM

    VGGSfM: Visual Geometry Grounded Deep Structure From Motion

    ...It leverages tools like PyCOLMAP, poselib, LightGlue, and PyTorch3D for feature matching, pose estimation, and visualization. With minimal configuration, users can process single scenes or full video sequences, apply motion masks to exclude moving objects, and train neural radiance or splatting models directly from reconstructed outputs.
    Downloads: 1 This Week
  • 5
    Koog

    Koog is the official Kotlin framework for building AI agents

    Koog is a Kotlin-based framework for building and running AI agents entirely in idiomatic Kotlin, supporting both single-run agents that process individual inputs and complex workflow agents with custom strategies and configurations. It features a pure Kotlin implementation, seamless Model Context Protocol (MCP) integration for enhanced model management, vector embeddings for semantic search, and a flexible system for creating and extending tools that access external systems and APIs. ...
    Downloads: 1 This Week
  • 6
    WrenAI

    Open-source SQL AI Agent for Text-to-SQL. Make Text2SQL Easy

    ...Wren AI has implemented a semantic engine architecture that gives the LLM the context of your business; you can easily establish a logical presentation layer on your data schema that helps the LLM learn more about your business context. With Wren AI, you can describe metadata, schema, terminology, data relationships, and the logic behind calculations and aggregations in “Modeling Definition Language” so that it generates accurate SQL queries with semantic context. When you start a new conversation in Wren AI, your question is used to find the most relevant tables. From these, the LLM generates three relevant questions for the user to choose from. ...
    Downloads: 1 This Week
  • 7
    Future AGI

    Open-source platform for evaluating, observing, and improving LLM

    ...Future AGI is especially relevant for agent-heavy products where reliability, regression testing, and safety checks matter before and after release. Its main value is turning AI agent development into a measurable engineering process instead of an informal cycle of prompting, guessing, and manual review.
    Downloads: 0 This Week
  • 8
    Anything to NotebookLM

    Multi-source content processor for NotebookLM

    ...The project uses natural-language commands, so the user can ask for a podcast, slide deck, mind map, report, quiz, flashcards, or infographic without manually building the workflow. It supports multilingual material, with especially strong use cases for Chinese and English content. The tool can process files locally, extract or transcribe content when needed, and hand the cleaned material to NotebookLM for generation. It is best suited for researchers, students, content curators, and knowledge workers who regularly turn scattered information into organized learning assets.
    Downloads: 0 This Week
  • 9
    Ollama RAG Chatbot

    Chat with multiple PDFs locally

    ...Model support is flexible, with compatibility for both Hugging Face models and Ollama-based models, and the interface is delivered through Gradio for a lightweight user experience. The main value of the project is its ability to process multiple PDF inputs and turn them into a question-answering workflow centered on document retrieval. With Docker support, script-based setup, optional ngrok exposure, and a clear local run path, it serves as a compact starter project for people who want a hands-on, self-hosted PDF chat system.
    Downloads: 0 This Week
  • 10
    Claude Autoresearch

    Claude Autoresearch Skill, autonomous goal-directed iteration

    Claude Autoresearch is an autonomous research assistant system that automates the process of exploring, collecting, and synthesizing information across multiple iterations. It is designed to mimic human research behavior by generating queries, evaluating results, and refining its approach based on previous findings. The system likely integrates with external data sources, allowing it to gather information from diverse inputs and organize it into structured outputs.
    Downloads: 0 This Week
  • 11
    pi-autoresearch

    Autonomous experiment loop extension for pi

    ...It is designed to simulate a continuous research loop where queries are generated, refined, and expanded based on previous outputs, enabling deeper exploration of complex topics. The system likely integrates with external data sources or APIs to retrieve information and process it into structured insights. Its architecture suggests a focus on autonomy, allowing it to run multi-step research pipelines that mimic human investigative processes. This makes it particularly useful for exploratory analysis, trend discovery, or generating structured knowledge from large information spaces. Overall, pi-autoresearch represents a step toward self-directed research agents capable of producing increasingly refined outputs over time.
    Downloads: 0 This Week
  • 12
    model2Vec

    Fast State-of-the-Art Static Embeddings

    model2vec is an innovative embedding framework that converts large sentence transformer models into compact, high-speed static embedding models while preserving much of their semantic performance. The project focuses on dramatically reducing the computational cost of generating embeddings, achieving significant improvements in speed and model size without requiring large datasets for retraining. By using a distillation-based approach, it can produce lightweight models that run efficiently on...
    Downloads: 0 This Week
  • 13
    Godot MCP

    MCP server for interfacing with Godot game engine

    ...It also includes advanced features for manipulating scenes, managing assets, and editing project structures, making it possible to automate large portions of the development process. By exposing Godot functionality through a standardized MCP interface, it ensures compatibility with various AI clients such as Claude Code or Cursor.
    Downloads: 0 This Week
  • 14
    Reflexion

    Reflexion: Language Agents with Verbal Reinforcement Learning

    ...The framework introduces a mechanism where agents maintain a memory of past attempts and use that memory to guide future decisions, effectively simulating a learning process without requiring traditional model retraining. This approach is particularly useful for complex reasoning tasks, coding challenges, and decision-making scenarios where initial outputs may be incomplete or incorrect. Reflexion also emphasizes transparency by making intermediate reasoning steps explicit, allowing developers to inspect how conclusions are reached and where improvements occur.
    Downloads: 0 This Week
  • 15
    SemTools

    Semantic search and document parsing tools for the command line

    SemTools is an open-source command-line toolkit designed for document parsing, semantic indexing, and semantic search workflows. The project focuses on enabling developers and AI agents to process large document collections and extract meaningful semantic representations that can be searched efficiently. Built with Rust for performance and reliability, the toolchain provides fast processing of text and structured documents while maintaining low system overhead. SemTools can parse documents, build semantic embeddings, and perform similarity searches across datasets, making it useful for research, knowledge management, and AI-assisted coding workflows. ...
    Downloads: 0 This Week
  • 16
    Cog

    Package and deploy machine learning models using Docker containers

    Cog is an open source tool designed to package machine learning models into standardized, production-ready containers. It simplifies the process of deploying models by automatically generating Docker images based on a simple configuration file, eliminating the need to manually write complex Dockerfiles. Developers can define the runtime environment, dependencies, and Python versions required for their models, allowing Cog to build a consistent container environment that follows best practices. ...
    Downloads: 0 This Week
  • 17
    VoxelMorph

    Unsupervised Learning for Image Registration

    VoxelMorph is an open-source deep learning framework designed for medical image registration, a process that aligns multiple medical scans into a common spatial coordinate system. Traditional image registration techniques typically rely on optimization procedures that must be executed separately for each pair of images, which can be computationally expensive and slow. VoxelMorph approaches the problem using neural networks that learn to predict deformation fields that transform one image so that it aligns with another. ...
    Downloads: 0 This Week
  • 18
    DATA SCIENCE ROADMAP

    Data Science Roadmap from A to Z

    DATA SCIENCE ROADMAP is an educational repository designed to guide learners through the process of becoming proficient in data science and machine learning. The project presents a structured roadmap that outlines the knowledge and skills required for different stages of a data science career. Topics typically include programming with Python, statistics, mathematics, machine learning algorithms, data visualization, and big data technologies.
    Downloads: 0 This Week
  • 19
    CUDA Containers for Edge AI & Robotics

    Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

    ...By using containerized environments, developers can ensure that their applications run consistently across different Jetson platforms and JetPack versions. The repository also includes build tools and package management utilities that help automate the process of assembling machine learning environments.
    Downloads: 0 This Week
  • 20
    Jina-Serve

    Build multimodal AI applications with cloud-native stack

    ...Jina Serve focuses on making it easier to turn machine learning models into production-ready services without forcing developers to manage complex infrastructure manually. The framework supports many major machine learning libraries and data types, making it suitable for multimodal AI systems that process text, images, audio, and other inputs.
    Downloads: 0 This Week
  • 21
    Better Agents

    Standards for building agents, better

    ...Rather than being a full execution framework itself, Better-Agents focuses on enhancing coding assistants and agent development tools by embedding standardized guidelines into the development process. The system generates structured project files, including configuration documents that define the architecture, roles, and capabilities of the agent system. By following these conventions, developers can ensure that their agents adhere to widely accepted design patterns and operational standards.
    Downloads: 0 This Week
  • 22
    LLM Vision

    Visual intelligence for your home.

    ...Instead of relying only on traditional object detection pipelines, it allows users to send prompts about visual content and receive contextual descriptions or answers about what is happening in camera footage. The system can process events from surveillance platforms such as Frigate and convert them into meaningful summaries, notifications, or structured data for automation workflows. It also maintains a timeline of analyzed camera events that can be displayed in dashboards or queried through the assistant interface.
    Downloads: 0 This Week
  • 23
    MetaScreener

    AI-powered tool for efficient abstract and PDF screening

    MetaScreener is an open-source AI-assisted tool designed to streamline the screening process in systematic literature reviews and academic research workflows. The system helps researchers analyze large collections of academic abstracts and research papers to determine which studies are relevant for inclusion in evidence synthesis projects. Instead of manually reviewing hundreds or thousands of documents, researchers can use MetaScreener to apply machine learning techniques that assist with classification and prioritization of candidate papers. ...
    Downloads: 0 This Week
  • 24
    DATAGEN

    AI-driven multi-agent research assistant automating hypothesis

    ...The system coordinates multiple specialized AI agents that collaborate to perform tasks such as hypothesis generation, data collection, analysis, visualization, and report creation. Instead of requiring users to manually orchestrate each stage of a research process, the platform allows these agents to coordinate automatically and handle the workflow end-to-end. The project integrates several modern AI frameworks including LangChain, LangGraph, and large language models to manage reasoning and data processing tasks. Through this architecture, the system can combine structured data analysis with natural language reasoning to generate insights and research outputs. ...
    Downloads: 0 This Week
  • 25
    LLMs-Zero-to-Hero

    From nobody to big model (LLM) hero

    LLMs-Zero-to-Hero is an open-source educational project designed to guide learners through the complete process of understanding and building large language models from the ground up. The repository presents a structured learning pathway that begins with fundamental concepts in machine learning and progresses toward advanced topics such as model pre-training, fine-tuning, and deployment. Rather than relying entirely on existing frameworks, the project encourages readers to implement important components themselves in order to gain a deeper understanding of how modern language models work internally. ...
    Downloads: 0 This Week