Showing 153 open source projects for "web process"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    OpsManage

    OpsManage

    Automated operation and maintenance platform

    Automated operation and maintenance platform: code and application deployment CI/CD, asset management CMDB, scheduled task management platform, SQL audit | rollback, task scheduling, on-site WIKI. A code deployment, application deployment, scheduled tasks, and equipment asset management platform. Welcome to star or fork my open source project. If you need to quote the project code in your own project, please declare the agreement and copyright information in the project. Programming...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Moriarty Project

    Moriarty Project

    Web-based OSINT tool for investigating phone number information

    Moriarty Project is an open source web-based investigation tool designed to gather publicly available information about phone numbers. It allows users to input a phone number and analyze various details related to that number through multiple investigation features. It performs information gathering by scraping data from online sources to retrieve insights such as owner information, spam risk, and related web references.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    Pattern

    Pattern

    Web mining module for Python, with tools for scraping

    Pattern is an open-source Python library that provides tools for web mining, natural language processing, machine learning, and network analysis. The project integrates multiple capabilities into a single framework that allows developers to collect, process, and analyze textual data from the web. It includes modules for web scraping and crawling that can retrieve information from sources such as social media platforms, search engines, and online knowledge bases. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    grab-site

    grab-site

    Web crawler for archiving and backing up sites into WARC archives

    grab-site is an open source web crawling tool designed to archive and back up websites by recursively downloading their content. It works by taking a starting URL and systematically following links across the site, capturing pages and resources and saving them into WARC archive files for long-term preservation. Internally, the crawler uses a fork of the wpull engine to fetch and process web pages efficiently during large-scale crawls. grab-site includes a built-in dashboard that displays real-time crawl activity, including which URLs are currently being processed and how many remain in the queue. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 5
    pspider

    pspider

    Simple Python framework for building multithreaded web crawlers

    PSpider is a lightweight web crawling framework written in Python designed to simplify the development of custom web spiders. It focuses on providing an easy-to-understand architecture while still supporting concurrent crawling for improved performance. It uses a multithreaded model that separates the crawling workflow into several components responsible for fetching, parsing, and saving data.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    mlscraper

    mlscraper

    ML-based HTML scraper that learns extraction rules from examples

    ...It analyzes those examples within the HTML document and determines patterns or rules that can be used to extract the same type of information from similar pages. Once trained, the generated scraper can process new pages and return the extracted data in structured formats such as dictionaries or lists. This approach simplifies web scraping tasks by shifting the focus from rule-writing to example-based training. Internally, the project processes HTML documents, identifies relevant elements in the DOM, and builds extraction logic based on statistical or heuristic analysis of the training samples. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    Envelop (Risk and Audits software)

    Governance, Risk, and Compliance Management Tool

    Envelop is a management tool that focuses on governance, risk and compliance processes and documentation. Auditing and managing audit workflows is the main feature of this tool. Envelop is an audit software, that helps with internal and external audits to be managed. The common usage of this tool would be for audits, e.g. financial audits done by internal or external auditors. The overall structure follows this hierarchy : Process > Objective > Risk > Control > Test > Finding You...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Automation Orchestrator

    Automation Orchestrator

    Automation Orchestrator application to orchestrate digital processes

    The Automation Orchestrator allows you to run a local Django Web Application to administrate and orchestrate automation scripts such as Nintex RPA botflows but you may also orchestrate other automation such as Python scripts, batch files, and other RPA tools. The Automation Orchestrator is tested with and supports Nintex RPA version = 15.1. As described on the official Django project site, it "is a high-level Python Web framework that encourages rapid development and clean, pragmatic design....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    MITRE Annotation Toolkit

    A toolkit for managing and manipulating text annotations

    The MITRE Annotation Toolkit (MAT) is a suite of tools which can be used for automated and human tagging of annotations. Annotation is a process, used mostly by researchers in natural language processing, of enhancing documents with information about the various phrase types the documents contain. MAT supports both UI interaction and command-line interaction, and provides various levels of control over the overall annotation process. It can be customized for specific tasks (e.g.,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 10
    Glazier

    Glazier

    A tool for automating the installation of Windows OS

    Glazier is an automation framework developed by Google for deploying and managing Windows operating systems at scale. It streamlines the entire Windows imaging process by booting systems into the Windows Preinstallation Environment (WinPE), retrieving installation instructions from a web server, and automatically applying operating systems, software, and configurations. The tool is fully text-based and code-driven, with configurations written in YAML, allowing teams to leverage source control for versioning, collaboration, and quality assurance. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    ruia

    ruia

    Async Python framework for fast and flexible web scraping spiders

    Ruia is an asynchronous web scraping micro-framework built for Python that focuses on simplicity, speed, and flexibility when creating web crawlers. Ruia is powered by Python’s asyncio library along with aiohttp, enabling developers to perform concurrent network requests efficiently and scrape data from websites with minimal overhead. Ruia follows a “write less, run faster” philosophy, emphasizing concise code and streamlined spider development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    TimothyDocs

    TimothyDocs

    Timothy is a cloud base storage system designed to document your work

    Timothy is a cloud based documentation system. Timothy will document any endeavor because it will store not only the documents created during the project but also store information about those files. Like most storge schemes timothy creates a hierarchy of categories through which one may browse. Timothy displays information about the document or category as well as its name. This use of meta data explains the structure and content of the project to the user as he browses. Users...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Brewery HMI

    Web app for all-grain brewers. Automate your electric brewery.

    This project is UI to guide the user through the steps of home brewing. Each beer has a recipe of ingredients, but it also has a “recipe” for the process to make the beer. This app allows you to load a recipe (BeerXML), guide you through the brewing steps and record any details so that you can improve next time. By default, this app relies on user inputs and feedback from the real-world. There are easy hooks to a link up a real system, so that if you wanted to automate some of the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Free Queue Manager

    Free Queue Manager

    Web based python-flask Queue management system

    A web based management system developed for the purpose of easing the process of orgnizing queues and lines. Like many other (QMS)s Queue Management Systems, FQM does provide a basic dashboard to allow the users of the system and customers alike to interact with the system via a basic yet simple user interface . Brief user guide can be found on https://fqms.github.io/images/user_guide.pdf
    Leader badge
    Downloads: 21 This Week
    Last Update:
    See Project
  • 15
    COCO Annotator

    COCO Annotator

    Web-based image segmentation tool for object detection & localization

    COCO Annotator is a web-based image annotation tool designed for versatility and efficiently label images to create training data for image localization and object detection. It provides many distinct features including the ability to label an image segment (or part of a segment), track object instances, label objects with disconnected visible parts, and efficiently store and export annotations in the well-known COCO format.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    GoogleScraper

    GoogleScraper

    Python tool for scraping search engine results from many providers

    GoogleScraper is a Python-based tool designed to automatically collect and process search engine results from multiple providers. It enables developers and researchers to programmatically query search engines and extract useful information such as links, titles, and result descriptions. GoogleScraper supports several major search engines and can be used to gather structured datasets from search result pages for further analysis. It provides two different scraping approaches: sending direct...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    AttackSurfaceMapper

    AttackSurfaceMapper

    Automated tool for mapping & expanding organization’s attack surface

    ...It performs both brute-force and passive enumeration techniques to uncover infrastructure components that may not be immediately visible. After building an expanded list of targets, AttackSurfaceMapper collects intelligence such as screenshots of web applications, information about exposed services, and possible vulnerabilities identified through integrated services. It can also search for publicly exposed credentials.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Buster

    Buster

    OSINT tool for discovering information linked to email addresses

    ...Buster supports validating generated email addresses and retrieving contextual information about them. By combining multiple online sources and services, Buster helps automate the process of gathering intelligence related to digital identities.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    ECommerceCrawlers

    ECommerceCrawlers

    Collection of Python ecommerce and website crawler examples projects

    ...It aims to help developers understand the full workflow of web scraping, including request simulation, data extraction, storage, and handling anti-scraping techniques. It includes crawlers for platforms such as ecommerce marketplaces, blogging platforms, recruitment sites, and social networks, providing real-world practice scenarios. Developers can study the individual project documentation to understand the analysis process.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Walle 2.0

    Walle 2.0

    Walle 2.0 web

    ...Permissions, notifications, file fingerprints. Independent space resources like gitlab. Custom environment, environment variables. Project administrator, a notification mechanism, custom hooks in each process. Get through server management, grayscale release. Advanced gitlab-like user management mode. Online order application, review, grayscale release, real-time websocket. Walle allows users to release code and finally can not only choose jenkins! Support various web code releases, release and rollback of php, java, python, go and other codes can be completed with one click through the web. walle is a freely configurable project, more user-friendly, high-value, an open source online deployment system that supports git, multi-user, multi-language, multi-project, and multi-environment simultaneous deployment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Raven Python

    Raven Python

    Raven is the legacy Python client for Sentry

    Raven is the official legacy Python client for Sentry, officially supports Python 2.6–2.7 & 3.3–3.7, and runs on PyPy and Google App Engine. It tracks errors and exceptions that happen during the execution of your application and provides instant notification with detailed information needed to prioritize, identify, reproduce, and fix each issue. It provides full out-of-the-box support for many of the popular Python frameworks, including Django, and Flask. Raven also includes drop-in support...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    etalon

    Standards Document Redaction System

    The best process for creating standards is that of the IETF’s RFCs. Standard documents are written, submitted for publication, reviewed, and finally approved. Then they get a official number and hence can be referenced. Some can even be blessed to be an official standard. Etalon will implement this whole process, web-based and with markdown documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    pyspider

    pyspider

    A powerful Spider(Web Crawler) system in Python

    pyspider is a powerful Spider(Web Crawler) system in Python. Components are connected by message queue. Every component, including message queue, is running in their own process/thread, and replaceable. That means, when process is slow, you can have many instances of processor and make full use of multiple CPUs, or deploy to multiple machines. This architecture makes pyspider really fast. benchmarking.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    WeChatSogou

    WeChatSogou

    Python library to crawl and retrieve data from WeChat accounts

    WechatSogou is an open source Python library designed to retrieve data from WeChat official accounts by using the Sogou WeChat search service as its data source. It provides developers with a programmatic way to search for public accounts and collect article information without manually browsing the search interface. It functions as a crawler interface that sends requests to the search engine, retrieves results, and converts the returned pages into structured data that can be used in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    gain

    gain

    Asyncio-based Python framework for building fast web crawling spiders

    Gain is a Python web crawling framework designed to simplify the process of building efficient and scalable web scrapers. It is built on top of asynchronous technologies such as asyncio, aiohttp, and uvloop to support high-performance crawling with concurrent network requests. It provides a structured framework for creating spiders that can navigate websites, extract structured data, and process the collected results.
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB