Search Results for "extensible web spider" - Page 2

Showing 89 open source projects for "extensible web spider"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    FlaskBB

    FlaskBB

    A classic Forum Software in Python using Flask

    ...Besides that, you can also install community-made themes. Our goal is to develop a forum system with modern technologies and sticking to already well-tested and approved layouts. FlaskBB is a simple and extensible forum software that should be seen as the foundation for building communities. It is written in Python using the web framework Flask. FlaskBB is being distributed under the BSD 3-Clause License.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Synapse

    Synapse

    Matrix reference homeserver

    Matrix is an ambitious new ecosystem for open federated Instant Messaging and VoIP. Everything in Matrix happens in a room. Rooms are distributed and do not exist on any single server. Rooms can be located using convenience aliases like #matrix:matrix.org or #test:localhost:8448. Synapse is currently in rapid development, but as of version 0.5 we believe it is sufficiently stable to be run as an internet-facing service for real usage! Create and manage fully distributed chat rooms with no...
    Downloads: 19 This Week
    Last Update:
    See Project
  • 3
    PasteMD

    PasteMD

    Paste Markdown and AI responses into Word Excel instantly fast

    ...It includes intelligent detection mechanisms that distinguish between Markdown tables, rich HTML content, and plain text, ensuring the correct output format is used for each target application. PasteMD also introduces extensible workflows that allow users to configure different paste behaviors.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 4
    LoLLMs WEBUI

    LoLLMs WEBUI

    Local AI WebUI for running and managing large language models offlineA

    lollms-webui is a locally hosted web interface designed to run and manage large language models without relying on external services. It provides users with a centralized environment to interact with multiple AI models, making it suitable for experimentation, development, and personal use. lollms-webui emphasizes offline capability, allowing users to maintain privacy and control over their data while still accessing advanced AI features. It integrates model management tools that help users...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 5
    OWASP Maryam

    OWASP Maryam

    Modular OSINT framework for automated open-source intelligence gatheri

    Maryam is an open source intelligence (OSINT) framework designed to automate the process of gathering and analyzing publicly available information from the internet. It provides a modular environment that enables users to collect data from search engines, open data sources, and various online services for reconnaissance and investigative purposes. Written in Python, Maryam is built to provide a flexible and extensible framework for harvesting information quickly and efficiently from open...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Mopidy

    Mopidy

    Mopidy is an extensible music server written in Python

    Mopidy plays music from local disk, Spotify, SoundCloud, TuneIn, and more. You can edit the playlist from any phone, tablet, or computer using a variety of MPD and web clients. Vanilla Mopidy only plays music from files and radio streams. Through extensions, Mopidy can play music from cloud services like Spotify, SoundCloud, and TuneIn. With Mopidy's extension support, you can easily add backends for new music sources. Mopidy is a Python application that runs in a terminal or in the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    A.I.G

    A.I.G

    Full-stack AI Red Teaming platform

    AI-Infra-Guard is a powerful open-source security platform from Tencent’s Zhuque Lab designed to assess the safety and resilience of AI infrastructures, codebases, and components through automated scanning and evaluation tools. It brings together AI infrastructure vulnerability scanning, MCP server risk analysis, and jailbreak evaluation into a unified workflow so that enterprises and individuals can identify critical security issues without relying on external services. Users can deploy it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    ClawTeam

    ClawTeam

    ClawTeam: Agent Swarm Intelligence (One Command → Full Automation)

    ClawTeam is an advanced multi-agent orchestration framework that enables AI agents to form collaborative swarms capable of solving complex tasks autonomously. Instead of relying on a single agent, the system allows a leader agent to spawn and coordinate multiple specialized sub-agents, each responsible for different aspects of a problem. These agents communicate, share insights, and dynamically adapt their strategies based on real-time feedback, creating a form of collective intelligence....
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Werkzeug

    Werkzeug

    The comprehensive WSGI web application library

    ...Includes a response object that can wrap other WSGI applications and handle streaming data. Includes a routing system for matching URLs to endpoints and generating URLs for endpoints, with an extensible system for capturing variables from URLs. Includes HTTP utilities to handle entity tags, cache control, dates, user agents, cookies, files, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    Mezzanine

    Mezzanine

    CMS framework for Django

    Mezzanine is a powerful open source content management platform built using the Django framework. In many ways it is like many other content management tools, offering an intuitive interface for managing all of your content. But Mezzanine is different in that it provides most of its functionality by default. While other platforms rely heavily on modules or reusable applications, Mezzanine comes ready with all the functionality you need, making it the more efficient choice. Mezzanine has a...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Dolphin

    Dolphin

    Document Image Parsing via Heterogeneous Anchor Prompting”

    Dolphin — maintained by ByteDance — is a project aimed at providing a high-performance, robust, and extensible media or multimedia framework / player infrastructure (or possibly a streaming media solution), intended to meet modern demands for efficiency, flexibility, and integration in media-heavy applications. It seeks to combine performant media playback or handling (audio/video decoding, streaming, buffering) with a modular, developer-friendly API that allows easy embedding into larger...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Eric Integrated Development Environment

    Eric Integrated Development Environment

    Python Development Environment with all batteries included

    Eric is a Python IDE written using PyQt and QScintilla. It provides various features such as any number of open editors, an integrated (remote) debugger, project management facilities, unit test, refactoring and much more.
    Leader badge
    Downloads: 170 This Week
    Last Update:
    See Project
  • 13
    dashAI

    dashAI

    dashAI: an interactive platform for training, evaluating and deploying

    dashAI is an open-source, No-code workbench for Exploratory Data Analysis and classical ML. Visual data preparation, multi-model experiments, XAI explainability, and a plugin-based extensible catalog. The platform guides users through a complete, traceable workflow — data ingestion → visual exploration → preprocessing → model training → evaluation → explainability — without writing a single line of code. Each step is explicit and reversible, keeping the user in control rather than...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14

    TOMUSS

    TOMUSS: The Online Multi User Simple Spreadsheet

    TOMUSS is an interactive web application (groupware) allowing multiple concurrent users to edit data tables. Its primary goal is the management of students grades.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15

    GameServer

    GameServer is a sophisticated, game server designed for online games

    GameServer is a sophisticated, game server designed for massively multiplayer online games. Built with a C++ core and extensible Python scripting, it provides solution for managing persistent game worlds with real-time player interactions, advanced networking, and comprehensive monitoring capabilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Miracode

    Miracode

    A sharp, readable, vector-y version of Monocraft

    A web-based code editor with support for multiple programming languages, designed for collaborative projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Crawlab

    Crawlab

    Distributed web crawler admin platform for spiders management

    Golang-based distributed web crawler management platform, supporting various languages including Python, NodeJS, Go, Java, PHP and various web crawler frameworks including Scrapy, Puppeteer, Selenium. Please use docker-compose to one-click to start up. By doing so, you don't even have to configure MongoDB database. The frontend app interacts with the master node, which communicates with other components such as MongoDB, SeaweedFS and worker nodes.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    Bot on Anything

    Bot on Anything

    Large model-based chatbot builder that can quickly integrate AI models

    Bot on Anything is a versatile open-source AI chatbot builder that lets developers connect large language models such as ChatGPT, Claude, and Gemini to virtually any messaging platform, website, or interface with minimal configuration. At its heart, the project abstracts away the glue logic between AI model APIs and disparate application “channels,” enabling the same bot logic to run in Slack, Telegram, Gmail, enterprise tools, web UIs, or command-line terminals. Configuration is handled...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Security Log Generator

    Security Log Generator

    Generates logs of typical formats that would often be found in a SOC

    Generates logs of typical formats that would often be found in a SOC. As of 31st January 2023, it supports IDS, Web Access and Endpoint log formats. Can generate a specific number of events in a linear fashion or use a waveform to add 'bumpiness' to your data. The code is modular and extensible, adding additional formats can be done with relative ease.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    ReconSpider

    ReconSpider

    Most Advanced Open Source Intelligence (OSINT) Framework

    ...ReconSpider can be used by Infosec Researchers, Penetration Testers, Bug Hunters, and Cyber Crime Investigators to find deep information about their target. ReconSpider aggregate all the raw data, visualize it on a dashboard, and facilitate alerting and monitoring on the data. Recon Spider also combines the capabilities of Wave, Photon and Recon Dog to do a comprehensive enumeration of attack surfaces. Reconnaissance is a mission to obtain information by various detection methods, about the activities and resources of an enemy or potential enemy, or geographic characteristics of a particular area. A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web, typically for the purpose of Web indexing (web spidering).
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    ruia

    ruia

    Async Python framework for fast and flexible web scraping spiders

    Ruia is an asynchronous web scraping micro-framework built for Python that focuses on simplicity, speed, and flexibility when creating web crawlers. Ruia is powered by Python’s asyncio library along with aiohttp, enabling developers to perform concurrent network requests efficiently and scrape data from websites with minimal overhead. Ruia follows a “write less, run faster” philosophy, emphasizing concise code and streamlined spider development.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 22
    BlackWidow

    BlackWidow

    Python web scanner for OSINT gathering and OWASP vulnerability fuzzing

    BlackWidow is a Python-based web application scanning tool designed to crawl target websites and collect open-source intelligence (OSINT) while identifying potential security vulnerabilities. It functions as a web spider that systematically explores a site to gather valuable information such as URLs, dynamic parameters, subdomains, email addresses, and phone numbers associated with the target domain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    CBMPy

    CBMPy

    PySCeS Constraint Based Modelling

    ...CBMPy supports user interaction via: - interactive console or as a library for advanced use - GUI, visual representation of the model, analysis methods - a SOAP based webAPI exposes high level functionality via web services
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    weatherPi

    An extensible weather sensor recording network, based on Raspberry Pis

    A distributed network of sensors with a central data store, to record environmental data. Project goals include: * to serve as a teaching outlet for: ** sensor programming ** distributed data collecting ** web APIs ** database ** web UI for numerical data ** graphing data sensibly It also gives me some infotainment in new (to me) technologies, and as an excuse to scatter ever more Raspberry Pis around my home.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Photon

    Photon

    Incredibly fast crawler designed for OSINT

    ...Despite its speed focus, the tool still provides useful filtering and extraction capabilities for analysts who need structured results. Overall, Photon functions as a lightweight yet powerful reconnaissance spider for web intelligence gathering.
    Downloads: 5 This Week
    Last Update:
    See Project