Showing 658 open source projects for "python web crawler"

  • 1
    HASS Configurator

    Configuration UI for Home Assistant

    The HASS Configurator is a small web app (you access it via a web browser) that provides a filesystem browser and text-editor to modify files on the machine the configurator is running on. It has been created to allow easy configuration of Home Assistant. It is powered by Ace editor, which supports syntax highlighting for various code/markup languages. YAML files (the default language for Home Assistant configuration files) will be automatically checked for syntax errors while editing...
    Downloads: 0 This Week
  • 2
    dirsearch

    Web path scanner

    An advanced command-line tool designed to brute-force directories and files on web servers, also known as a web path scanner. A wordlist is a text file in which each line is a path. Unlike other tools, dirsearch only replaces the %EXT% keyword with the extensions from the -e flag. For wordlists without %EXT% (like SecLists), the -f | --force-extensions switch is required to append extensions to every word in the wordlist, as well as the /. To use multiple wordlists, you can separate your wordlists with commas...
    Downloads: 3 This Week
  • 3
    Wooey

    A Django app that creates automatic web UIs for Python scripts

    Wooey is a simple web interface to run command line Python scripts. Think of it as an easy way to get your scripts up on the web for routine data analysis, file processing, or anything else. The project was inspired by how simply and powerfully sandman could expose users to a database and by how Gooey turns ArgumentParser-based command-line scripts into WxWidgets GUIs. Originally two separate projects (Django-based djangui by Chris Mitchell and Flask-based Wooey by Martin Fitzpatrick) it has...
    Downloads: 0 This Week
  • 4
    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package for retrieving and managing data from the IMDb movie database about movies, people, characters and companies. It is platform-independent and can retrieve data from both IMDb's web server and a local copy of the whole database.
    Downloads: 3 This Week
  • 5

    Render 4 Monitoring

    Render Web Services for Viewing/Monitoring and Test Web Applications

    Renders and aggregates other web services into static HTML monitoring output. Also implements simple and dynamic testing of web-based applications using mechanize. Includes Basic authentication and ADFS authentication for web service testing. Custom authentication can be implemented easily in Python. Implements PhantomJS rendering for JavaScript/HTML5 dynamic web pages. Requires Python 3.
    Downloads: 0 This Week
  • 6
    ipfs-css

    Single-purpose css class names and font-face config to IPFS up your UI

    The single-purpose CSS class names and @font-face config to IPFS up your UI. Once you've installed ipfs-css from npm, the CSS and SCSS files and the web-fonts are available from your node_modules/ipfs-css directory. You can import the theme.json file which can be used with a ThemeProvider component. All the CSS atoms are generated from that, so you can be sure you're using the same values. While ipfs.css contains everything you need, if you prefer variables for fonts, colors, and gradients...
    Downloads: 0 This Week
  • 7

    Kanbanara

    Web-based Project Management System using the Kanban methodology

    Kanbanara, a web-based Project Management System, uses the Kanban methodology. Started in 2013, the product is already quite well advanced. It is written in Python 3.7+ and utilizes MongoDB and CherryPy. Its Kanban board features projects, user-definable workflow with custom states, support for epic, feature, story, enhancement, defect, task, test, bug and transient cards, global and personal WIP limits, role-based columns (Owner, Reviewer or Quality Assurance), support for ghost cards (cards...
    Downloads: 1 This Week
  • 8
    SMSBoom

    SMS bombing / SMS pressure measurement, a robust and free python

    SMS bombing / SMS pressure measurement. A robust and free Python SMS bombing program, specializing in blowing up bad guys, with millions of interfaces, multi-threaded automatic addition of effective interfaces, support for millions of concurrent asynchronous coroutines, a completely free SMS bombing tool! Hongkonger develops the whole network for the first time! api.json: define the interface in a custom way. Support keyword replacement. Timestamp [timestamp] phone number [phone] Multithreaded...
    Downloads: 5 This Week
  • 9
    Whisper Library

    Whisper is a file-based time-series database format for Graphite

    Whisper is one of three components within the Graphite project. Whisper is a fixed-size database, similar in design and purpose to RRD (round-robin-database). It provides fast, reliable storage of numeric data over time. Whisper allows for higher resolution (seconds per point) of recent data to degrade into lower resolutions for long-term retention of historical data. Copies data from src to dst, if missing. Unlike whisper-merge, does not overwrite data that's already present in the target...
    Downloads: 0 This Week
  • 10
    Diems - CodeShine

    Web application for posting, submitting, and evaluating assignments

    CodeShine (a part of the Di website) is an application for posting, submitting, and evaluating assignments. 'Di' is shorthand for Deogiri Institute of Engineering and Management Studies, Aurangabad 431001, Maharashtra, India.
    Downloads: 0 This Week
  • 11
    V2RayCloudSpider

    V2RSS is an "ecological mining machine" that can perform vertical mining on global providers based on the SSPanel-Uim framework; it can generate bottom-up "aggregation collection" tasks for mainstream protocol headers; it can self-digest and, compared with proxypool, outputs purer and more reliable proxy nodes; it has powerful production features such as self-discovery and service self-healing.
    Downloads: 0 This Week
  • 12
    Webifier

    A GitHub Action to deploy Notebooks, Markdowns

    Webifier is a stand-alone build tool for converting any repository into a deployable Jekyll website. You can define your pages via YAML files and provide notebooks, Markdown, PDF, and other files for Webifier to render. It uses Python Markdown, providing additional control over attributes and other extensive functionality. It lets you define and direct how your web pages feel and automatically manages your assets, making it a perfect solution for fast static website development...
    Downloads: 0 This Week
  • 13
    Spyne

    A transport agnostic sync/async RPC library

    Spyne is a Python RPC toolkit that makes it easy to expose online services that have a well-defined API using multiple protocols and transports. It integrates with popular Python web frameworks as well as libraries like SQLAlchemy to keep your code as DRY as possible. Spyne aims to save protocol implementers the hassle of implementing their own remote procedure call API and application programmers the hassle of jumping through hoops just to expose their services using multiple protocols...
    Downloads: 0 This Week
  • 14
    Raiden Network

    Use Raiden to enrich your dApp with nearly instant, low-fee and scalable payments. It comes with an easy-to-use API and is compatible with the Ethereum ERC20 token standard. Incentivized, decentralized P2P live streaming with micropayments using Raiden, introducing “Proof-of-Stream-Payment”. The Raiden Network is an infrastructure layer on top of the Ethereum Blockchain. While the basic idea is simple, the underlying protocol is quite complex and the implementation non-trivial. Nonetheless...
    Downloads: 0 This Week
  • 15
    Docksal

    Docksal is a tool for defining and managing web development

    Running Docker on macOS and Windows is supported via VirtualBox as well as Docker for Mac / Docker for Windows. Docksal project stacks are pre-loaded with common development tools, which include Composer, PHP Code Sniffer, php-cli, node, npm, ruby, bundler, and python. For PHP, Drupal, and WordPress development, it comes with Drush, Drupal Console, and WP-CLI. There is built-in support for Apache Solr, Varnish, Memcache, Selenium, and Behat. Since services are containerized with Docker...
    Downloads: 0 This Week
  • 16
    Code Cookbook

    Reusable code patterns which you can use as reference or copy

    Reusable code patterns which you can use as reference or copy to your project. Achieve small or large tasks using recipes that contain steps, scripts, and config files.
    Downloads: 0 This Week
  • 17
    Photonix Photo Manager

    A modern, web-based photo management server

    A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms. This project is currently in development and not feature complete for a version 1.0 yet. If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for other...
    Downloads: 0 This Week
  • 18
    Webware for Python

    The Classic Webware for Python

    Webware for Python is a suite of components for dynamic, server-side web development.
    Downloads: 0 This Week
  • 19
    ReconSpider

    Most Advanced Open Source Intelligence (OSINT) Framework

    ... the capabilities of Wave, Photon and Recon Dog to do a comprehensive enumeration of attack surfaces. Reconnaissance is a mission to obtain information by various detection methods, about the activities and resources of an enemy or potential enemy, or geographic characteristics of a particular area. A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web, typically for the purpose of Web indexing (web spidering).
    Downloads: 8 This Week
  • 20
    FlaskBB

    A classic Forum Software in Python using Flask

    ... can customize every template in FlaskBB. Besides that, you can also install community-made themes. Our goal is to develop a forum system with modern technologies while sticking to already well-tested and approved layouts. FlaskBB is a simple and extensible forum software that should be seen as the foundation for building communities. It is written in Python using the web framework Flask. FlaskBB is distributed under the BSD 3-Clause License.
    Downloads: 0 This Week
  • 21
    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have...
    Downloads: 1 This Week
  • 22
    Reminiscence

    Self-Hosted Bookmark And Archive Manager

    Bookmark links and edit their metadata (like title, tags, summary) via the web interface. Archive links to content in HTML, PDF or full-page PNG format. Automatic archival of links to non-HTML content like PDF, JPG, TXT, etc.; i.e., bookmarking links to PDF, JPG, etc. via the web interface will automatically save those files on the server. Supports archival of media elements of a web page using third-party download managers. Directory-based categorization of bookmarks. Automatic tagging of HTML links...
    Downloads: 0 This Week
  • 23
    DracOS GNU/Linux Remastered
    DracOS GNU/Linux Remastered (https://github.com/dracos-linux) is an open source Linux operating system from Indonesia, built on the Debian live project and licensed under the GNU General Public License v3.0. It is a Linux distribution used for security testing (penetration testing). Dracos Linux is armed with hundreds of tools for pentesting, forensics and reverse engineering. Use a GUI-based...
    Downloads: 23 This Week
  • 24
    restless

    A lightweight REST miniframework for Python

    A lightweight REST mini framework for Python. Works great with Django, Flask, Pyramid & Tornado, but should be useful for many other Python web frameworks. Based on the lessons learned from Tastypie & other REST libraries. Restless tries to be RESTful by default, but flexible enough. The main Resource class has data methods (that you implement) for all the main RESTful actions. It also uses HTTP status codes as correctly as possible. Restless is BYOD (bring your own data) and hence, works...
    Downloads: 0 This Week
  • 25
    ML workspace

    All-in-one web-based IDE specialized for machine learning

    All-in-one web-based development environment for machine learning. The ML workspace is an all-in-one web-based IDE specialized for machine learning and data science. It is simple to deploy and gets you started within minutes to productively build ML solutions on your own machines. This workspace is the ultimate tool for developers preloaded with a variety of popular data science libraries (e.g., TensorFlow, PyTorch, Keras, Sklearn) and dev tools (e.g., Jupyter, VS Code, Tensorboard) perfectly...
    Downloads: 0 This Week