Showing 1049 open source projects for "python web crawler"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Sales CRM and Pipeline Management Software | Pipedrive Icon
    Sales CRM and Pipeline Management Software | Pipedrive

    The easy and effective CRM for closing deals

    Pipedrive’s simple interface empowers salespeople to streamline workflows and unite sales tasks in one workspace. Unlock instant sales insights with Pipedrive’s visual sales pipeline and fine-tune your strategy with robust reporting features and a personalized AI Sales Assistant.
    Try it for free
  • 1

    Amuziq Web Browser

    muraleekrishnagc@gmail.com

    This is a small,light weight and fast web browser made from python.Use in Linux distros only
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    A Python interface to the gnuplot plotting program.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3

    Web Crawler Security Tool

    A web crawler oriented to information security.

    Last update on tue mar 26 16:25 UTC 2012 The Web Crawler Security is a python based tool to automatically crawl a web site. It is a web crawler oriented to help in penetration testing tasks. The main task of this tool is to search and list all the links (pages and files) in a web site. The crawler has been completely rewritten in v1.0 bringing a lot of improvements: improved the data visualization, interactive option to download files, increased speed in crawling, exports list of found...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    APMP
    Another distribution of apache that contains Apache, MySQL, Perl, and PHP. and comes with the following modules mod_perl, mod_php, mod_python, and more. Also Contains a FTP and Mail Server. Will be ready to run right after installation and a reboot.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Deliver secure remote access with OpenVPN. Icon
    Deliver secure remote access with OpenVPN.

    Trusted by nearly 20,000 customers worldwide, and all major cloud providers.

    OpenVPN's products provide scalable, secure remote access — giving complete freedom to your employees to work outside the office while securely accessing SaaS, the internet, and company resources.
    Get started — no credit card required.
  • 5

    LinkChecker

    check links in web documents or full websites

    New Homepage: http://wummel.github.io/linkchecker/ Linkchecker features: - recursive and multithreaded checking and site crawling - output in colored or normal text, HTML, SQL, CSV, XML or a sitemap graph in different formats - HTTP/1.1, HTTPS, FTP, mailto:, news:, nntp:, Telnet and local file links support - restrict link checking with regular expression filters for URLs - proxy support -...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 6
    Yet another web crawler? Yes, but this ones uses the full power of regular expressions to accept or reject, examine or ignore, save or refuse pages. You also use MIME types to do all this. Powerful and flexible.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Screenshot Paste plugin for Trac

    Screenshot Paste plugin for Trac

    A Trac plugin to allow pasting screenshots or images with one click

    A Trac plugin to allow pasting screenshots or other images captured or copied in the clipboard directly as attachements to tickets, Wiki pages, etc., without the need to first saving as images and then uploading them. Once the plugin is installed in Trac, you can easily attach a screenshot or any image you have in the clipboard to a Ticket or Wiki page, with one click.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    This 5 generation selenium web crawler crawl through web page of a host website searching for static and dynamic links and able to detect honeypot links.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9

    Python Crawler Library

    Python Web Crawler Library

    A simple library for crawling the web. This library will give you the ability to create macros for crawling web site and preforming simple actions like preforming "log in" and other simple actions in web sites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • 10

    Zyrak WB

    Simple anonymous browser

    Zyrak is a simple web browser created in Visual Basic. You browse in peace without worrying.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    Spondulas

    Spondulas is browser emulator designed to retrieve web pages for hunti

    Spondulas is browser emulator and parser designed to retrieve web pages for hunting malware. It supports generation of browser user agents, GET/POST requests, and SOCKS5 proxy. It can be used to parse HTML files sent via e-mail. Monitor mode allows a website to be monitored at intervals to discover changes in DNS or content over time. Autolog mode creates an investigation file that documents redirection chains. The retrieved web pages are parsed for links and reported to an output file. More...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    WordList Generator

    Generate wordlists using different methods

    WordList Generator is used to generate word lists. Methods: -Web Crawler -Search Engine Crawler -Random -Brute Force
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Where In the World Have You Been?
    A PHP script with maps of the World, China, Canada, USA, India, Africa and Europe that allows the user to select the countries, provinces or states by clicking on them or selecting a checkboxes. Selection causes the entity to turn a default color which contrasts with defaults colors of all bordering countries. Thus a patchwork is made to show the history of countries, states or provinces traveled. Added features allow users to download their maps, to blow them up to posters of any...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    SaWALi Web Application Library

    The SaWALi is a website management tool written in Python.

    The SaWALi Web Application Library is a Python application that aims to provide a reasonably complete set of components for operating a multi-purpose website. Taking advantage of the Pylons Framework, SaWALi is fully-customisable and inherently-extensible. All of SaWALi's administrative and public interfaces can be modified to suit a website's userbase— from its document editors and server error pages down to its public-facing pages and site maps. Being a Python module, SaWALi can also...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The Web Community Wiki aims to create a base for informations, using mediawiki and the semantic mediawiki extension. The project works on templates for the wiki and on skins and extensions to enhance the useability of the Web Community Wiki.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Seven-Labs

    Seven-Labs

    Application Development

    This repository serves as our entire project space which contains all of the open-source projects we've worked on. - C/C++ - C#/.NET - PHP - HTML5/CSS3
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    The archive-crawler project is building Heritrix: a flexible, extensible, robust, and scalable web crawler capable of fetching, archiving, and analyzing the full diversity and breadth of internet-accesible content.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    Pyjamas is a python to Javascript compiler, Widget set, Framework and Toolkit for Application development that runs on Web browsers. The developer need not know anything about AJAX: all the AJAX tricks, for all major browsers, are entirely taken care of.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    PRO-Search is a crawler of FTP servers, SMB shares, HTTP, dc++ networks, ... with powerful web search and navigation interface
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20

    VIT Marks Display

    A small program that accesses VIT marks of a specific student

    A small attempt while learning interfacing with the web while learning python to get the marks of a specific valid VIT student using basic web scraping techniques
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21

    fileshack

    fileshack allows you to create your own hosted web file storage, easy

    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    PBP is a web browser made for testing web applications. Its user interface is a command interpreter with a simple, focused shell-like language which helps both developers and non-developers create robust functional tests with little effort.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Biz is a WSGI-compatible web application framework written in Python. It aims to be a platform for easily developing secure and internationalized web applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Notice Publisher plugin for Trac

    Notice Publisher plugin for Trac

    A Trac plugin to display Notices to any User visiting any page in Trac

    A Trac plugin to display Notices to any User visiting any page in Trac. Take a look at the Web site on Trac-Hacks: http://trac-hacks.org/wiki/NoticePublisherPlugin This is useful to bring everyone attention on news that affect all users, like the system going down, a solution to a common problem, and so on. Notices can contain Wiki-formatted syntax, thus allowing for rich content. Notices can have an expiration, expressed in hours, after which they automatically disappear...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    "Swish-e is a fast, flexible, and free open source system for indexing collections of Web pages or other files" (http://swish-e.org/ ) This module provides a Python API for this software.
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.