Showing 1049 open source projects for "python web crawler"

View related business solutions
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • 1
    Hatta Wiki
    Hatta is a wiki engine that uses a Mercurial repository for storing the pages. You can run it as a web application on your server, or locally on your computer. You can also do both, and synchronize the repositories once in a while.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    The aim of this project is to build the first computer for children (ages 4-14). From OS and apps to hardware.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    PyApplet will be a web browser plugin (currently for Firefox on Windows, later for other platforms and browsers). PyApplet will make it possible to create applets (like java applets) in Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    WHIFF is an infrastructure for easily building complex Python/WSGI Web applications by combining smaller and simpler WSGI components organized within file system trees.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Open Source Software Tool for installing and configuring Squid: Proxy Server and Web Cache Daemon. Developed By: Ashwin Tumma
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Teng is a general purpose templating engine written in C++ (i.e. library). It is also available as Python module or PHP extension. The main idea of teng is to strictly separate application logic from presentation layer. Widely used on dynamic web sites.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    The web lint checks HTML and XHTML pages for possible markup problems. It attempts to find problems with your code that an HTML validator does not.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Proxy server that performs proof of concept session hijacking by receiving cookies from one or more NomNom Agents. It then offers the user an option to switch to any intercepted session. More features and site specific options later.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    htc-py converts XML data into HTML web-pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 10
    Tenjin is a very fast template engine for web applition. It runs about 3-10 times faster than other template engine. It is implemented in Python, Ruby, Perl, and JavaScript.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Es un software diseñado para suplir la necesidad de algunas personas de tener un Web Crawler o Spider duro, navega de forma automática por los diferentes sitios o paginas Web, extrayendo los enlaces a otras paginas.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    This is a Django based application for creating database backed or email forms. It features an innovative drag and drop interface for building forms, a workflow for reviewing forms, and infinite customization. As of 12/1/2013, primary development has moved to http://github.com/carsongee/formunculous
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Crawler
    Crawler is a bare-bones spider designed to quickly and effectively build an index of all files and pages on a given Web site as well as the link relationship (both incoming and outgoing) between each page. More open source at https://github.com/fcc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Project to manage xenserver/xcp (xen cloud platform) virtual machines with web (similar to xencenter) Sorry guys, but this project is dead. You are free to clone and use the same name. Contact to me if you want in alberto at pesadilla dot org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A web application for collecting ideas how to improve work flow and everyday tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Trac Semantic Extension
    Extension of Trac from Edgewall. This plugin adds possibility to use semantic features in wiki. It is targeted to support software development and presentation. Uses standalone semantic repository Sesame 2. Download plugin or try virtual machine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    zope3 based cms
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Pynta - Flexible Web Framework in Python
    Pynta is flexible web framework written in Python. All development going on https://github.com/lig/pynta
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Web spider and SERP scrapper
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A web crawler for crawling news from user specific websites and keywords using PHP and third party software
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Document summarization system. By adding document content to system, user queries will generate a summary document containing the available information to the system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Ex-Crawler
    Ex-Crawler is divided into 3 subprojects (Crawler Daemon, distributed gui Client, (web) search engine) which together provide a flexible and powerful search engine supporting distributed computing. More informations: http://ex-crawler.sourceforge.net
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    HTML/CGI-based streaming multi-room CHAT. Crazy administration features (why kick when you can possess?), memos, memo lists, smileys, dice roller, & more. Comes with BOT GAMES and an interface for new bots. In Python, for UNIX and Windows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    MyCube Vault
    MyCube Vault is an open-source project with the objective of allowing users to regain control of their social media content and social connections. MyCube has developed a base product with support for Facebook, Google Contacts and Picassa.
    Downloads: 0 This Week
    Last Update:
    See Project