Showing 88 open source projects for "python web crawler"

  • 1
    ViewVC is a web-based version control repository browser, currently supporting CVS and Subversion repositories. Project development is now hosted at https://github.com/viewvc/viewvc/.
    Downloads: 0 This Week
  • 2
    Agile Author is a framework for developing networked repositories of digital information such as digital libraries and content management systems.
    Downloads: 0 This Week
  • 3
    Blogfish - natural selection for your memes. Blogfish is a Gnome panel applet that enables you to find the newest and coolest stuff on the web, via some tiny fish. No bait required.
    Downloads: 0 This Week
  • 4
    Cheshire3 is a fast Z39.50, SRW, XML search engine, written in Python for extensibility and using C libraries for speed. It is the next generation of the Cheshire system (http://cheshire.berkeley.edu) and is designed around a distributable, object-oriented model.
    Downloads: 0 This Week
  • 5
    Webcomic Archive and News Generator (WANG) is a database-driven PHP application built for both aspiring and existing web comics. Written with a focus on security and speed, the code is built to be easy to use for code novices and experts alike.
    Downloads: 0 This Week
  • 6
    Syncato is a Weblog Web Services system built on top of Berkeley DB XML, Webware and Python. It has a number of unique features: XPath access to all content via URLs, XSL-T presentation and an extremely flexible database structure.
    Downloads: 0 This Week
  • 7
    pyChelsea is a Python-based personal indexer of visited web pages, with a search engine and an interface for the browser/platform of your choice. If you remember a page based on a phrase, pyChelsea is for you.
    Downloads: 0 This Week
  • 8
    A collection of software to implement search engine technology. The overall search technology is built on the individual components of this project. Each component is released under the BSD License and is written in the language most suited to its task.
    Downloads: 0 This Week
  • 9
    System to assist with the management of a local Buddhist study library. Will download and install chosen documentation (sutras, etc.), index it, and provide a web interface. Python, HTML. Needs a webserver, namazu. Can be used for other documents, too.
    Downloads: 0 This Week
  • 10
    CVS-Summary is a program that generates an HTML summary of CVS activity, very similar to that provided by the popular ViewCVS script. CVS-Summary differs from ViewCVS in that it dumps the summary information to static HTML files which can be served by n
    Downloads: 0 This Week
  • 11
    A Python wrapper for the Google web API. Allows you to do Google searches, retrieve pages from the Google cache, and ask Google for spelling suggestions.
    Downloads: 0 This Week
  • 12
    LAMP eGovernment Database Project offers state and local governments a free, open source, web-enabled system for use in developing public information sites. You can also use this system for government-to-government systems.
    Downloads: 0 This Week
  • 13
    Ferret CMS is a Content Management System based on Zope. It is focused on easy administration and fast deployment of a web site. It has a workflow mechanism with roles that can be assigned to backend users.
    Downloads: 0 This Week
  • 14
    A powerful, themeable image gallery generator for static HTML pages.
    Downloads: 6 This Week
  • 15
    Emine is a Python script that parses an email file, separates all the email elements, including words and phrases, and populates a database with file offsets for retrieval from the original file.
    Downloads: 0 This Week
  • 16
    A News Aggregator - not a news reader - to collect news from subscribed RSS channels.
    Downloads: 0 This Week
  • 17
    Open Source Application for databasing your Music Collection(s). iChoons will utilize other open source products such as MySQL, Apache Webserver and PHP as well as Python / wxPython and SQLite. We will also be including tools written in Python for Win3
    Downloads: 0 This Week
  • 18
    HORUS is a system for knowledge acquisition, hypothesis generation, inference and learning. It is an interactive, internet environment accessible to a diverse community of users (public-access or membership basis) - see also UMKAILASH project for more.
    Downloads: 0 This Week
  • 19
    This project is for credit in UIUC's CS327 Software Engineering class. This is a class scheduler written in OCaml and Python, using a web interface. The feature list will be decided as the project progresses.
    Downloads: 0 This Week
  • 20
    PySMBSearch is a crawler and search engine for SMB shares. It consists of a crawler script, which creates an index and stores it in an SQL database, and a CGI script that can be used to extract queries from the database.
    Downloads: 0 This Week
  • 21
    Squishdot is a web-based news publishing and discussion product for Zope. It is written in Python with the ability to script in DTML and runs on Linux, *BSD, Unix and Windows 95/98/NT/200.
    Downloads: 0 This Week
  • 22
    Zope is an open source application server specializing in content management, intranets, and custom web applications. Zope is written in Python and has a large, global community of developers and companies.
    Downloads: 0 This Week
  • 23
    Xapian is a Search Engine Library, written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C# and Ruby. Xapian allows you to easily add advanced indexing and search facilities to your applications. See www.xapian.org for more information.
    Downloads: 0 This Week
  • 24
    The NLADA E-Library is an add-on product for the Zope web application server. It is designed to be a drop-in web application and content management system for creating web-based document libraries.
    Downloads: 0 This Week
  • 25
    This project is a Python-based HTTP web proxy server that hooks into MySQL to store a full history of your browsing. Allows you to check out statistics about your browsing habits. Creates a personal portal page, has search features, multi-user, filters.
    Downloads: 0 This Week