Showing 76 open source projects for "python web crawler"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Cloud-based help desk software with ServoDesk Icon
    Cloud-based help desk software with ServoDesk

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 1
    Easy, modular and flexible WAMP bundling Apache2/SSL, MySQL4, PHP4, Perl5.8/ASP, Python2.3, Tomcat5, FirebirdDB, FileZilla, Mail/News-Server, phpMyAdmin, Awstats, WordPress, etc. It also includes a web-GUI to control/manipulate all bundled services.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Support for the Oasis XRI (Extensible Resource Identifiers) effort. This includes resolvers and client libraries for XRIs in multiple languages and multiple platforms. See http://www.oasis-open.org/committees/xri
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Code for reference implementations of identity brokers and simple single sign-on (SSO) mechanisms that utilize XDI and link contracts to manage the dataweb.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    A collection of software to implement search engine technology. The overall search technology is built on the individual components of this project, each component is released under the BSD License, and is written in the language most suited to its task.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cloud data warehouse to power your data-driven innovation Icon
    Cloud data warehouse to power your data-driven innovation

    BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.

    BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
    Try for free
  • 5
    Publish and subscribe messaging for the Web, and related tools. Our license is BSD.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    LAMP eGovernment Database Project offers state and local governments a free open source, web-enabled system for use in developing public information sites. You can also use this system for government-to-government systems as well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    LAN management system - MySQL based system managing machines with DHCP, DNS through a PHP Web interface. Also contains a samba search engine.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    An alternative Identification system that is a replacement for Microsoft's Passport and the Liberty Alliance. Its a simple architecture that is setup so anyone can run a server and thereby have control over their online identification.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Web publish without any access to database, everything is done with files, using XML and other marks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • G-P - Global EOR Solution Icon
    G-P - Global EOR Solution

    Companies searching for an Employer of Record solution to mitigate risk and manage compliance, taxes, benefits, and payroll anywhere in the world

    With G-P's industry-leading Employer of Record (EOR) and Contractor solutions, you can hire, onboard and manage teams in 180+ countries — quickly and compliantly — without setting up entities.
    Learn More
  • 10
    Empire Land Game of world rule! A realtime, multiplayer, medieval, web-game with a moduler architecture. Capable of running on multiple servers.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Xapian is a Search Engine Library, written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C# and Ruby. Xapian allows you to easily add advanced indexing and search facilities to your applications. See www.xapian.org for more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Valuhack is a collaborative effort of a variety of newsgroups to port a simple program to as many languages as possible, while maintaining the true spirit of hacking.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Spatial Disruption is a Web based Strategy Warfare game. This project houses both Version 1(Perl, C++, and Flat text Files), and Version 2(PHP, Python, C++, and Various SQL databases). Our game already features a 2-D map, built-in alliances, and more!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    XML XSLT Web Traverse - parses web directories transforming XML with XSLT.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    PM2HTML takes PageMaker files and makes a cohesive newspaper website. It comprises a PMScript that exports all stories to a directory of tagged txts, and a python program to act as a converter to turn those tagged text files into HTML, a parser to guess
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    What if we could create a three dimensional world, in the beginning empty, without neither concept or content, free for all to populate and build. In a manner not far from the world wide web, but without the limitations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    LiveFrame Gallery is an easily customized, web-based album and slideshow for sets of photographs. LiveFrame Lab is a jpeg processor that scales, rotates and captions images, and outputs LiveFrame Gallery directories and configuration files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Named after a well know product from Microsoft, SandStorm is a framework for creating modular middle-end web products.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A Web-Based system for activists to share information and organize for events. Including a content mangement system for the public as well as services for the users. Users will be able to communicate via forums/listservs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    IT4School is a package to bring the school workflows and collaboratin processes on-line, and servers as an intergration platform to presents all the services schools offer as a set of consistant and easy to use services over the internet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Frosttie (FROnt-end SchemaTron Text Internet Engine) takes XHTML pages and processes them with various user-definable filters such a W3C's WAI, Section 508 (US) web usability compliance, ad removal, etc. It can be used with zKnowMan.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Omseek has been renamed to Xapian. Xapian is a Search Engine Library, written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C# and Ruby. It allows you to easily add advanced indexing and search facilities to your applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    FemFind is a crawler/search engine for SMB shares (which can be found on Windows or Unix systems running Samba). FemFind does also crawl FTP servers and provides a web interface and a Windows client as frontends for searching.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A simple perl (or python) CGI that exports the MainLine of a CVS repository as a website. In use since 1998, recently added Subversion support to help migrate away from CVS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Harvest is a web indexing package, originally disigned for distributed indexing, it can form a powerful system for indexing both large and small web sites. Also now includes Harvest-NG a highly efficient, modular, perl-based web crawler.
    Downloads: 0 This Week
    Last Update:
    See Project