Showing 80 open source projects for "extensible web spider"

  • 1
    Sperowider Website Archiving Suite is a set of Java applications, the primary purpose of which is to spider dynamic websites, and to create static distributable archives with a full text search index usable by an associated Java applet.
    Downloads: 0 This Week
  • 2
    SmartCrawler is a Java-based, fully configurable, multi-threaded and extensible crawler that fetches and analyzes the contents of a web site using dynamically pluggable filters.
    Downloads: 0 This Week
  • 3
    RSS spider for collecting multiple RSS feeds into a single place with search capabilities.
    Downloads: 0 This Week
  • 4
    Robust, feature-rich, multi-threaded CLI web spider written in Java, using Apache Commons HttpClient v3.0. ASpider downloads any files matching your given MIME types from a website. By default it also tries to match email addresses with a regular expression, logging all results using log4j.
    Downloads: 0 This Week
  • 5
    GridAuth is a user credential management system for distributed data and computational grids. GridAuth is configurable and extensible to just about any system requiring credential management, advanced authorization and secure authentication.
    Downloads: 0 This Week
  • 6
    WebLoupe is a Java-based tool for analysis, interactive visualization (sitemap), and exploration of the information architecture and specific properties of local or publicly accessible websites. Based on web spider (or web crawler) technology.
    Downloads: 0 This Week
  • 7
    A Java-based extensible Content Management System, focused mainly on an easily configurable backend, as well as some front-end portal components. Includes authentication, document, language and workflow modules. Based on Hibernate, Tapestry and Spring.
    Downloads: 0 This Week
  • 8
    Support for the Oasis XRI (Extensible Resource Identifiers) effort. This includes resolvers and client libraries for XRIs in multiple languages and multiple platforms. See http://www.oasis-open.org/committees/xri
    Downloads: 0 This Week
  • 9
    The goal of the project is to create a 100% pure Java-based browser that supports the latest standards from the W3C. The project is made up of two parts: the actual browser application, written in Java Swing, and a Swing component that renders HTML.
    Downloads: 0 This Week
  • 10
    University Mobile Portal (UMP): an extensible application for setting up a mobile-accessible database with information about news and announcements within a university. Includes News Fetcher, SMS/Email Reminder, and Pinboard. Uses open source software: Cocoon, Xindice, Kannel.
    Downloads: 0 This Week
  • 11
    The OJB Console is an extensible Struts web application featuring transparent browse, search, create, update and delete functionality for the objects configured within the Object Relational Bridge (OJB) persistence framework.
    Downloads: 0 This Week
  • 12
    Anokha is an MVC web framework based on Java/J2EE technology that simplifies enterprise application development by providing a set of presentation and business tier components that are J2EE-API independent and highly extensible. Easy to use and learn.
    Downloads: 0 This Week
  • 13
    Spider is a complete standalone Java application designed to easily integrate varied datasources. Features: XML-driven framework; scheduled pulling; highly extensible; hooks for custom post-processing and configuration.
    Downloads: 0 This Week
  • 14
    JImageTaglib is a Java Tag Library for displaying server-side generated and/or manipulated images in a JSP. Apply effects, resize, show subparts of an image, change image colors, display thumbnails, generate barcodes, apply text, etc. Highly extensible.
    Downloads: 0 This Week
  • 15
    Finally! An extensible, cross-platform dynamic, static, or custom DNS update client! This simple Java program will keep your domain name always pointed at your computer. Supports multiple ways for obtaining IP, updating domain, and reporting errors.
    Downloads: 0 This Week
  • 16
    A Java implementation of a flexible and extensible web spider engine. Optional modules allow functionality to be added (searching for dead links, testing the performance and scalability of a site, creating a sitemap, etc.).
    Downloads: 0 This Week
  • 17
    A modular, flexible and extensible Identity and Access Management system for integrated login, access and profile management across disparate security domains. Supports Apache, PAM, Webcrossing, XMLRPC and SOAP from C, Perl, and Java with more to come.
    Downloads: 0 This Week
  • 18
    A highly customizable and extensible J2EE portal application based on portlet-like components. Our target is to provide a simple JSR168 implementation.
    Downloads: 0 This Week
  • 19
    Dynamic XML generation implemented in Java. Features an extensible XML-compliant procedural scripting language.
    Downloads: 0 This Week
  • 20
    Arachnid is a Java-based web spider framework. It includes a simple HTML parser object that parses an input stream containing HTML content. Simple Web spiders can be created by sub-classing Arachnid and adding a few lines of code called after each page
    Downloads: 0 This Week
  • 21
    WebSPHINX is a web crawler (robot, spider) Java class library, originally developed by Robert Miller of Carnegie Mellon University. Multithreaded, tolerant HTML parsing, URL filtering and page classification, pattern matching, mirroring, and more.
    Downloads: 0 This Week
  • 22
    The DynAPI IDE is aimed at being an extensible tool for easy and correct development of dynamic webpages with client-side scripting, with a focus on the HTML Document Object Model as controlled by ECMA/JavaScript.
    Downloads: 0 This Week
  • 23
    RealClient tries to provide an extensible way of programming applets, using an XML file as the basis of the layout and transferring XML back and forth to a back-end.
    Downloads: 0 This Week
  • 24
    JCast-X is an extensible streaming server written in Java. It can stream any kind of media: MP3, MPEG, anything you would like to see. JCast-X is based on a framework built on top of the "source-bus-listener" pattern. It's easy to extend and use.
    Downloads: 0 This Week
  • 25
    The DynaTalk DynAPI distribution is a DHTML library that includes an extensible client-side network/messaging subsystem, allowing developers to create truly live, stateful, and responsive web applications using open technology.
    Downloads: 0 This Week