Showing 245 open source projects for "crawler"

  • 1
    MyNewsGroups :) is a Web-based USENET news crawler, news reader, and news poster. Using a DB backend, the crawler fetches each newsgroup message only ONCE. Features include a Web-based environment, SPAM filters, a search engine, subscriptions, and much more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    a crawler to index and search the XML web
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    PySMBSearch is a crawler and search engine for SMB shares. It consists of a crawler script, which creates an index and stores it in an SQL database, and a CGI script that can be used to query the database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    FTPList/FTPCrawler is a multi-threaded, MySQL-based FTP crawler. It has a PHP interface that you can use to search the database, see FTP status (up, full, down), and more. It is designed for environments like big LAN parties, e.g. Remedy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 5
    Grub is a distributed internet crawler/indexer designed to run on multi-platform systems, interfacing with a central server/database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    WebSPHINX is a web crawler (robot, spider) Java class library, originally developed by Robert Miller of Carnegie Mellon University. Multithreaded, tolerant HTML parsing, URL filtering and page classification, pattern matching, mirroring, and more.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Content Engineering Tools, including an XSLT-based site rendering system, an XSLT Documentation Generator, and a Swing-based Site Crawler. The tools may be downloaded and used separately since there are no dependencies between them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    FemFind is a crawler/search engine for SMB shares (which can be found on Windows or Unix systems running Samba). FemFind also crawls FTP servers and provides a web interface and a Windows client as frontends for searching.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Spindexer is a Search Engine/Crawler tool similar to UDMsearch or ht://dig - but unlike these tools, Spindexer is *very* fast and flexible. A simple Perl script works as a front-end to Pavuk and Swish++, allowing a fast crawl across any site(s).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 10
    Harvest is a web indexing package. Originally designed for distributed indexing, it can form a powerful system for indexing both large and small web sites. It now also includes Harvest-NG, a highly efficient, modular, Perl-based web crawler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    This project aims to be a base for specialized image crawlers. It can download images from a specific website and can be extended to crawl any website. All of the processes are multithreaded. It accepts filters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    arachne is a C++ library for HTTP crawling, link, text and metadata extraction designed to run in a distributed environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SubC: Smart usenet binaries Crawler, the powerful and efficient newsgroup binaries auto-referencing program.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    mornex

    a first-person dungeon crawler

    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Gokstad will be a basic crawler and text analysis engine. Its current scope is to download news webpages and do simple text analysis on them. The name "Gokstad" comes from a seaworthy, clinker-built ship, constructed largely of oak by the Vikings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Spider is a web crawler written in Java. Based on a regular expression string, the spider searches the internet for web pages matching this string and stores them in a MySQL database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    studiMaps is a web-based application for visualization and analysis of social networks. It consists of two software components: a web crawler for getting data and the web-based application for visualization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Webhunter is a distributed, multi-threaded web crawler designed for both general indexing and crawling the web for focused content.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    FTP Scanner, FTP Crawler, and web-based interface to search files and browse FTP servers through a database index (offline browsing). Originally designed for Ethernet network segments with anonymously accessible FTP servers, to make it easy to find needed files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    ICECrawler is a WWW crawler and map generator intended to help understand and analyze links between websites and web documents.
    Downloads: 0 This Week
    Last Update:
    See Project