The "/OpenJUMP_snapshots" file could not be found or is not available. Please select another file.

Search Results for "heritrix-1.14.4-src.zip"

Showing 12 open source projects for "heritrix-1.14.4-src.zip"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Heritrix

    Heritrix

    Internet Archive's open-source, web-scale, web crawler project

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/heretix/heratix) is an archaic word for heiress (woman who inherits). Since our crawler seeks to collect and preserve the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    EssentialsX

    EssentialsX

    The modern Essentials suite for Spigot and Paper

    EssentialsX is a continuation of the Essentials plugin suite, updated to support modern Minecraft and Spigot versions. It provides countless new features, performance enhancements and fixes that are not available in the original Essentials or Spigot-Essentials. If you're coming from the original Essentials plugin, EssentialsX is a drop-in replacement for Essentials.
    Downloads: 55 This Week
    Last Update:
    See Project
  • 3

    Offnet

    Program that saves complete web pages retaining multiple timestamps

    ...Project goals: - Web page downloads for less experienced users, including easy setup - Project based page maintanance - Not too plain functions that include also multiple snapshots per project - Iterative, understandable and storage efficient data structure to enable more manual control over stored pages (meta files editable with Easy Folder Morpher) - Retain archived files and query links as original, altering links only during query Current status: - Alpha stadium, archivation quality below Heritrix...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ARCOMEM

    ARCOMEM

    Semantic and social web crawling

    ...Throughout the project a large number of components have been developed to collect content from Web and Social Web, to analyse it from semantic and social perspectives and to enable Web archive access by different facets. The whole system based on the Heritrix crawler is released as open source to the public. Since many components or composite tools are of interest also for other areas and usage scenarios, the ARCOMEM consortium defined a number of pre-packaged tools which can be used independently from each other. By combining all packages the full ARCOMEM system can be build. The following major packages will be released in the coming weeks as pre-compiled packages with source code. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    A drop-in replacement for the src.zip shipped with Oracle Java 7, that contains sources to all Java classes that are shipped or generated by the OpenJDK project (the official src.zip only covers public classes), plus tools to generate it.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    chksm-chkfile

    chksm-chkfile

    Windows MD5/SHA-1/SHA-256 file hashes calculator.

    ... - 32 & 64 bits windows platforms supported - Installation (Add to file context menu) - Uninstallation Requirements: - Microsfot .NET Framework 2.0 Files: chksm-chkfile.exe MD5: c0b7761deeb7d7dc45aa04726309d4ca SHA1: b19e8013ce1ea6c297c682a9ad5940024f8ff68e SHA256: a539d86960ec28429d98fc619a213efd8435d4c340f78c4d07bb1494ace82ce5 chksm-chkfile-src.zip MD5: f60acabb5a5e3e0e930b74e918e20d71 SHA1: 05ec2018fe4de677b1f27b498be3b61afe5f15c1 SHA256: e5c9caba615066c539a7ff523ea1a89af6dcf1665fd47362dd95bc8adf88fe65
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    exShut

    Windows extended Shutdown scheduler

    ...Functionalities: - 32 & 64 bits windows platforms supported - Stay in the notification zone - Cancel the requested operation Requirements: - Microsfot .NET Framework 2.0 Files: exShut.exe MD5: 5c6d02fa051060970bbf831f21bd179b SHA1: 2e0879bfcb594e25bc99942068ca22128cfeb23f SHA256: 5a9a79108e07d3b6258373c487814fc4d0fc900e2c787e668103837560175ccf exShut-src.zip MD5: a00b501f56c3c486ee7fc2952336cf0d SHA1: 3cec4800daeee78aa3f4237fac23391effe2c55d SHA256: 787a4f01e8cf015e30e5860b97640658e8932be8c20d6f4816e1cf4e8c69c7c4
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    The archive-crawler project is building Heritrix: a flexible, extensible, robust, and scalable web crawler capable of fetching, archiving, and analyzing the full diversity and breadth of internet-accesible content.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Web-as-corpus tools in Java. * Simple Crawler (and also integration with Nutch and Heritrix) * HTML cleaner to remove boiler plate code * Language recognition * Corpus builder
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    The DeDuplicator is an add-on module (plug-in) for the web crawler Heritrix. It offers a means to reduce the amount of duplicate data collected in a series of snapshot crawls.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Heritrix expand project
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB