Showing 55 open source projects for "web crawler source code"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Composer for OWL-S-based Semantic Web Services that can load SWRL-based Semantic Bridges to mediate between heterogeneous ontology concepts. Exports semantic. enh. BPEL code. Dependency to BerliOS-project: "sembpelext". Also see QualiPSo.org for details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Modelio-Open is a project hosting a set of open source extensions (SoaML, SysML and UML Testing Profile) for a previous version (1.2) of the Modelio Free tool . Currently, the lastest version (2.x) of Modelio modeling and generation tool is available at http://modelio.org/downloads/download-modelio.html. All extensions are downloadable at http://forge.modelio.org/projects.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    A java-based JSON service for zip code location lookup and distance calculation. Implemented as a web application and related support classes. Includes a zip code dataset that should be loaded into a database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Ex-Crawler
    Ex-Crawler is divided into 3 subprojects (Crawler Daemon, distributed gui Client, (web) search engine) which together provide a flexible and powerful search engine supporting distributed computing. More informations: http://ex-crawler.sourceforge.net
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    MOLGENIS can be used to generate databases for life science experiments (micrroarray, mass spectrometry, genomics) having a web user interface, csv exchange format, and programmatic interfaces (web services, rest, and r-project). See NatRevGen 8.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    OSGB Web Map Tools is a set of software code and installation instructions that will enable web developers to produce dynamic mapping web sites using Ordnance Survey map data on the internet.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Web-as-corpus tools in Java. * Simple Crawler (and also integration with Nutch and Heritrix) * HTML cleaner to remove boiler plate code * Language recognition * Corpus builder
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    DEPRECATED PROJECT! SolexaTools is becoming the SeqWare project (http://seqware.sourceforge.net) to better reflect its expanded sequencer support. Please go to this project page for code and documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Crawl a set of files, accumulating information on the temporal and spatial extent of the data in each file, for later search and retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Testing tools source code of the Integrating the Healthcare Enterprise (IHE) Cross-Enterprise Document Sharing for Images (XDS-I) Integration Profile
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Testing tools source code of the Integrating the Healthcare Enterprise (IHE) PIX and PDQ v3 Integration Profiles
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A platform which hosts battles between Fruit War robots - special java programs. The platform accepts source code as robot, and ranks all robots according to battle result. This project has been totally refined and is now hosted at http://robotypo.appspot.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Integrate the power of the fully AJAX enabled technology of ICEfaces into the framework of the ESRI WebADF. No need to code in Javascript or to handle XML communication to build AJAX functionalities. OnlineDemo: http://www2.pch.etat.lu/pchViewer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    IFS Tools is a set of Java applets for the interactive, graphical construction of iterated function system (IFS) fractals. Output is in the form of IFS code and graphics.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Peruser is a open-source lightweight knowledge application framework. Peruser speaks XML and RDF and is implemented in Java. Peruser code runs within Cocoon, as a standalone webapp, as console programs invoked through ant, or as embeddable java code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Crawl-By-Example runs a crawl, which classifies the processed pages by subjects and finds the best pages according to examples provided by the operator. Crawl-By-Example is a plugin to the Heritrix crawler, and was done as a part of GSoC06 program.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Java and ColdFusion libraries to lookup country code and language from IP address. It uses a local copy of the WHOIS database to perform fast, accurate lookups of country codes. Useful for log analysis, internationalization, geolocation, etc..
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Manuelito is a Java tool for decrypting posttranslational modification (PTM) patterns of proteins based on MALDI mass spectrometry spectra. It was built for analysing the 'histone code' but it can be applied to any other highly modified protein.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A configurable knowledge management framework. It works out of the box, but it's meant mainly as a framework to build complex information retrieval and analysis systems. The 3 major components: Crawler, Analyzer and Indexer can also be used separately.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    JChemPaint Applet and Swing Application
    The JChemPaint Applet and Swing Application are Java programs for drawing 2D chemical structures like those found in most chemistry textbooks. It is based on the Chemistry Development Kit (cdk.sf.net). Downloads, tracker, and source code repository can be found on https://jchempaint.github.io
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    Pödznsatch is a open and distributed hypergoogle of love. It is a semantic web application for social networking, word-of-mouth analysis and profiling. The Pödznsatch architecture includes a bot crawler, an inference engine and a query interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    OpenL implements Java-like languages allowing to use code snippets placed in Excel, XML, or DB. OpenL is a foundation for OpenRules http://openrules.com. Maintained from http://openl.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Sight provides a friendly interface to create and connect agents for bioinformatics. The workflow supports multiple responses to a single request, structure transforms, filters and request history access.Can talk with ordinary http servers (get and post)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    The purpose of the E-Remise system will be to facilitate the proper evaluation of large numbers of electronic assignements. Another prupose of E-Remise is the archiving of source code and other work done by students. It is useful for pedagocial reasons.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    kin is a set of libraries and tools supporting meta-programming for engineering and technical computing applications. This JVM version of kin has been superseded by the stand-alone version being developed at http://purl.org/net/kin.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB