Showing 65 open source projects for "html source extractor"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 1
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Cross-platform visual XSLT generator
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    JHOVE

    File validation and characterization

    JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE should not be confused with JHOVE2, a product with similar aims but a completely separate code base.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    ...Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5

    XmlDoclet

    A JavaDoc doclet that outputs source code structure in XML format.

    XmlDoclet is a JavaDoc doclet that outputs the source code structure of the packages, classes etc. in XML format. Later, the XML data may easily be processed by standard tools such as XSLT to produce HTML, PDF, dot graphs etc. Technically, this is done by wrapping the class and interfaces of the com.sun.javadoc packages into JAXB annotated classes, which allows for an easy serialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Observation Manager
    Java based astronomical logging software which stores it's data in a free and open XML based format (OpenAstronomyLog). Discontinued project: Please check out the fork: https://github.com/capape/observation-manager for an updated version
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Jaxe
    Jaxe is a free Java XML editor with a configurable GUI, using XML schemas for validation and XSL for exports in HTML or XML.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Please see https://github.com/davemckain/snuggletex
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    XML Interactive SlideShow
    Xiss web is a Java tool to generate photo slide show on the web. Generation of photo description in XML and HTML pages based on layout and skin. Exif display. Google Maps display. Animated photo transition. Uses W3C standards: XML, XSLT, XPATH, CSS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    RSS Reader is a mobile J2ME (Java ME) app. that runs on Java enabled phones. User can store multiple RSS/ATOM feeds as bookmarks. Can import from OPLM file or HTML file with links. Can read offline. MIDP 2.0/some MIDP 1.0. Specify which MIDP w/issue
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Calenco XML CMS
    Calenco is a Web collaborative platform that enable remote teams of writers, proofreader, graphic designers, translators, etc. to produce together XML documents like user guides, security procedures, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    xccdf2pdf renders XCCDF documents in PDF and other formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SAADA transforms a set of heterogeneous astronomical data FITS files in a powerfull database deployed on the Web and including VO features. The database is located on your host and keeps under your full control. This job doesn't require to write cod
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    This application uses XML files as input to apply a XSLT transformation and export user-readable documents, such as HTML or LaTeX. Il was originally intended to create Yearbooks based on the already existing files with details about an organisation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    HTTP functional and non-functional (load and performance) toolkit based on jython/grinder (http://grinder.sf.net) ...includes capabilities to support: SOA services, REST, json/xml encoding, AES and WS security ... and a stub to collect requests
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Lightweight XML processor; XML-POJO mapping via Java5 annotations or DTD; Preprocessing of XML documents using expression language; Binary XML; RMI friendly XML; JSON format support; XML marshall/unmarshall; HTML as XML parser; Swing XML Viewer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This is a web framework, based on XML and XSL technology. Support for the incremental development with plug-in, support plug and play.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ServingXML is an open source, Apache 2.0 licensed, framework for flat/XML data transformations. It defines an extensible markup vocabulary for expressing flat-XML, XML-flat, flat-flat, and XML-XML processing in pipelines.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    MetaXplode is simply an e-book organizer, an Automatic Meta Data Generator for Electronic Books.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MadStore is a Java-based web platform for automatically extracting, generating and publishing Atom web feeds from standard XHTML web pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Content adaptation platform that adapts internet content to any mobile device in real time. Includes features such as device detection, image transcoding, DOM manipulation, XML/RSS feed support and much more.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 23
    A servlet (or a bunch of servlets) to access a database, perform ddlutils tasks (retrieve, create and update from or to a description). Uses the apache.org ddlutils (and some other libraries).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Atheneum is a bookstore inventory system written in Java 5.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    XML utilities : - xml to excel paging tool
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB