Showing 192 open source projects for "html source extractor"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    PostHTML

    PostHTML

    PostHTML is a tool to transform HTML/XML with JS plugins

    PostHTML is a tool for transforming HTML/XML with JS plugins. PostHTML itself is very small. It includes only an HTML parser, an HTML node tree API and a node tree stringified. All HTML transformations are made by plugins. And these plugins are just small plain JS functions, which receive an HTML node tree, transform it, and return a modified tree. PostHTML is a tool for transforming HTML/XML with JS plugins. PostHTML itself is very small. It includes only an HTML parser, an HTML node tree...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Nokogiri

    Nokogiri

    Tool to work with XML and HTML from Ruby

    Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2 (C) and xerces (Java). Be secure-by-default by treating all documents as untrusted by default. Be a thin-as-reasonable layer on top of the underlying parsers, and don't attempt to fix behavioral differences between the parsers. "Native...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    lxml

    lxml

    The lxml XML toolkit for Python

    A Python library for efficient XML and HTML processing, known for speed and compatibility. The lxml XML toolkit is a Pythonic binding for the C libraries libxml2 and libxslt. It is unique in that it combines the speed and XML feature completeness of these libraries with the simplicity of a native Python API, mostly compatible but superior to the well-known ElementTree API. The latest release works with all CPython versions from 3.6 to 3.12. See the introduction for more information about the...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 4
    quick-xml

    quick-xml

    Rust high performance xml reader and writer

    High-performance xml pull reader/writer. The reader is almost zero-copy (use of Cow whenever possible) It is easy on memory allocation (the API provides a way to reuse buffers) It supports various encoding (with encoding feature), namespaces resolution, and special characters. Syntax is inspired by xml-rs. When using the serialize feature, quick-xml can be used with serde's Serialize/Deserialize traits. The mapping between XML and Rust types, and in particular the syntax that allows you to...
    Downloads: 6 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    adx - addressbook.xml

    adx - addressbook.xml

    Minimalistic address book in web browser. No server or plugin needed.

    Minimalistic but full-featured addressbook in your web browser. adx is a standalone and portable web app (online and offline). FEATURES Contact Management, portable, small (~350KB), lightweight, contact tagging, geo mapping, web accounts, trigger phone/Skype calls, etc. EXPORT FUNCTIONALITY vCard (as file or QR code via offline generator) HOW IT WORKS Your address-book (XML file) is transformed in your web browser (via XSLT) to a full-featured web application...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Plot

    Plot

    A DSL for writing type-safe HTML, XML and RSS in Swift

    Welcome to Plot, a domain-specific language (DSL) for writing type-safe HTML, XML and RSS in Swift. It can be used to build websites, documents and feeds, as a templating tool, or as a renderer for higher-level components and tools. It’s primary focus is on static site generation and Swift-based web development. Plot enables you to write HTML using native, fully compiled Swift code, by modeling the HTML5 standard’s various elements as Swift APIs. The result is a very lightweight DSL that...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    DiDOM

    DiDOM

    Simple and fast HTML and XML parser

    Simple and fast HTML and XML parser. DiDom allows loading HTML in several ways.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9

    EPUB Test Suite

    Tests of EPUB e-book format

    A suite of tests for e-book readers that support the EPUB e-book format. Basic HTML with CSS, as well as EPUB inclusion of raw image files, table of contents navigation, and font embedding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    xmlfy

    xmlfy

    Convert to XML on the fly

    xmlfy converts text/UTF based output into XML formatted output using schema files and/or options to control its behaviour. By Arthur Gouros.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    XSH is a powerfull command-line XML editing tool/programming language in the manner of Unix shell interpreters and line-oriented text editors like ed which can be used either interactively or for batch-mode XML processing.
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    **CODE MOVED TO GITHUB: https://github.com/bitextor ** Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    IdeoType is a book compiler that converts manuscript (XHTML) to book (PDF) on the fly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Easy Feed Editor
    ACTIVE DEVELOPMENT ENDED IN 2011. Use Easy Feed Editor to create your own feed. Easy Feed Editor supports both RSS and ATOM.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Cross-platform visual XSLT generator
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    JHOVE

    File validation and characterization

    JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE should not be confused with JHOVE2, a product with similar aims but a completely separate code base.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    xslet

    xslet project provides some XSL libraries and applets.

    xslet is the project which develops some applets with XSL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    ...Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19

    XmlDoclet

    A JavaDoc doclet that outputs source code structure in XML format.

    XmlDoclet is a JavaDoc doclet that outputs the source code structure of the packages, classes etc. in XML format. Later, the XML data may easily be processed by standard tools such as XSLT to produce HTML, PDF, dot graphs etc. Technically, this is done by wrapping the class and interfaces of the com.sun.javadoc packages into JAXB annotated classes, which allows for an easy serialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Ava: Testdata Xsl

    Ava: Testdata Xsl

    generates Testdata on base of excel: creates xml,excel,csv,html,sql,+

    this tool for test-data-generation receives an 'excel-sheet' as primary input. second important paramter is the 'number of test-records to produce'. The excel-data will be reused as long data is needed. This tool is hightly paramatrisazable by the use of 'xsl scripts'. data can be created, updated, modified and finally exported in a format of your choice Main Fuctions: (1) Generates Testdata (excel, xsl, xml) (2) Exports generated testdata in multiple formats (csv, excel, html,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The Bee Framework is a PHP 5 DI/IoC application framework based on the popular Spring Framework, bringing - among other features - XML-configured declarative Dependency Injection and a powerful MVC framework to PHP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Observation Manager
    Java based astronomical logging software which stores it's data in a free and open XML based format (OpenAstronomyLog). Discontinued project: Please check out the fork: https://github.com/capape/observation-manager for an updated version
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    Jaxe
    Jaxe is a free Java XML editor with a configurable GUI, using XML schemas for validation and XSL for exports in HTML or XML.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    trafalgar.map

    trafalgar.map

    Open Street Map (OSM) tools, OSM/XML parser, tag extractor

    This is going to be a set of tools which is intended to be used with huge OSM files like the planet files in XML format. The parser reads directly from packed *.gz files and it is not needed to unpack the OSM/XML data files to the local disk. Now in 0.3.0: osm_tags: tag analyzer (like tag watch) osm_split: split osm file in single files for nodes, ways and relations and collect some meta information (will be used as input for other tools). osm_cut: create rectangular...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB