Search Results for "html source extractor" - Page 2

Showing 6066 open source projects for "html source extractor"

View related business solutions
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    parse5

    parse5

    HTML parsing/serialization toolset for Node.js.

    HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant. parse5 provides nearly everything you may need when dealing with HTML. It's the fastest spec-compliant HTML parser for Node to date. It parses HTML the way the latest version of your browser does. It has proven itself reliable in such projects as jsdom, Angular, Lit, Cheerio, rehype and many more.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    NGX-Translate

    NGX-Translate

    The internationalization (i18n) library for Angular

    NGX-Translate is an internationalization library for Angular. It lets you define translations for your content in different languages and switch between them easily. It gives you access to a service, a directive and a pipe to handle any dynamic or static content. NGX-Translate is also extremely modular. It is written in a way that makes it really easy to replace any part with a custom implementation in case the existing one doesn't fit your needs. The main part of the library is named core....
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Sanitize

    Sanitize

    Ruby HTML and CSS sanitizer

    Sanitize is an allowlist-based HTML and CSS sanitizer. It removes all HTML and/or CSS from a string except the elements, attributes, and properties you choose to allow. Using a simple configuration syntax, you can tell Sanitize to allow certain HTML elements, certain attributes within those elements, and even certain URL protocols within attributes that contain URLs. You can also allow specific CSS properties, @ rules, and URL protocols in elements or attributes containing CSS. Any HTML or...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    Ksoup

    Ksoup

    Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML

    Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities. ​
    Downloads: 6 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    pagedown

    pagedown

    Paginate the HTML Output of R Markdown with CSS for Print

    Paginate the HTML Output of R Markdown with CSS for Print. You only need a modern web browser (e.g., Google Chrome or Microsoft Edge) to generate PDF. No need to install LaTeX to get beautiful PDFs. This R package stands on the shoulders of two giants to support typesetting with CSS for R Markdown documents: Paged.js and ReLaXed (we only borrowed some CSS from the ReLaXed repo and didn't really use the Node package).
    Downloads: 18 This Week
    Last Update:
    See Project
  • 6
    Emogrifier

    Emogrifier

    Converts CSS styles into inline style attributes in your HTML code

    Emogrifier converts CSS styles into inline style attributes in your HTML code. This ensures proper display on email and mobile device readers that lack stylesheet support. This utility was developed as part of Intervals to deal with the problems posed by certain email clients (namely Outlook 2007 and GoogleMail) when it comes to the way they handle styling contained in HTML emails. As many web developers and designers already know, certain email clients are notorious for their lack of CSS...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    Floki

    Floki

    Floki is a simple HTML parser that enables search for nodes using CSS

    Floki is a simple HTML parser that enables search for nodes using CSS selectors. Floki needs the :leex module in order to compile. Normally this module is installed with Erlang in a complete installation. By default, Floki uses a patched version of mochiweb_html for parsing fragments due to its ease of installation (it's written in Erlang and has no outside dependencies). fast_html is generally faster, according to the benchmarks conducted by its developers.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    Juice

    Juice

    Juice inlines CSS stylesheets into your HTML source

    Given HTML, juice will inline your CSS properties into the style attribute. Juice has a number of functions based on whether you want to process a file, HTML string, or a cheerio document, and whether you want the juice to automatically get remote stylesheets, scripts, and image dataURIs to inline. To inline HTML without getting remote resources, using default options. Juice is exposed as a standard module, and from CLI with a smaller set of options.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    kotlinx.html

    kotlinx.html

    Kotlin DSL for HTML

    A kotlinx.html library provides DSL to build HTML to Writer/Appendable or DOM. Available to all Kotlin Multiplatform targets and browsers (or other JavaScript engines) for better Kotlin programming for the Web.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Lume

    Lume

    Create 3D web applications with HTML, bring a new depth to your DOM

    A toolkit that simplifies the creation of rich and interactive 2D or 3D experiences. LUME is composed of several packages that can be used individually, or together as a whole. HTML elements for easily defining rich and interactive 2D or 3D applications powered by CSS3D, WebGL, or both mixed together. This is a web component system that allows you to create new, fast, and performant HTML elements in a simple way. It provides the foundation for LUME's HTML elements and a standard pattern for...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    htmlparser2

    htmlparser2

    The fast & forgiving HTML and XML parser

    The fast & forgiving HTML and XML parser. htmlparser2 is the fastest HTML parser, and takes some shortcuts to get there. If you need strict HTML spec compliance, have a look at parse5. htmlparser2 itself provides a callback interface that allows the consumption of documents with minimal allocations. While the Parser interface closely resembles Node.js streams, it’s not a 100% match. Use the WritableStream interface to process a streaming input.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    TEMPL

    TEMPL

    A language for writing HTML user interfaces in Go

    Create components that render fragments of HTML and compose them to create screens, pages, documents, or apps.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 13
    Tesseract OCR

    Tesseract OCR

    Open Source OCR Engine

    ...Tesseract can recognize over 100 languages out-of-the-box, and can be trained to recognize other languages. It supports various output formats, including plain text, HTML, PDF and more. It also has unicode (UTF-8) support.
    Downloads: 3,404 This Week
    Last Update:
    See Project
  • 14
    DOMPurify

    DOMPurify

    XSS sanitizer for HTML, MathML and SVG

    A DOM-only, super-fast, uber-tolerant XSS sanitizer for HTML, SVG and MathML. It's also very simple to use and get started with. DOMPurify was started in February 2014 and, meanwhile, has reached version 2.3.0. DOMPurify is written in JavaScript and works in all modern browsers (Safari (10+), Opera (15+), Internet Explorer (10+), Edge, Firefox and Chrome - as well as almost anything else using Blink or WebKit). It doesn't break on MSIE6 or other legacy browsers. It either uses a fall-back or...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    HTMLProofer

    HTMLProofer

    Test your rendered HTML files to make sure they're accurate.

    HTMLProofer is a set of tests to validate your HTML output. These tests check if your image references are legitimate, if they have alt tags, if your internal links are working, and so on. It's intended to be an all-in-one checker for your output. In scope for this project is any well-known and widely-used test for HTML document quality. A major use for this project is continuous integration -- so we must have reliable results. We usually balance correctness over performance. And, if...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    Beautymail for Laravel

    Beautymail for Laravel

    Send beautiful HTML emails with Laravel

    Beautymail makes it super easy to send beautiful responsive HTML emails. Send beautiful HTML emails with Laravel.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 17
    Pico.css

    Pico.css

    Minimal CSS Framework for semantic HTML

    Minimal CSS Framework for semantic HTML. Elegant styles for all native HTML elements without .classes and dark mode automatically enabled. Pico uses simple native HTML tags as much as possible. Less than 10 .classes are used in Pico. No dependencies, package manager, external files, or JavaScript. Elegant and consistent adaptive spacings and typography on all devices. Shipped with two beautiful color themes, automatically enabled according to the user preference. We use simple native HTML...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 18
    Angular DataTables

    Angular DataTables

    DataTables with Angular

    An Angular2+ library for building complex HTML tables using DataTables JQuery plug-in. Implementation of the example on custom filtering with range search. The HTML element provides a Promise that returns the instance of the DataTable. Implementation of the example on individual column searching (text inputs). Sometimes, your DataTable options are stored or computed server-side. All you need to do is to return the expected result as a promise. You can use Angular Pipe to transform data on...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    marko

    marko

    A declarative, HTML-based language that makes building web apps fun

    A declarative, HTML-based language that makes building web apps fun. If you know HTML, CSS, and Javascript, you know Marko. Streaming, partial hydration, an optimizing compiler, & a small runtime. Start with simple HTML templates and add powerful components as needed. Marko is powering high-traffic websites like ebay.com. Marko is HTML re-imagined as a language for building dynamic and reactive user interfaces. Just about any valid HTML is valid Marko, but Marko extends the HTML language to...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    EzXML.jl

    EzXML.jl

    XML/HTML handling tools for primates

    EzXML.jl is a package to handle XML/HTML documents for primates. This package depends on libxml2, which will be automatically installed as an artifact via XML2_jll.jl if you use Julia 1.3 or later. Currently, Windows, Linux, macOS, and FreeBSD are now supported.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21
    PostHTML

    PostHTML

    PostHTML is a tool to transform HTML/XML with JS plugins

    PostHTML is a tool for transforming HTML/XML with JS plugins. PostHTML itself is very small. It includes only an HTML parser, an HTML node tree API and a node tree stringified. All HTML transformations are made by plugins. And these plugins are just small plain JS functions, which receive an HTML node tree, transform it, and return a modified tree. PostHTML is a tool for transforming HTML/XML with JS plugins. PostHTML itself is very small. It includes only an HTML parser, an HTML node tree...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    PSWriteHTML

    PSWriteHTML

    PSWriteHTML is PowerShell Module to generate beautiful HTML reports

    PSWriteHTML is PowerShell Module to generate beautiful HTML reports, pages, emails without any knowledge of HTML, CSS or JavaScript. To get started basics PowerShell knowledge is required. PSWriteHTML is a fork of ReportHTML module. I've forked ReportHTML due to it being outdated and not maintained. I also wanted to address some problems it had and add more features. As it stands PSWriteHTML is a complete rewrite. Make sure to read blog posts provided to understand the concepts and how you...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    Lexbor

    Lexbor

    Lexbor is development of an open source HTML Renderer library

    Lexbor is the development of a web browser engine available as a software library; it ships with a free license and has no extra dependencies. For us, speed is an absolute must-have. In our development process, we focus on fastest parsing techniques for HTML, CSS, and fonts, fastest data processing methods, and fastest ways to serve content to end users. Whether you are building a backend that handles millions of HTML documents or a UI-heavy user app, your software’s response rate always...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24
    HTMLHint

    HTMLHint

    The static code analysis tool you need for your HTML

    Static code analysis tool you need for your HTML. By default, htmlhint looks for a .htmlhintrc file in the current directory and all parent directories and applies its rules when parsing a file.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 25
    HTMLPurifier for Laravel

    HTMLPurifier for Laravel

    HTMLPurifier for Laravel 5/6/7/8/9/10/11

    ...Tired of using BBCode due to the current landscape of deficient or insecure HTML filters? Have a WYSIWYG editor but have never been able to use it? Looking for high-quality, standards-compliant, open-source components for that application you're building? HTML Purifier is for you.
    Downloads: 6 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB