Showing 87 open source projects for "html source extractor"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    Sanitize

    Sanitize

    Ruby HTML and CSS sanitizer

    Sanitize is an allowlist-based HTML and CSS sanitizer. It removes all HTML and/or CSS from a string except the elements, attributes, and properties you choose to allow. Using a simple configuration syntax, you can tell Sanitize to allow certain HTML elements, certain attributes within those elements, and even certain URL protocols within attributes that contain URLs. You can also allow specific CSS properties, @ rules, and URL protocols in elements or attributes containing CSS. Any HTML or...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    HTMLProofer

    HTMLProofer

    Test your rendered HTML files to make sure they're accurate.

    HTMLProofer is a set of tests to validate your HTML output. These tests check if your image references are legitimate, if they have alt tags, if your internal links are working, and so on. It's intended to be an all-in-one checker for your output. In scope for this project is any well-known and widely-used test for HTML document quality. A major use for this project is continuous integration -- so we must have reliable results. We usually balance correctness over performance. And, if...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    Asciidoctor

    Asciidoctor

    A fast, open source text processor and publishing toolchain

    A fast text processor & publishing toolchain for converting AsciiDoc to HTML5, DocBook & more. Asciidoctor is a fast, open source, Ruby-based text processor for parsing AsciiDoc® into a document model and converting it to output formats such as HTML 5, DocBook 5, manual pages, PDF, EPUB 3, and other formats. Asciidoctor also has an ecosystem of extensions, converters, build plugins, and tools to help you author and publish content written in AsciiDoc. Asciidoctor reads the AsciiDoc source, and converts it to publishable formats, such as HTML 5. ...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 4
    GitHub Markup

    GitHub Markup

    Determines which markup library to use to render a content file

    This library is the first step of a journey that every markup file in a repository goes on before it is rendered on GitHub.com. GitHub-markup selects an underlying library to convert the raw markup to HTML. See the list of supported markup formats provided. The HTML is sanitized, aggressively removing things that could harm you and your kin—such as script tags, inline-styles, and class or id attributes. Syntax highlighting is performed on code blocks. See github/linguist for more information...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    Simple Form

    Simple Form

    Forms made easy for Rails! It's tied to a simple DSL

    Simple Form is a Ruby on Rails gem that simplifies form-building by providing a clean DSL for generating HTML forms. It works alongside Rails form helpers and supports integration with Bootstrap, Tailwind, and custom markup—allowing developers to write forms quickly without compromising layout flexibility.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    Phlex

    Phlex

    Object-oriented views in Ruby

    Phlex is a Ruby-based framework for building HTML and SVG views using object-oriented programming principles, offering a unique alternative to traditional template systems like ERB. It allows developers to write UI components entirely in Ruby, providing full control over structure, logic, and rendering without mixing HTML and templating syntax. One of its key advantages is performance, as it can render HTML extremely quickly while maintaining predictable scaling even with complex component...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Asciidoctor PDF

    Asciidoctor PDF

    Asciidoctor PDF: A native PDF converter for AsciiDoc

    A fast text processor & publishing toolchain for converting AsciiDoc to HTML5, DocBook & more. Asciidoctor is a fast, open source, Ruby-based text processor for parsing AsciiDoc® into a document model and converting it to output formats such as HTML 5, DocBook 5, manual pages, PDF, EPUB 3, and other formats. Asciidoctor also has an ecosystem of extensions, converters, build plugins, and tools to help you author and publish content written in AsciiDoc.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    jekyll-minifier

    jekyll-minifier

    Jekyll HTML/XML/CSS/JS Minifier utilising yui-compressor

    Minifies HTML, XML, CSS, JSON and JavaScript both inline and as separate files utilising yui-compressor and htmlcompressor. This was created due to the previous minifier (jekyll-press) not being CSS3 compatible, which made me frown.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Liquid

    Liquid

    Liquid markup language. Safe, customer facing template language

    Liquid is a secure, open-source templating language created by Shopify in Ruby. It enables embedding logic (loops, conditionals, filters) within safe, customer-editable templates. Commonly used for rendering storefronts, emails, and static site generation in Shopify and Jekyll-based systems.
    Downloads: 9 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    jekyll-toc

    jekyll-toc

    Jekyll plugin which generates a table of contents

    Jekyll plugin which generates a table of contents.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    wombat

    wombat

    Lightweight Ruby DSL for scraping structured data from web pages

    Wombat is a lightweight web crawling and scraping library written in Ruby that focuses on extracting structured data from web pages using a concise domain-specific language (DSL). It is designed to simplify the process of defining how information should be collected from HTML documents without requiring large amounts of scraping boilerplate code. Developers can declare the data fields they want and specify selectors or rules for retrieving them, allowing Wombat to parse and return structured...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    MailCatcher

    MailCatcher

    Catches mail and serves it through a dream

    ...Run mailcatcher, set your favorite app to deliver to smtp://127.0.0.1:1025 instead of your default SMTP server, then check it out to see the mail that's arrived so far. Shows HTML, Plain Text and Source version of messages, as applicable. Rewrites HTML enabling display of embedded, inline images/etc and opens links in a new window. Command-line options to override the default SMTP/HTTP IP and port settings. Mail appears instantly if your browser supports WebSockets, otherwise updates every thirty seconds. Sendmail-analogue command, catchmail, makes using mailcatcher from PHP a lot easier. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Jbuilder

    Jbuilder

    Generate JSON objects with a Builder-style DSL

    Jbuilder gives you a simple DSL for declaring JSON structures that beats manipulating giant hash structures. This is particularly helpful when the generation process is fraught with conditionals and loops. You can either use Jbuilder stand-alone or directly as an ActionView template language. When required in Rails, you can create views à la show.json.jbuilder (the json is already yielded). Fragment caching is supported, it uses Rails.cache and works like caching in HTML templates. If your...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 14
    WhatWeb

    WhatWeb

    Next generation web scanner

    WhatWeb is a Ruby-based web scanner for fingerprinting websites. It identifies CMS, server technologies, JavaScript frameworks, and other characteristics by analyzing HTML, headers, JavaScript, cookies, and responses. Commonly used in reconnaissance and security assessments.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Jekyll

    Jekyll

    A simple, blog-aware static site generator written in Ruby

    Jekyll is a simple, blog-aware, static site generator that’s ideal for creating personal, project, or organization sites. Jekyll is incredibly simple-- it just takes your content, renders Markdown and Liquid templates, and spits out a complete, static website ready for deployment. No configurations, databases, pesky updates and other needless complexities. Jekyll lets you focus on what really matters: your content. Jekyll is easy to install and run. You can have your own website or blog...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    Pro Git

    Pro Git

    Pro Git 2nd Edition

    Welcome to the second edition of the Pro Git book. The entire Pro Git book, written by Scott Chacon and Ben Straub and published by Apress, is available here. All content is licensed under the Creative Commons Attribution Non Commercial Share Alike 3.0 license. Print versions of the book are available on Amazon.com. What is “version control”, and why should you care? Version control is a system that records changes to a file or set of files over time so that you can recall specific versions...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 17
    DocuSeal

    DocuSeal

    Open source DocuSign alternative

    Open source document filling and signing. DocuSeal is an open-source platform that provides secure and efficient digital document signing and processing. Create PDF forms to have them filled and signed online on any device with an easy-to-use, mobile-optimized web tool. Use embeddable code snippets to seamlessly implement the document signing workflows directly on your website or app.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 18
    MCP UI

    MCP UI

    SDK for building interactive UI components over MCP for AI tools

    mcp-ui is a software development kit designed to bring interactive user interface capabilities to applications built on the Model Context Protocol (MCP). It enables developers to create rich, dynamic UI components that can be delivered from an MCP server and rendered seamlessly by a compatible client. Instead of returning only text responses, tools can provide structured UI resources such as HTML or remote-rendered components, allowing more engaging and functional interactions. mcp-ui...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    bootstrap_form

    bootstrap_form

    Repository of the bootstrap_form gem

    Official repository of the bootstrap_form gem, a Rails form builder that makes it super easy to create beautiful-looking forms using Bootstrap 5. bootstrap_form is a Rails form builder that makes it super easy to integrate Bootstrap v5-style forms into your Rails application. It provides form helpers that augment the Rails form helpers. bootstrap_forms's form helpers generate the form field and its label and all the Bootstrap mark-up required for proper Bootstrap display. bootstrap_form. You...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    kimuraframework

    kimuraframework

    AI-first Ruby framework for building fast, flexible web scraping spide

    Kimurai is an open source web scraping framework written in Ruby that simplifies the process of building automated data extraction tools. It provides a clean domain-specific language that allows developers to define scraping logic and data schemas with minimal boilerplate code. Kimurai can use AI-assisted extraction to identify where data resides in HTML pages, automatically generating selectors that are cached for future use so subsequent scraping runs operate with pure Ruby performance. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Middleman

    Middleman

    Hand-crafted frontend development

    Middleman is a Ruby-powered static site generator that emphasizes clean templates, flexible layouts, and a great developer experience. It supports common templating languages (ERB, Haml, Slim) and content formats (Markdown, AsciiDoc), plus front-matter metadata to drive layouts, navigation, and per-page behavior. A built-in dev server with live reload keeps iteration fast, while an asset pipeline handles fingerprinting, minification, and cache-friendly builds. Extensions cover blogging,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    ajax-datatables-rails

    ajax-datatables-rails

    A wrapper around DataTable's ajax methods that allow synchronization

    DataTables is a nifty jQuery plugin that adds the ability to paginate, sort, and search your HTML tables. When dealing with large tables (more than a couple of hundred rows) however, we run into performance issues. These can be fixed by using server-side pagination, but this breaks some DataTables functionality. ajax-data tables-rails is a wrapper around DataTables ajax methods that allow synchronization with server-side pagination in a Rails app. It was inspired by this Railscast. I needed...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    damo-blog-engine

    DAMO Blog Engine is a light posting articles, comment, ...

    - generate static html from simple syntax articles - tags index generation - comment posting - smart formating : titles, images, text, ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Slim Template

    Slim Template

    Template language to reduce the syntax to the essential parts

    Slim is a Ruby template language whose goal is reduce the syntax to the essential parts without becoming cryptic. The initial design of Slim is what you see on the home page. It started as an exercise to see how much could be removed from a standard html template (<, >, closing tags, etc.). As more people took an interest in Slim, the functionality grew and so did the flexibility of the syntax. Slim will strive to maintain simplicity, but not everyone's definition of a readable syntax is the...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    VK_Periodic_Table_branch_mmmv

    VK_Periodic_Table_branch_mmmv

    Periodic Table of Chemical Elements that requires an HTTP server.

    A shallow fork of https://github.com/valter1977/Pertable/ with all goodies from upstream and most flaws/bugs by me. The Periotic Table of Chemical Elements has been implemented at upstream by Valter Kiisk https://kodu.ut.ee/~kiisk/pertable/pertable.htm and its technology is HTML+CSS+JavaScript. The JavaScript downloads data at runtime, so a HTTP server is needed to view that table of chemical elements. The HTTP server has been bundled to this project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB