Showing 1816 open source projects for "html source extractor"

  • 1
    WebMagic

    A scalable web crawler framework for Java

    WebMagic is a scalable crawler framework. It covers the whole crawler lifecycle: downloading, URL management, content extraction, and persistence. It can simplify the development of a specific crawler, and you can build one easily on top of it. WebMagic has a simple core with high flexibility and a simple API for HTML extraction. It also provides POJO annotations to customize a crawler, with no configuration needed. Some other...
    Downloads: 7 This Week
  • 2
    Redcarpet

    The safe Markdown parser, reloaded

    Redcarpet is written with sugar, spice and everything nice. Redcarpet is a Ruby library for Markdown processing that smells like butterflies and popcorn. Redcarpet would not be possible without the Sundown library and its authors (Natacha Porté, Vicent Martí, and its many awesome contributors). Redcarpet is readily available as a Ruby gem. It will build some native extensions, but the parser is standalone and requires no installed libraries. Starting with Redcarpet 3.0, the minimum required...
    Downloads: 0 This Week
  • 3
    LangExtract

    A Python library for extracting structured information

    LangExtract is a Python library developed by Google that leverages large language models (LLMs) to extract structured information from unstructured text—such as clinical notes, research papers, or literary works—based on user-defined instructions. It is designed to transform free-form text into reliable, schema-constrained data while maintaining traceability back to the source material. Each extracted entity is precisely grounded in its original context, allowing visual inspection and validation via automatically generated interactive HTML visualizations. LangExtract supports a wide range of models, including Google Gemini, OpenAI GPT, and local LLMs via Ollama, making it adaptable to different deployment environments and compliance needs. ...
    Downloads: 10 This Week
  • 4
    LCOV

    LCOV

    LCOV is an extension of GCOV, a GNU tool that provides information about what parts of a program are actually executed (i.e. "covered") while running a particular test case. The extension consists of a set of Perl scripts that builds on the textual GCOV output to implement the following enhanced functionality,
    Downloads: 18 This Week
  • 5
    Modern.js

    The meta-framework suite designed from scratch for web development

    The meta-framework suite was designed from scratch for frontend-focused modern web development. Both "server-side routing" and "client-side routing" are supported. Serverless paradigm of "dynamic and static integrated Web". Low-threshold, full-featured, integrated "BFF" development. Modern.js provides out-of-the-box support for mainstream IDEs such as VS Code, and has features such as automatic detection and repair of Lint problems, access checks before code submission, etc., which can make...
    Downloads: 9 This Week
  • 6
    LaRecipe

    Write gorgeous documentation for your products using Markdown

    Write gorgeous documentation with Markdown inside your Laravel apps. LaRecipe is a code-driven package that provides an easy way to create beautiful documentation for your product or application inside your Laravel app. Configure your entire docs with simple PHP code. None of your docs configuration is stored in the database. LaRecipe provides a set of Vue-based UI components powered by TailwindCSS. LaRecipe allows you to add your own custom assets through custom packages in...
    Downloads: 9 This Week
  • 7
    Email Templates

    Create, preview, and send custom email templates for Node.js

    Create, preview, and send custom email templates for Node.js. Highly configurable and supports automatic inline CSS, stylesheets, embedded images and fonts, and much more! Made for sending beautiful emails with Lad. We've added preview email by default to this package! If you run into any issues with configuration, files, templates, locals, etc, then you can use the DEBUG environment flag. You can swap the transport option with a Nodemailer transport configuration object or transport...
    Downloads: 9 This Week
  • 8
    Prettier

    Opinionated Code Formatter

    ...No need to spend so much time, effort and mental energy discussing styles or formatting your code. Simply press save and your code is formatted. Prettier supports several languages including JavaScript, TypeScript, JSON, CSS, HTML and more. It also integrates with most editors (Atom, Visual Studio, Vim, Espresso and more).
    Downloads: 45 This Week
  • 9
    Meraki UI

    Tailwind CSS components that support RTL languages

    Meraki UI is a free and open source collection of Tailwind CSS components designed to help developers quickly build responsive and visually appealing web interfaces with minimal effort. The library focuses on simplicity and accessibility by providing copy-and-paste HTML components that can be easily integrated into any project without requiring complex setup or framework dependencies.
    Downloads: 7 This Week
  • 10
    docToolchain

    AsciiDoc Toolchain for technical Software Documentation

    ...It provides a suite of scripts and integrations to generate documentation from AsciiDoc (and related formats), run automated checks, incorporate diagrams, and publish outputs in different formats (HTML, PDF, etc.). The framework encourages documentation and architecture artifacts to live alongside source code, versioned in the same repositories, so changes in code and docs can evolve together. It supports templates (such as the arc42 architecture template), reusable snippets, and modular documentation where different aspects (API docs, architecture, user guides) are composed. ...
    Downloads: 7 This Week
  • 11
    GoNB

    GoNB, a Go Notebook Kernel for Jupyter

    Go is a compiled language, but with very fast compilation, that allows one to use it in a REPL (Read-Eval-Print-Loop) fashion, by inserting a "Compile" step in the middle of the loop -- so it's a Read-Compile-Run-Print-Loop — while still feeling very interactive. GoNB leverages that compilation speed to implement a full-featured (at least it's getting there) Jupyter notebook kernel. As a side benefit it works with packages that use CGO — although it won't parse C code in the cells, so it...
    Downloads: 6 This Week
  • 12
    Primitive UI

    A front-end design toolkit for developing web apps

    Primitive is a minimalist Sass boilerplate and front-end design toolkit for developing responsive web apps. It provides helpful, browser-consistent styling for default HTML elements: buttons, forms, tables, lists, and typography. There are three container sizes for content that are centered with left and right padding. Content without a...
    Downloads: 2 This Week
  • 13
    Cheerio

    Implementation of core jQuery designed for the server

    Fast, flexible & lean implementation of core jQuery designed specifically for the server. Cheerio implements a subset of core jQuery. Cheerio removes all the DOM inconsistencies and browser cruft from the jQuery library, revealing its truly gorgeous API. Cheerio works with a very simple, consistent DOM model. As a result parsing, manipulating, and rendering are incredibly efficient. Cheerio wraps around parse5 parser and can optionally use @FB55's forgiving htmlparser2. Cheerio can parse...
    Downloads: 8 This Week
  • 14
    Markdown to PDF

    Hackable CLI tool for converting Markdown files to PDF using Node.js

    A simple and hackable CLI tool for converting Markdown to PDF. It uses Marked to convert Markdown to HTML and Puppeteer (headless Chromium) to further convert the HTML to PDF. It also uses highlight.js for code highlighting. The whole source code of this tool is only ~500 lines of TypeScript and ~100 lines of CSS, so it is easy to clone and customize.
    Downloads: 2 This Week
  • 15
    Trunk

    Build, bundle & ship your Rust WASM application to the web

    ...The easiest way to ensure that your application launches properly is to set up your app as an executable with a standard main function. Trunk uses a source HTML file to drive all asset building and bundling. Trunk also uses the official dart-sass.
    Downloads: 1 This Week
  • 16
    Rubick

    Electron based open source toolbox, free integration of rich plug-ins

    An open-source toolbox based on Electron that freely integrates rich plug-ins to create the ultimate desktop efficiency tool. Rubick is one of the heroes of Dota; his core skill is the ability to use other heroes' skills as plug-ins and drop them when finished, which closely matches the design concept of this tool, hence the name Rubick. Plug-in management is based on the npm package model, so installing plug-ins is as easy as installing npm packages. System plug-ins are supported: as long as Rubick is running, plug-ins can be used...
    Downloads: 8 This Week
  • 17
    Tacit

    CSS framework for dummies, without a single CSS class

    Tacit is a primitive CSS framework for dummies who, like myself, have almost zero skills in graphic design but want their web services to look attractive. No classes, no layouts. Just design plain and simple web pages compliant with HTML5 and they will look OK. Tacit's goal is to be super simple and always have the same look-and-feel. If you want...
    Downloads: 0 This Week
  • 18
    Laravel Backpack

    Build custom admin panels

    It's never been easier to build and customize admin panels using Laravel. See why thousands of professionals have been using Backpack for Laravel, every day. Similar to how you'd build it without Backpack. To create a management page (aka CRUD), you create a CrudController. In there, configure or overwrite whatever you want. Load our views, or create your own. Laravel, Bootstrap & jQuery. Those are all the technologies you need to know, to customize anything in Backpack. You can totally use...
    Downloads: 7 This Week
  • 19
    jekyll-minifier

    Jekyll HTML/XML/CSS/JS Minifier utilising yui-compressor

    Minifies HTML, XML, CSS, JSON and JavaScript both inline and as separate files utilising yui-compressor and htmlcompressor. This was created due to the previous minifier (jekyll-press) not being CSS3 compatible, which made me frown.
    Downloads: 1 This Week
  • 20
    UIkit

    Lightweight modular front-end framework for developing web interfaces

    A lightweight and modular front-end framework for developing fast and powerful web interfaces. UIkit will work on pretty much any modern browser. Using UIkit works best if you have a solid code editor, for example Sublime Text or Atom. To be even more efficient, we recommend that you install one of the autocomplete plugins for your favorite IDE or code editor. This saves a lot of time, as you won't have to look up and type all UIkit classes and markup. To use UIkit's CSS and JavaScript,...
    Downloads: 10 This Week
  • 21
    go-wkhtmltopdf

    Golang commandline wrapper for wkhtmltopdf

    ...You can combine any number of external HTML documents (HTTP(S) links) with at most one HTML document from stdin and set options for each input document.
    Downloads: 4 This Week
  • 22
    media-chrome

    Custom elements (web components) for making audio and video player

    media-chrome is an open source library that provides fully customizable media player controls using native web components, allowing developers to design consistent and flexible audio and video player interfaces across different platforms and frameworks. Instead of relying on default browser controls or proprietary player APIs, Media Chrome introduces a set of reusable custom elements that can be composed using standard HTML, styled with CSS, and integrated into any JavaScript framework including React, Angular, and Svelte. ...
    Downloads: 8 This Week
  • 23
    Vexip UI

    Vue 3 UI library, highly customizable, full TypeScript, performance

    Highly customizable, fully TypeScript, with pretty good performance. This library is based on Vue 3.0 and uses the Composition API; components are designed and coded in the traditional Vue way wherever possible, fully in TypeScript. Almost all default prop values for each component can be quickly modified through configuration, for easy customization. The component code is also written with great attention to lowering the barrier to reading the source, and the code style is as close to the...
    Downloads: 8 This Week
  • 24
    Tailwind CSS

    A utility-first CSS framework for rapid UI development

    Rapidly build modern websites without ever leaving your HTML. A utility-first CSS framework packed with classes like flex, pt-4, text-center and rotate-90 that can be composed to build any design, directly in your markup. Utility classes help you work within the constraints of a system instead of littering your stylesheets with arbitrary values. They make it easy to be consistent with color choices, spacing, typography, shadows, and everything else that makes up a well-engineered design...
    Downloads: 136 This Week
  • 25
    libssh2

    the SSH library

    libssh2 is a client-side C library implementing the SSH2 protocol. libssh2 offers a large number of functions, and this is an attempt to provide HTML versions of the man pages present in the source tree. These pages are updated automatically from the source code repository.
    Downloads: 7 This Week