Showing 1578 open source projects for "html source extractor"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    FinalRecon

    FinalRecon

    All-in-one Python web reconnaissance tool for fast target analysis

    FinalRecon is an all-in-one web reconnaissance tool written in Python that helps security professionals gather information about a target website quickly and efficiently. It combines multiple reconnaissance techniques into a single command-line utility so users do not need to run several separate tools to collect similar data. FinalRecon focuses on providing a fast overview of a web target while maintaining accuracy in the collected results. It includes modules for gathering server...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Prowler

    Prowler

    An open source security tool to perform AWS security assessment

    Prowler is an Open Source security tool to perform AWS security best practices assessments, audits, incident response, continuous monitoring, hardening, and forensics readiness. It contains more than 200 controls covering CIS, PCI-DSS, ISO27001, GDPR, HIPAA, FFIEC, SOC2, AWS FTR, ENS and custom security frameworks. Prowler is a command-line tool that helps you with AWS security assessment, auditing, hardening, and incident response.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    OverlayScrollbars

    OverlayScrollbars

    A javascript scrollbar plugin which hides native scrollbars

    A javascript scrollbar plugin that hides native scrollbars, provides custom styleable overlay scrollbars and keeps the native functionality and feeling. The plugin combines the features of native scrolling (e.g. hotkeys, speed, smoothness, touch support etc.) and custom HTML & CSS. It does not replace the native scrollbars, it just hides them to guarantee 100% native scrolling. It also provides additional and highly customizable DOM elements to style your own scrollbars. To check how...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    watercrawl

    watercrawl

    AI-ready web crawler that extracts and structures website content

    WaterCrawl is an open source web crawling and data extraction platform designed to transform website content into structured data suitable for machine learning and AI workflows. It enables developers and researchers to crawl web pages, extract meaningful information, and convert it into formats that are easier to process and analyze. It provides a modern crawling system that can automatically navigate links, control crawl depth, and collect content from targeted sections of a website....
    Downloads: 2 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    Twill

    Twill

    Twill is an open source CMS toolkit for Laravel

    Twill is an open source Laravel package that helps developers rapidly create a custom CMS that is beautiful, powerful, and flexible. By standardizing common functions without compromising developer control, Twill makes it easy to deliver a feature-rich admin console that focuses on modern publishing needs. Twill is an AREA 17 product. It was crafted with the belief that content management should be a creative, productive, and enjoyable experience for both publishers and developers. With a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Jekyll

    Jekyll

    A simple, blog-aware static site generator written in Ruby

    Jekyll is a simple, blog-aware, static site generator that’s ideal for creating personal, project, or organization sites. Jekyll is incredibly simple-- it just takes your content, renders Markdown and Liquid templates, and spits out a complete, static website ready for deployment. No configurations, databases, pesky updates and other needless complexities. Jekyll lets you focus on what really matters: your content. Jekyll is easy to install and run. You can have your own website or blog...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    LinkChecker

    LinkChecker

    Check links in web documents or full websites

    LinkChecker is a free, GPL licensed website validator. LinkChecker checks links in web documents or full websites. It runs on Python 3 systems, requiring Python 3.8 or later. The version in the pip repository may be old, to find out how to get the latest code, plus platform-specific information and other advice see doc/install.txt in the source code archive. If you do not want to install any additional libraries/dependencies you can use the Docker image which is published on GitHub...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Magnific Popup

    Magnific Popup

    Light and responsive lightbox script with focus on performance

    Magnific Popup is a responsive lightbox & dialog script with focus on performance and providing best experience for user with any device. You may put any HTML content in each gallery item and mix content types. Three simple popups with different scaling settings. Fits horizontally and vertically, only horizontally, or with no gaps, zoom animation, close icon in top-right corner. If you wish to open the popup only after image is fully loaded, you may preload image via JS. Or use scaled down...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Melt UI

    Melt UI

    A set of headless, accessible component builders for Svelte

    Melt UI is an open-source headless component builder library created specifically for the Svelte ecosystem, designed to help developers construct accessible and highly customizable user interface components. Rather than providing fully styled widgets, the library focuses on supplying the behavioral logic and accessibility patterns needed to build UI components while allowing developers to control the visual appearance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    rpc-websockets

    rpc-websockets

    JSON-RPC 2.0 implementation over WebSockets for Node.js

    The rpc-websockets library enables developers to easily implement their business logic that includes messaging between users, machines or any devices. It provides a possibility to send and receive JSON data through the WebSocket communication protocol in order to support two-way notification push, running RPC methods and firing any types of event signalling. Only clients can call RPC methods and not vice versa at the moment. Both frontend (HTML/JS-based) and backend (Node.js-based)...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Grab Framework Project

    Grab Framework Project

    Web Scraping Framework

    Grab is a python framework for building web scrapers. With Grab you can build web scrapers of various complexity, from simple 5-line scripts to complex asynchronous website crawlers processing millions of web pages. Grab provides an API for performing network requests and for handling the received content e.g. interacting with DOM tree of the HTML document. The single request/response API that allows you to build network request, perform it and work with the received content. The API is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Typecho Blogging Platform

    Typecho Blogging Platform

    A PHP blogging platform, simple and powerful

    With only 7 data tables and less than 400KB of code, a complete plug-in and template mechanism is complete. Ultra-low CPU and memory usage is enough to give full play to the maximum performance of the host. Native support for Markdown typesetting syntax, easy to read and write. Support various cloud hosts such as BAE/GAE/SAE, even in the face of sudden high traffic, it can easily cope with it. The meticulously polished operation interface is still a familiar feature, but more mature and with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    QueryList

    QueryList

    Progressive PHP web crawler framework with jQuery-like DOM parsing

    QueryList is an extensible PHP web scraping and crawling framework designed to extract and process data from web pages. It provides a simple and expressive API that allows developers to collect structured information from HTML documents using familiar DOM traversal techniques. It is built on top of phpQuery and uses CSS3 selectors similar to those found in jQuery, making it easy for developers to query and manipulate page elements during scraping tasks. QueryList supports common data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    NukeViet

    NukeViet

    NukeViet CMS is multi Content Management System

    NukeViet is the first opensource CMS in Vietnam. The lastest version - NukeViet 4 coding ground up supports lastest web technologies, including responsive web design (use HTML 5, CSS 3, Composer, XTemplate), jQuery, Ajax...) enabling you to build websites and online applications rapidly. With it own core libraries built in, NukeViet 4 is cross platforms and frameworks independent. By basic knowledge of PHP and MySQL, you can easily extend NukeViet for your purposes. NukeViet core is simply...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    crwlr

    crwlr

    Library for Rapid (Web) Crawler and Scraper Development

    This library provides kind of a framework and a lot of ready-to-use, so-called steps, that you can use as building blocks, to build your own crawlers and scrapers with. Before diving into the library, let's have a look at the terms crawling and scraping. For most real-world use cases, those two things go hand in hand, which is why this library helps with and combines both. A (web) crawler is a program that (down)loads documents and follows the links in it to load them as well. A crawler...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Middleman

    Middleman

    Hand-crafted frontend development

    Middleman is a Ruby-powered static site generator that emphasizes clean templates, flexible layouts, and a great developer experience. It supports common templating languages (ERB, Haml, Slim) and content formats (Markdown, AsciiDoc), plus front-matter metadata to drive layouts, navigation, and per-page behavior. A built-in dev server with live reload keeps iteration fast, while an asset pipeline handles fingerprinting, minification, and cache-friendly builds. Extensions cover blogging,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    NodeBB

    NodeBB

    Node.js based forum software built for the modern web

    A better community platform for the modern web. NodeBB is next generation forum software. It's powerful, mobile-ready and easy to use. NodeBB Forum Software is powered by Node.js and supports either Redis, MongoDB, or a PostgreSQL database. It utilizes web sockets for instant interactions and real-time notifications. NodeBB has many modern features out of the box such as social network integration and streaming discussions, while still making sure to be compatible with older browsers. From...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    PHPScraper

    PHPScraper

    A universal web-util for PHP

    PHPScraper is a universal web-scraping util for PHP, built with simplicity in mind. The goal is to make xPath Selectors optional and avoid the commonly needed boilerplate code. Just create an instance of PHPScraper, go to a website, and start collecting data. All scraping functionality can be accessed either as a function call or a property call. For example, the title can be accessed in two ways. Many common use cases are covered already. You can find prepared extractors for various HTML...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Sarg - Squid Analysis Report Generator is a tool that allow you to view "where" your users are going to on the Internet. Sarg generate HTML reports, with informations about users, IP Addresses, bytes, sites and times.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 20

    damo-blog-engine

    DAMO Blog Engine is a light posting articles, comment, ...

    - generate static html from simple syntax articles - tags index generation - comment posting - smart formating : titles, images, text, ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Lobo Evolution - Java Web Browser

    Lobo Evolution - Java Web Browser

    Lobo Evolution is an extensible all-Java web browser and RIA platform

    ...I'm waiting your first commit! Source code: https://github.com/LoboEvolution/LoboEvolution
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    WhakerKit

    WhakerKit

    A seamless toolkit to manage dynamic websites and shared documents

    WhakerKit is a versatile toolkit for building websites with both static and dynamic HTML pages, developed by Brigitte Bigi, CNRS. WhakerKit offers seamless management of public and authenticated access, and simplifies document sharing for collaborative environments. It is based on the following technologies: * python >= 3.9 * (optional) PyJWT and ldap3 for authentication (install with pip) * WhakerPy >= 1.3: <https://whakerpy.sourceforge.io> (install with pip) * Whakerexa >=...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    AWStats

    AWStats

    AWStats Log Analyzer

    AWStats is a free powerful and featureful server logfile analyzer that shows you all your Web/Mail/FTP statistics including visits, unique visitors, pages, hits, rush hours, os, browsers, search engines, keywords, robots visits, broken links and more
    Leader badge
    Downloads: 274 This Week
    Last Update:
    See Project
  • 24
    uix-kit

    uix-kit

    A free web kits for fast web design and development

    Source: https://github.com/xizon/uix-kit
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Admidio

    Admidio

    Manage your organization online

    Admidio is a free open source user management system for websites of organizations and groups. The system has a flexible role model so that it’s possible to reflect the structure and permissions of your organization. You can create an individual profile for your members by adding or removing fields. Additional to these functions the system contains several modules like member lists, event manager, guestbook, photo album or download area.
    Leader badge
    Downloads: 143 This Week
    Last Update:
    See Project