161 projects for "html page" with 2 filters applied:

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    single-file-cli

    single-file-cli

    CLI tool to save complete web pages as single self-contained HTML file

    SingleFile CLI is an open source command-line tool designed to save complete web pages as a single self-contained HTML file. It captures the rendered page in a headless browser and embeds all required resources directly into the output document, including stylesheets, scripts, images, and fonts. By consolidating every dependency into one file, it allows users to preserve a faithful copy of a web page that can be viewed offline without requiring external assets. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Spider

    Spider

    High-performance Rust web crawler and scraper for large-scale data

    ...It focuses on speed, concurrency, and reliability by using asynchronous and multi-threaded processing to handle large volumes of web pages. It can rapidly crawl websites to collect links, retrieve page content, and extract structured information from HTML documents. Spider can operate concurrently across many pages, allowing it to gather large datasets in a short period of time. Spider also provides mechanisms for subscribing to crawl events so developers can process page data such as URLs, status codes, or HTML content as it is discovered. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    QueryList

    QueryList

    Progressive PHP web crawler framework with jQuery-like DOM parsing

    QueryList is an extensible PHP web scraping and crawling framework designed to extract and process data from web pages. It provides a simple and expressive API that allows developers to collect structured information from HTML documents using familiar DOM traversal techniques. It is built on top of phpQuery and uses CSS3 selectors similar to those found in jQuery, making it easy for developers to query and manipulate page elements during scraping tasks. QueryList supports common data extraction scenarios such as retrieving lists of titles, links, images, and other page elements from structured or semi-structured content. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Toapi

    Toapi

    Convert websites into structured APIs automatically with Python tool

    ...Instead of building a traditional web crawler that collects and stores data before exposing it through an API, Toapi simplifies the process by allowing developers to define data structures that automatically generate an API layer from existing web pages. It works by parsing HTML content from a source site and mapping selected elements into structured data that can be returned as JSON through API endpoints. Developers define items and routes that determine how web pages are parsed and how the resulting data is exposed through the API interface. It also includes mechanisms for caching both page content and API requests, helping reduce repeated network calls and improving performance. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 5
    wombat

    wombat

    Lightweight Ruby DSL for scraping structured data from web pages

    Wombat is a lightweight web crawling and scraping library written in Ruby that focuses on extracting structured data from web pages using a concise domain-specific language (DSL). It is designed to simplify the process of defining how information should be collected from HTML documents without requiring large amounts of scraping boilerplate code. Developers can declare the data fields they want and specify selectors or rules for retrieving them, allowing Wombat to parse and return structured...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Dev Browser

    Dev Browser

    A Claude Skill to give your agent the ability to use a web browser

    ...It supports flexible execution modes, allowing full-script runs when tasks are straightforward and step-by-step exploration when the agent needs to inspect or iterate carefully. To make automation more reliable for language models, it provides LLM-friendly DOM snapshots that structure the page in a way that is easier for an agent to interpret than raw HTML dumps.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    Linkwarden

    Linkwarden

    Self-hosted collaborative bookmark manager

    Linkwarden is a self-hosted, open-source bookmark manager built to help individuals and teams collect, organize, and preserve important web content in a way that stays useful long after the original pages change or disappear. Instead of saving only a URL, it captures durable archived formats so your saved knowledge remains accessible even when link rot happens. The experience is designed to feel like a modern “read-it-later” tool, with a reader view that makes long articles easier to consume...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    watercrawl

    watercrawl

    AI-ready web crawler that extracts and structures website content

    WaterCrawl is an open source web crawling and data extraction platform designed to transform website content into structured data suitable for machine learning and AI workflows. It enables developers and researchers to crawl web pages, extract meaningful information, and convert it into formats that are easier to process and analyze. It provides a modern crawling system that can automatically navigate links, control crawl depth, and collect content from targeted sections of a website....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    A Cake 2 PHP
    This project features a basic php/mysql content management and a public customizable interface. (CakePHP features) multi-user access, e-shop profiles, CRUD posting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 10
    command-output-to-html-table

    command-output-to-html-table

    A shell script to convert any file or command output into a html table

    Please watch the video below, to convert any file or a command output into a nice html table, in less than 5 Minutes time. The output html file can then be browsed from any location, using a local webserver or an internet www domain. Usage Examples: (Type them on Terminal) cd ~/Downloads/tabulate # location chmod +x *.sh cat "student_marks.csv" | { cat ; echo ; } | ./tabulate.sh -d "," -t "My School" -h "First Term" > "marks.html" # or > "/var/www/html/marks.html" -d specifies delimiting character, which is in between your input columns -t specifies page title -h specifies a header Browse marks.html with browser now, either by double clicking file, from file manager or inputting location on your browser. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    Extended Page Navigation

    Extended Page Navigation

    Extended Page Navigation Plugin for Joomla

    The purpose of this plugin is to replace the version of Page Navigation shipped with Joomla! core by an enhanced, more modern variation that adds extended navigation functionality to articles. Version 2.0 of the plugin is compatible with both 1.5 and 2.5 versions of Joomla!, however there will be no support for 1.5, new features will be only available in Joomla! 2.5 and higher.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Extended Content Vote

    Extended Content Vote

    Extended Content Vote Plugin for Joomla

    The purpose of this plugin is to replace the version of Content Vote shipped with Joomla! core by an enhanced, more flexible variation that adds extended voting functionality to Joomla! articles and other items. Version 2.0 of the plugin is compatible with both 1.5 and 2.5 versions of Joomla!, however there will be no support for 1.5, new features will be only available in Joomla! 2.5 and higher.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Extended Pagebreak

    Extended Pagebreak

    Extended Pagebreak Plugin for Joomla

    The purpose of this plugin is to replace the version of Pagebreak shipped with Joomla! core by an enhanced, more modern variation that adds extended Table of Contents functionality to a paginated article. This plugin is safe for printing and PDF production (f.i. through PDF Preprocessor).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    K-Framework
    The KFramework is the first integral SOFEA/SOUI framework for web based business applications using Domain Driven Design. The framework provides a web delivered SWING frontend and a WebServices based backend.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    JDynamiTe, Dynamic Template in Java

    JDynamiTe, Dynamic Template in Java

    Dynamically generate documents from templates

    JDynamiTe is a tool which allows you to dynamically create documents in any format from "template" documents. And very few lines of code (or no line at all!) are needed to do that. Some typical usage domains of JDynamiTe are: - dynamic Web pages creation, - text document generation, - source code generation... In fact, it can be useful in any case where pre-defined documents (templates) have to be dynamically populated with data. The main benefit of JDynamiTe is to allow a true...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Abot

    Abot

    Fast and flexible C# framework for building customizable web crawlers

    Abot is an open source C# web crawler framework designed to help developers efficiently crawl and process web content. It focuses on speed, flexibility, and extensibility while handling the complex low-level tasks involved in web crawling. It manages essential components such as multithreading, HTTP requests, scheduling, and link parsing so developers can focus on processing the collected data. Abot follows a modular architecture that allows developers to customize nearly every stage of the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    gocrawl

    gocrawl

    Polite concurrent web crawler library for Go with flexible hooks

    ...It is designed to be polite when accessing websites by respecting crawling rules such as robots.txt policies and applying crawl delays for each host. It executes requests concurrently using Go’s goroutines, allowing efficient and scalable page retrieval across multiple URLs. Developers have full control over the crawling workflow, including which URLs are visited, inspected, and processed during execution. gocrawl integrates with HTML parsing tools so responses can be inspected and queried in a structured way while crawling. Instead of implementing a full search indexing pipeline, the library provides the core crawling engine and extension hooks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CSSBox

    CSSBox

    Pure Java HTML / CSS rendering engine

    CSSBox is an (X)HTML/CSS rendering engine written in pure Java. Its primary purpose is to provide a complete information about the rendered page suitable for further processing. However, it also allows displaying the rendered document.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    ...The main project page for iText is now on GitHub, and all the latest releases, code samples, open source add-ons and tools, etc. can be found at https://github.com/itext/.
    Downloads: 107 This Week
    Last Update:
    See Project
  • 20
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    OpenAjax Alliance
    OpenAjax Alliance's (http://www.openajaxalliance.org) open source project that implements technologies defined by OpenAjax Alliance. The first effort is the OpenAjax Hub, a JavaScript module that enables multiple Ajax runtimes on the same Web page.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
    Leader badge
    Downloads: 22 This Week
    Last Update:
    See Project
  • 23
    The MangaStream Downloader is an open source application written in Java for managing and downloading manga from the site mangastream.com and mangafox.me. It is written under the GNU-GPL license and uses an open source HTML parser - TagSoup. Follow the project page on Facebook for updates: https://www.facebook.com/MangastreamDownloader
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Gecco

    Gecco

    Lightweight Java web crawler framework with jQuery-style extraction

    Gecco is a lightweight web crawler framework written in Java that simplifies the process of building web scraping applications. It is designed to make crawler development straightforward by allowing developers to extract page elements using jQuery-style selectors rather than complex parsing logic. It integrates several well-known Java libraries and frameworks, including tools for HTTP requests, HTML parsing, JSON processing, and application development. Through its annotation-based design, developers can define crawling rules and data extraction logic directly within Java classes, reducing boilerplate code and improving readability. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    doorGets CMS - Responsive/Bootstrap PHP

    doorGets CMS - Responsive/Bootstrap PHP

    doorGets CMS PHP offers a free, Open-source Website solution

    Demo: www.doorgets.com/demo/ Content management system (CMS) open source and free, allowing you to create a collaborative website with versioning of all your publications. Create your professional responsive website! Get started in less than 10 minutes is 100% free! 20 Included languages English, Français, Deutsch, Español, Polski, Український, Pусский, Türk, Português, Svenska, Italiano, Italiano, Slovenčina, Indonesia, हिंदी, 日本の, 한국의, ภาษาไทย, 中国, עברית, العربية 17 Themes...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
Auth0 Logo