Search Results for "extract links in html files"

Showing 119 open source projects for "extract links in html files"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 1
    monolith

    monolith

    CLI tool for saving complete web pages as a single HTML file

    A data hoarder’s dream come true, bundle any web page into a single HTML file. You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive. Unlike the conventional “Save page as”, monolith not only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share. If compared to saving websites with wget -mpk, this tool embeds all assets...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 2
    Goutte

    Goutte

    Goutte, a simple PHP Web Scraper

    Goutte is a screen scraping and web crawling library for PHP. Goutte provides a nice API to crawl websites and extract data from the HTML/XML responses. Goutte depends on PHP 7.1+. Add fabpot/goutte as a require dependency in your composer.json file. Create a Goutte Client instance (which extends Symfony\Component\BrowserKit\HttpBrowser). Make requests with the request() method. The method returns a Crawler object (Symfony\Component\DomCrawler\Crawler). To use your own HTTP settings, you may...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    PyMuPDF

    PyMuPDF

    Python bindings for MuPDF's rendering library.

    ..., and FictionBook 2. You can annotate PDF documents and fill out forms with the mobile viewers (this feature is coming soon to the desktop viewer as well). The command line tools allow you to annotate, edit, and convert documents to other formats such as HTML, SVG, PDF, and CBZ. You can also write scripts to manipulate documents using Javascript. The library is written modularly in portable C, so features can be added and removed by integrators if they so desire.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    Image Toolbox

    Image Toolbox

    Image Toolbox is an powerful picture editor, which can crop

    Image Toolbox is a powerful picture editor, which can crop, apply filters, add some drawings, erase background, edit EXIF, or even create a PDF file.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Red Hat Enterprise Linux on Microsoft Azure Icon
    Red Hat Enterprise Linux on Microsoft Azure

    Deploy Red Hat Enterprise Linux on Microsoft Azure for a secure, reliable, and scalable cloud environment, fully integrated with Microsoft services.

    Red Hat Enterprise Linux (RHEL) on Microsoft Azure provides a secure, reliable, and flexible foundation for your cloud infrastructure. Red Hat Enterprise Linux on Microsoft Azure is ideal for enterprises seeking to enhance their cloud environment with seamless integration, consistent performance, and comprehensive support.
    Learn More
  • 5
    Markdown Monster

    Markdown Monster

    An extensible Markdown Editor, Viewer and Weblog Publisher for Windows

    ... and word counts keep your content streamlined. You can export Markdown to PDF or HTML on disk or copy Markdown selections as HTML to the clipboard. The HTML preview can display syntax-colored code snippets for most coding languages. Choose from light or dark app themes, and individual and fully customizable preview themes. Use the built-in folder browser to open, manage and drag files into content, use the document outline to quickly jump through content, or use our shell integration.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    WeasyPrint

    WeasyPrint

    The awesome document factory

    ..., tables of contents, links, annotations, optimized images, attachments, WeasyPrint provides many features out of the box, and even gives you the possibility to add your own ways to customize your PDF files. Digital fonts are finely tuned pieces of artwork. To give to your documents the subtle touch they deserve, carefully choose the options you want, kerning, ligatures, old-style numbers, tabular figures, ordinals, etc.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Notally

    Notally

    A beautiful notes app

    Notally was created because I wanted to make something that was beautiful and at the same time, useful. It's extremely light, there are minimal dependencies and lines of code.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Super-PDF-Editor-Lite

    Super-PDF-Editor-Lite

    World's most comprehensive, powerful, process-based PDF editor

    World's most comprehensive, powerful, process-based and lighting fast PDF reader, editor and batch processor. Includes features like Create PDF from Images, HTML, Text files. Create a processing log file. Extract Page, Split Page, Rotate Page, Merge Page, Duplicate page, Move Page, Printing, and Compress Page. Improve image enhancement before OCR operation for better OCR performance. pdf Imposition, etc. Super PDF Editor is best for bulk pdf processing, especially for the printing industry...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 9
    HTMLProofer

    HTMLProofer

    Test your rendered HTML files to make sure they're accurate.

    HTMLProofer is a set of tests to validate your HTML output. These tests check if your image references are legitimate, if they have alt tags, if your internal links are working, and so on. It's intended to be an all-in-one checker for your output. In scope for this project is any well-known and widely-used test for HTML document quality. A major use for this project is continuous integration -- so we must have reliable results. We usually balance correctness over performance. And, if necessary...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Deliver secure remote access with OpenVPN. Icon
    Deliver secure remote access with OpenVPN.

    Trusted by nearly 20,000 customers worldwide, and all major cloud providers.

    OpenVPN's products provide scalable, secure remote access — giving complete freedom to your employees to work outside the office while securely accessing SaaS, the internet, and company resources.
    Get started — no credit card required.
  • 10
    vimwiki

    vimwiki

    Personal Wiki for Vim

    Vimwiki is a personal wiki for Vim, interlinked, plain text files written in a markup language. Organize notes and ideas and quickly create links between them, manage todo-lists, and write a diary. VimWiki is a personal wiki for Vim, a number of linked text files that have their own syntax highlighting. See the VimWiki Wiki for an example website built with VimWiki! Three markup syntaxes supported, Vimwiki's own syntax, Markdown, MediaWiki. Export everything to HTML, link to other wiki pages...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    crawley

    crawley

    The unix-way web crawler

    Crawls web pages and prints any link it can find. Fast HTML SAX-parser (powered by golang.org/x/net/html) Small (below 1500 SLOC), idiomatic, 100% test-covered codebase. Grabs most of useful resources URLs (pics, videos, audios, forms, etc...) Found URLs are streamed to stdout and guaranteed to be unique (with fragments omitted) Scan depth (limited by starting host and path, by default - 0) can be configured. Can crawl rules and sitemaps from robots.txt. Brute mode - scan HTML comments for URLs...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Liquid (vscode)

    Liquid (vscode)

    Liquid language support for VS Code

    The essential vscode extension for Liquid (template language). Supports formatting, tag, filter, object and schema auto-completions, hovers, syntax highlighting, diagnostic capabilities, and respects HTML Intellisense features. Liquid Template Literal syntax highlighting support for TypeScript and JavaScript. Hover descriptions and reference links on tags, filters, objects, and more! Supports Liquid embedded code block syntax highlighting. Embedded JSON Schema Tag language completions...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Remarkable for Linux

    Remarkable for Linux

    The Markdown Editor for Linux

    With Live Preview you can see your changes as you make them. There is no need to export first to check your syntax. This is accompanied by synchronized scrolling. Remarkable has Github Flavoured Markdown. This has a simple, easy-to-learn syntax with features like checklists, highlighting, links, images and more. Remarkable allows you to export your files to PDF and HTML from within the app. The HTML code is even prettified and PDFs have a TOC. You can style your markdown documents however you...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    go-wkhtmltopdf

    go-wkhtmltopdf

    Golang commandline wrapper for wkhtmltopdf

    ... page and TOC options for each page. It makes usage in server-type applications easier, every instance (PDF process) has its own output buffer which contains the PDF output and you can feed one input document from an io.Reader (using stdin in wkhtmltopdf). You can combine any number of external HTML documents (HTTP(S) links) with at most one HTML document from stdin and set options for each input document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Trafilatura

    Trafilatura

    Python & command-line tool to gather text on the Web

    Trafilatura is a Python package and command-line tool designed to gather text on the Web. It includes discovery, extraction and text-processing components. Its main applications are web crawling, downloads, scraping, and extraction of main texts, metadata and comments. It aims at staying handy and modular: no database is required, the output can be converted to various commonly used formats. Going from raw HTML to essential parts can alleviate many problems related to text quality, first...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    PHPScraper

    PHPScraper

    A universal web-util for PHP

    PHPScraper is a universal web-scraping util for PHP, built with simplicity in mind. The goal is to make xPath Selectors optional and avoid the commonly needed boilerplate code. Just create an instance of PHPScraper, go to a website, and start collecting data. All scraping functionality can be accessed either as a function call or a property call. For example, the title can be accessed in two ways. Many common use cases are covered already. You can find prepared extractors for various HTML tags...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    HTML Quiz Application With Timer & Point

    HTML Quiz Application With Timer & Point

    An Easy to Edit HTML, CSS & JAVASCRIPT QUIZ - For Students & Teachers

    ... files. If necessary, Please change the charset="UTF-8" to charset="Your Charset" to match your language's charset value in the index.html file. Check Google if you need to know your language's html charset value. This Application works offline, without needing an Internet Connection. Also, run this application on your internet browsers in Kiosk mode, for maximum secrecy. For details, please refer : https://ostechnix.com/launch-web-browsers-in-kiosk-mode-full-screen-from-cli/ See Below.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 18
    WFDownloader App

    WFDownloader App

    Free batch downloader for image, wallpaper, video, audio, document,

    Use as an image gallery, wallpaper, audio/music, video, document, and other media bulk downloader from supported websites. Also use to download sequential website urls that have a certain pattern (e.g. image01.png to image100.png). Also use app's built-in site crawler for advanced link search or extraction. There is also special support for forum media and open directory downloading. It's a programmable downloader and also works with password protected sites. Say goodbye to downloading one...
    Leader badge
    Downloads: 120 This Week
    Last Update:
    See Project
  • 19
    Command Output to Html Chart for Website

    Command Output to Html Chart for Website

    A one liner to convert any file or a command output into a html chart

    Please download the above zip file, extract it and then view the README files for a simple one liner command for the chart creation, using your Terminal or Command Prompt. NOTE: For viewing the chart files, use one of these Supported Internet Browsers: Firefox / Google Chrome / Safari / MS Edge etc. ( Preferably their latest versions ) Not Supported are : MS IE ( upto 11 ), PaleMoon etc. As the Windows Version of this script was occasionally generating a zero byte file, because...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    BlogHelper

    BlogHelper

    Helps user write blog content and entries

    A tray assistant that helps domestic users to write, publish local articles to mainstream blog platforms (Zhihu, Jianshu, Blog Park, CSDN, SegmentFault, Nuggets, Open Source China) with one click, upload clipboard pictures to Tubed (Sina, Github, Picture Shell, Tencent Cloud, Alibaba Cloud, Youpai Cloud, Qiniu Cloud). A little assistant that does not have any interface and is only stored in the system tray, to help more people write better! One-click publishing of local articles to Zhihu,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Ant Movie Catalog

    Ant Movie Catalog

    Free program made to manage your collection of movies

    Manage your movie collection: User interface translated in about 30 languages. Can store catalog in two file format: its own binary format or XML. Import information from Internet (using scripts); by default it includes scripts for IMDB (US), DVDFR (FR), Allociné (FR) and lots of others. User-customizable links to do a search on movie websites. Information importation from various media files (audio & video codec, bitrates, resolution, framerate, size). Scripting technology, using Object...
    Downloads: 71 This Week
    Last Update:
    See Project
  • 22
    Wapiti

    Wapiti

    Wapiti is a web-application vulnerability scanner

    Wapiti is a vulnerability scanner for web applications. It currently search vulnerabilities like XSS, SQL and XPath injections, file inclusions, command execution, XXE injections, CRLF injections, Server Side Request Forgery, Open Redirects... It use the Python 3 programming language.
    Leader badge
    Downloads: 35 This Week
    Last Update:
    See Project
  • 23
    readpe

    readpe

    The PE file analysis toolkit

    readpe (formerly known as pev) is a multiplatform toolkit to work with PE (Portable Executable) binaries. Its main goal is to provide feature-rich tools for properly analyze binaries with a strong focus on suspicious ones.
    Leader badge
    Downloads: 23 This Week
    Last Update:
    See Project
  • 24
    Super PDF Editor Lite

    Super PDF Editor Lite

    Create, Edit, Delete, Organize , Convert, Export, Secure & Sign.

    Super PDF Editor Lite is a robust and versatile PDF management software designed to streamline your document handling needs. Whether you're an individual, student, or professional, this software offers a comprehensive suite of tools to create, edit, and manage your PDFs with ease. Key Features: Extract Page: Easily extract specific pages from a PDF document. Split Page: Divide a single PDF page into multiple smaller pages. Rotate Page: Rotate pages to adjust their orientation. Merge Page...
    Leader badge
    Downloads: 19 This Week
    Last Update:
    See Project
  • 25
    MyBox

    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next