Showing 389 open source projects for "web archive extractor"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 1
    Archive Extractor
    To use this tool, you need to have WinRar installed at the path "C:\Program Files\WinRAR" (which is typically set by default). Alternatively, you can have 7z installed at the path "C:\Program Files\7-Zip" (this is usually set by default as well). Please note that if you only have 7z installed, you will not be able to extract .rar files, but only .zip or .7z files. This tool is primarily designed to extract files from password-protected Rar/Zip/7z archives, although it also works on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Article Extractor

    Article Extractor

    To extract main article from given URL with Node.js

    A Node.js library for extracting main content from web articles, removing unnecessary clutter like ads and navigation elements.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    Anna’s Archive

    Anna’s Archive

    Comprehensive search engine for books, papers, comics, magazines

    Anna’s Archive is a large-scale open-source search engine and data aggregation platform designed to index and provide access to a vast collection of books, academic papers, comics, magazines, and other digital texts through a unified interface. The project includes all the infrastructure required to run a full instance locally or in production, combining web servers, databases, and search indexing systems into a scalable architecture.
    Downloads: 95 This Week
    Last Update:
    See Project
  • 4
    Game Extractor

    Game Extractor

    Open and edit the archive files used in over 5700 games.

    Reads and writes archives used in many popular games. Great for mod development and game translations. Supports over 5700 games, with more games added through small plugins. Also shows file previews and converts them to different formats.
    Leader badge
    Downloads: 306 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    Web Archives

    Web Archives

    Browser extension for viewing archived and cached versions of websites

    Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari. Web Archives is a browser extension that enables you to find archived and cached versions of web pages, and comes with support for more than 10 search engines. Searches can be initiated from the context menu and the browser toolbar. A diverse set of archive and cache sources are supported, which can be toggled and reordered from the extension's options. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Flashpoint Launcher

    Flashpoint Launcher

    Launcher for Flashpoint Archive

    The launcher for Flashpoint Archive (the non-profit web preservation project). The Flashpoint Launcher (FPL) is a desktop application made for browsing, storing, and launching other applications (games, animations, web apps etc.). It is specifically made for Flashpoint Archive, a non-profit with the goal of furthering archival and accessibility efforts for games, animations and other digital interactive experiences on the web.
    Downloads: 37 This Week
    Last Update:
    See Project
  • 7
    makeself

    makeself

    A self-extracting archiving tool for Unix systems

    A self-extracting archiving tool for Unix systems, in 100% shell script. makeself.sh is a small shell script that generates a self-extractable compressed tar archive from a directory. The resulting file appears as a shell script (many of those have a .run suffix), and can be launched as is. The archive will then uncompress itself to a temporary directory and an optional arbitrary command will be executed (for example an installation script). This is pretty similar to archives generated with WinZip Self-Extractor in the Windows world. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender : https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Google CTF

    Google CTF

    Google CTF

    Google CTF is the public repository that houses most of the challenges from Google’s Capture-the-Flag competitions since 2017 and the infrastructure used to run them. It’s a learning and practice archive: competitors and educators can replay tasks across categories like pwn, reversing, crypto, web, sandboxing, and forensics. The code and binaries intentionally contain vulnerabilities—by design—so users can explore exploit chains and patching in realistic settings. The repo also includes infrastructure components and links to a scoreboard implementation, giving organizers reference material for hosting their own events. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    PHP Desktop

    PHP Desktop

    Develop desktop GUI applications using PHP, HTML5, JavaScript

    ...The process of turning an existing website into a desktop application is basically a matter of copying it to the directory. In a certain sense phpdesktop acts as a PHP to EXE compiler. It embeds a web browser, a multi-threaded web server and a PHP interpreter. All embedded into a single application, a portable folder that you can easily distribute to end-users by packing it to zip archive or by making an installer for your application. The web server embedded is a custom edition of Mongoose.
    Downloads: 154 This Week
    Last Update:
    See Project
  • 11
    LinkAce

    LinkAce

    LinkAce is a self-hosted archive to collect links of your favorite web

    LinkAce is a self-hosted archive to collect links of your favorite websites. Save articles to read them later, tools to use in your next project, or historical content to archive it for the long term. LinkAce comes with a lot of features while keeping a clean and minimal interface. It provides a long-term archive to store links to websites, media files or anything else with a valid URL. The user is able to categorize the added links to be able to find them later, and share lists of links...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Wayback

    Wayback

    An archiving tool with an IM-style interface that prioritizes privacy

    ...Wayback is an open-source web archiving application written in Go. With a modular and customizable architecture, it is designed to be flexible and adaptable to various use cases and environments. It provides support for multiple storage backends and integration with other services. Whether you need to archive a single web page or a large collection of web sites, Wayback can help you capture and preserve web content for posterity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    LANraragi

    LANraragi

    Web application for archival and reading of manga/doujinshi

    LANraragi is a web application for archiving and reading manga/doujinshi. Lightweight and Docker-ready, it's designed for NAS and server environments, providing a user-friendly interface for managing comic collections.​
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14

    Media Extractor

    Google Chrome extension designed to intercept and download media files

    # Media Extractor Media Extractor is a Google Chrome extension designed to intercept and download media files directly from websites. The extension monitors network requests in the browser and allows users to download detected media files such as video and audio streams. ## Key Features - Intercepts media files from web pages - Supports video and audio downloads - Works directly inside Google Chrome - Simple and intuitive interface - No external tools required ## Use Cases - Downloading embedded video content - Saving audio streams from websites - Analyzing media network requests - Offline media access ## Download - Chrome Web Store (if published): https://github.com/exxellengames/Media-Extractor/releases ## Official Website - EN: https://exxellengames.great-site.net/en/ - RU: https://exxellengames.great-site.net/ru/ ## Author exxellengames
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    SQLite JDBC Driver

    SQLite JDBC Driver

    SQLite JDBC Driver

    SQLite JDBC is a library for accessing and creating SQLite database files in Java. Our SQLiteJDBC library requires no configuration since native libraries for major OSs, including Windows, Mac OS X, Linux etc., are assembled into a single JAR (Java Archive) file. The usage is quite simple; download our sqlite-jdbc library, then append the library (JAR file) to your classpath. SQLite JDBC is a library for accessing SQLite databases through the JDBC API. SQLite supports on-memory database...
    Downloads: 498 This Week
    Last Update:
    See Project
  • 16
    sperm

    sperm

    Collection of reverse engineering articles curated for learning

    sperm is a curated repository that gathers a collection of notable articles related to reverse engineering and software analysis. It primarily acts as a knowledge archive where previously published technical posts are compiled and organized for easier access and long-term reference. These articles originate from multiple technical communities and platforms and are exported into Markdown format to maintain a consistent and readable structure. sperm focuses on educational material that...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    claude-code-transcripts

    claude-code-transcripts

    Tools for publishing transcripts for Claude Code sessions

    claude-code-transcripts is a command-line utility that takes session files exported from Claude Code (in JSON or JSONL format) and turns them into clean, navigable HTML transcripts that can be viewed in any modern web browser. It is designed to make the often dense and verbose outputs from AI coding sessions easier to read, share, and archive by breaking conversations into paginated, annotated pages with navigable timelines of prompts and responses. Users can run this tool locally or fetch sessions from the Claude API, giving flexibility for individual workflows or team documentation practices. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    Emupedia

    Emupedia

    The purpose of Emupedia is to serve as a nonprofit meta-resource

    Emupedia is an ambitious open-source web platform and preservation initiative that aims to archive, emulate, and make accessible legacy software, games, and operating systems through a browser-based interface. Rather than focusing on a single emulator, it acts as a meta-layer that integrates multiple emulation technologies into a cohesive virtual desktop experience. The project recreates entire operating system environments, such as Windows 95 and Windows 98, within the browser, allowing users to interact with software as if they were using original hardware. ...
    Downloads: 31 This Week
    Last Update:
    See Project
  • 19
    Trafilatura

    Trafilatura

    Python & command-line tool to gather text on the Web

    ...The extractor tries to strike a balance between limiting noise (precision) and including all valid parts (recall). It also has to be robust and reasonably fast, it runs in production on millions of documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    newpipeextractor

    newpipeextractor

    Library for extracting streaming site data without official APIs

    ...It handles many low-level tasks involved in web data extraction, including parsing responses, managing platform-specific logic, and handling errors, allowing developers to focus on implementing application features rather than scraping mechanics. Each supported service is implemented through its own extractor components that conform to a common interface, enabling consistent access to data across different platforms.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    yt-dlp

    yt-dlp

    A youtube-dl fork with additional features and fixes

    yt-dlp is a youtube-dl fork based on the now inactive youtube-dlc. The main focus of this project is adding new features and patches while also keeping up to date with the original project
    Downloads: 698 This Week
    Last Update:
    See Project
  • 22
    bilibili-manga-downloader

    bilibili-manga-downloader

    Download and manage Bilibili Manga chapters with GUI downloader

    ...It also offers multiple output formats, allowing chapters to be saved as image folders or compressed comic archive formats suitable for local manga readers.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    AutoPkg

    AutoPkg

    Automating packaging and software distribution on macOS

    AutoPkg is a system that automatically prepares software for distribution to managed clients. Recipes allow you to specify a series of simple actions which combined together can perform complex tasks, similar to Automator workflows or Unix pipes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    ...Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data. It can be used to save copies of bookmarks, preserve evidence for legal cases, backup photos from FB/Insta/Flickr or media from YT/Soundcloud/etc., save research papers, and more. ArchiveBox is an open-source, self-hosted web archiving tool for saving websites offline. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    scrawler

    scrawler

    Desktop tool for downloading media from many social platforms

    SCrawler is a desktop application designed to download media content from a wide range of online platforms and social media services. It allows users to add profiles, channels, or posts and automatically collect images, videos, and other media associated with them. It provides tools for organizing downloaded content locally, including feeds, profile folders, and customizable file naming rules. SCrawler includes advanced configuration options that allow users to control download behavior,...
    Downloads: 13 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB