Showing 314 open source projects for "web archive extractor"

View related business solutions
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Archive Extractor
    To use this tool, you need to have WinRar installed at the path "C:\Program Files\WinRAR" (which is typically set by default). Alternatively, you can have 7z installed at the path "C:\Program Files\7-Zip" (this is usually set by default as well). Please note that if you only have 7z installed, you will not be able to extract .rar files, but only .zip or .7z files. This tool is primarily designed to extract files from password-protected Rar/Zip/7z archives, although it also works on...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Article Extractor

    Article Extractor

    To extract main article from given URL with Node.js

    A Node.js library for extracting main content from web articles, removing unnecessary clutter like ads and navigation elements.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    Anna’s Archive

    Anna’s Archive

    Comprehensive search engine for books, papers, comics, magazines

    Anna’s Archive is a large-scale open-source search engine and data aggregation platform designed to index and provide access to a vast collection of books, academic papers, comics, magazines, and other digital texts through a unified interface. The project includes all the infrastructure required to run a full instance locally or in production, combining web servers, databases, and search indexing systems into a scalable architecture.
    Downloads: 105 This Week
    Last Update:
    See Project
  • 4
    Game Extractor

    Game Extractor

    Open and edit the archive files used in over 5700 games.

    Reads and writes archives used in many popular games. Great for mod development and game translations. Supports over 5700 games, with more games added through small plugins. Also shows file previews and converts them to different formats.
    Leader badge
    Downloads: 310 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 5
    Web Archives

    Web Archives

    Browser extension for viewing archived and cached versions of websites

    Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari. Web Archives is a browser extension that enables you to find archived and cached versions of web pages, and comes with support for more than 10 search engines. Searches can be initiated from the context menu and the browser toolbar. A diverse set of archive and cache sources are supported, which can be toggled and reordered from the extension's options. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Flashpoint Launcher

    Flashpoint Launcher

    Launcher for Flashpoint Archive

    The launcher for Flashpoint Archive (the non-profit web preservation project). The Flashpoint Launcher (FPL) is a desktop application made for browsing, storing, and launching other applications (games, animations, web apps etc.). It is specifically made for Flashpoint Archive, a non-profit with the goal of furthering archival and accessibility efforts for games, animations and other digital interactive experiences on the web.
    Downloads: 37 This Week
    Last Update:
    See Project
  • 7
    makeself

    makeself

    A self-extracting archiving tool for Unix systems

    A self-extracting archiving tool for Unix systems, in 100% shell script. makeself.sh is a small shell script that generates a self-extractable compressed tar archive from a directory. The resulting file appears as a shell script (many of those have a .run suffix), and can be launched as is. The archive will then uncompress itself to a temporary directory and an optional arbitrary command will be executed (for example an installation script). This is pretty similar to archives generated with WinZip Self-Extractor in the Windows world. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender : https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Google CTF

    Google CTF

    Google CTF

    Google CTF is the public repository that houses most of the challenges from Google’s Capture-the-Flag competitions since 2017 and the infrastructure used to run them. It’s a learning and practice archive: competitors and educators can replay tasks across categories like pwn, reversing, crypto, web, sandboxing, and forensics. The code and binaries intentionally contain vulnerabilities—by design—so users can explore exploit chains and patching in realistic settings. The repo also includes infrastructure components and links to a scoreboard implementation, giving organizers reference material for hosting their own events. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    PHP Desktop

    PHP Desktop

    Develop desktop GUI applications using PHP, HTML5, JavaScript

    ...The process of turning an existing website into a desktop application is basically a matter of copying it to the directory. In a certain sense phpdesktop acts as a PHP to EXE compiler. It embeds a web browser, a multi-threaded web server and a PHP interpreter. All embedded into a single application, a portable folder that you can easily distribute to end-users by packing it to zip archive or by making an installer for your application. The web server embedded is a custom edition of Mongoose.
    Downloads: 150 This Week
    Last Update:
    See Project
  • 11
    LinkAce

    LinkAce

    LinkAce is a self-hosted archive to collect links of your favorite web

    LinkAce is a self-hosted archive to collect links of your favorite websites. Save articles to read them later, tools to use in your next project, or historical content to archive it for the long term. LinkAce comes with a lot of features while keeping a clean and minimal interface. It provides a long-term archive to store links to websites, media files or anything else with a valid URL. The user is able to categorize the added links to be able to find them later, and share lists of links...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Wayback

    Wayback

    An archiving tool with an IM-style interface that prioritizes privacy

    ...Wayback is an open-source web archiving application written in Go. With a modular and customizable architecture, it is designed to be flexible and adaptable to various use cases and environments. It provides support for multiple storage backends and integration with other services. Whether you need to archive a single web page or a large collection of web sites, Wayback can help you capture and preserve web content for posterity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Media Extractor

    Google Chrome extension designed to intercept and download media files

    # Media Extractor Media Extractor is a Google Chrome extension designed to intercept and download media files directly from websites. The extension monitors network requests in the browser and allows users to download detected media files such as video and audio streams. ## Key Features - Intercepts media files from web pages - Supports video and audio downloads - Works directly inside Google Chrome - Simple and intuitive interface - No external tools required ## Use Cases - Downloading embedded video content - Saving audio streams from websites - Analyzing media network requests - Offline media access ## Download - Chrome Web Store (if published): https://github.com/exxellengames/Media-Extractor/releases ## Official Website - EN: https://exxellengames.great-site.net/en/ - RU: https://exxellengames.great-site.net/ru/ ## Author exxellengames
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    LANraragi

    LANraragi

    Web application for archival and reading of manga/doujinshi

    LANraragi is a web application for archiving and reading manga/doujinshi. Lightweight and Docker-ready, it's designed for NAS and server environments, providing a user-friendly interface for managing comic collections.​
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    SQLite JDBC Driver

    SQLite JDBC Driver

    SQLite JDBC Driver

    SQLite JDBC is a library for accessing and creating SQLite database files in Java. Our SQLiteJDBC library requires no configuration since native libraries for major OSs, including Windows, Mac OS X, Linux etc., are assembled into a single JAR (Java Archive) file. The usage is quite simple; download our sqlite-jdbc library, then append the library (JAR file) to your classpath. SQLite JDBC is a library for accessing SQLite databases through the JDBC API. SQLite supports on-memory database...
    Downloads: 459 This Week
    Last Update:
    See Project
  • 16
    Emupedia

    Emupedia

    The purpose of Emupedia is to serve as a nonprofit meta-resource

    Emupedia is an ambitious open-source web platform and preservation initiative that aims to archive, emulate, and make accessible legacy software, games, and operating systems through a browser-based interface. Rather than focusing on a single emulator, it acts as a meta-layer that integrates multiple emulation technologies into a cohesive virtual desktop experience. The project recreates entire operating system environments, such as Windows 95 and Windows 98, within the browser, allowing users to interact with software as if they were using original hardware. ...
    Downloads: 35 This Week
    Last Update:
    See Project
  • 17
    sperm

    sperm

    Collection of reverse engineering articles curated for learning

    sperm is a curated repository that gathers a collection of notable articles related to reverse engineering and software analysis. It primarily acts as a knowledge archive where previously published technical posts are compiled and organized for easier access and long-term reference. These articles originate from multiple technical communities and platforms and are exported into Markdown format to maintain a consistent and readable structure. sperm focuses on educational material that...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    claude-code-transcripts

    claude-code-transcripts

    Tools for publishing transcripts for Claude Code sessions

    claude-code-transcripts is a command-line utility that takes session files exported from Claude Code (in JSON or JSONL format) and turns them into clean, navigable HTML transcripts that can be viewed in any modern web browser. It is designed to make the often dense and verbose outputs from AI coding sessions easier to read, share, and archive by breaking conversations into paginated, annotated pages with navigable timelines of prompts and responses. Users can run this tool locally or fetch sessions from the Claude API, giving flexibility for individual workflows or team documentation practices. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Trafilatura

    Trafilatura

    Python & command-line tool to gather text on the Web

    ...The extractor tries to strike a balance between limiting noise (precision) and including all valid parts (recall). It also has to be robust and reasonably fast, it runs in production on millions of documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    yt-dlp

    yt-dlp

    A youtube-dl fork with additional features and fixes

    yt-dlp is a youtube-dl fork based on the now inactive youtube-dlc. The main focus of this project is adding new features and patches while also keeping up to date with the original project
    Downloads: 706 This Week
    Last Update:
    See Project
  • 21
    bilibili-manga-downloader

    bilibili-manga-downloader

    Download and manage Bilibili Manga chapters with GUI downloader

    ...It also offers multiple output formats, allowing chapters to be saved as image folders or compressed comic archive formats suitable for local manga readers.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 22
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    ...Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data. It can be used to save copies of bookmarks, preserve evidence for legal cases, backup photos from FB/Insta/Flickr or media from YT/Soundcloud/etc., save research papers, and more. ArchiveBox is an open-source, self-hosted web archiving tool for saving websites offline. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    scrawler

    scrawler

    Desktop tool for downloading media from many social platforms

    SCrawler is a desktop application designed to download media content from a wide range of online platforms and social media services. It allows users to add profiles, channels, or posts and automatically collect images, videos, and other media associated with them. It provides tools for organizing downloaded content locally, including feeds, profile folders, and customizable file naming rules. SCrawler includes advanced configuration options that allow users to control download behavior,...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 24
    Snap Lens File Extractor

    Snap Lens File Extractor

    Online file extractor for the Snapcha lens file format

    Web Browser based JavaScript Online File Extractor, Parser, Unpacker and Zip File Converter. Reads and unpacks the Snap Camera / Snapchat Lens File Format (lens.lns / *.lns) Snap Lens Tool https://snap-lens-tool.sourceforge.io Snap Lens File Format https://snap-lens-file-format.sourceforge.io
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    nb

    nb

    CLI and local web plain text note‑taking, bookmarking, and archiving

    nb is a powerful, flexible, and self-contained note-taking and knowledge management application that empowers users to write, organize, search, and archive plain-text notes, bookmarks, and tasks from the command line or through a local web server. Designed to be portable and vendor-independent, it stores all content as plain text files with optional encryption, which means notes can be versioned, diffed, and synced with Git just like code, giving users fine-grained control over their personal information. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB