Showing 325 open source projects for "web archive extractor"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Article Extractor

    Article Extractor

    To extract main article from given URL with Node.js

    A Node.js library for extracting main content from web articles, removing unnecessary clutter like ads and navigation elements.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    Game Extractor

    Game Extractor

    Open and edit the archive files used in over 5700 games.

    Reads and writes archives used in many popular games. Great for mod development and game translations. Supports over 5700 games, with more games added through small plugins. Also shows file previews and converts them to different formats.
    Leader badge
    Downloads: 310 This Week
    Last Update:
    See Project
  • 3
    Web Archives

    Web Archives

    Browser extension for viewing archived and cached versions of websites

    Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari. Web Archives is a browser extension that enables you to find archived and cached versions of web pages, and comes with support for more than 10 search engines. Searches can be initiated from the context menu and the browser toolbar. A diverse set of archive and cache sources are supported, which can be toggled and reordered from the extension's options. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Flashpoint Launcher

    Flashpoint Launcher

    Launcher for Flashpoint Archive

    The launcher for Flashpoint Archive (the non-profit web preservation project). The Flashpoint Launcher (FPL) is a desktop application made for browsing, storing, and launching other applications (games, animations, web apps etc.). It is specifically made for Flashpoint Archive, a non-profit with the goal of furthering archival and accessibility efforts for games, animations and other digital interactive experiences on the web.
    Downloads: 37 This Week
    Last Update:
    See Project
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 5
    makeself

    makeself

    A self-extracting archiving tool for Unix systems

    A self-extracting archiving tool for Unix systems, in 100% shell script. makeself.sh is a small shell script that generates a self-extractable compressed tar archive from a directory. The resulting file appears as a shell script (many of those have a .run suffix), and can be launched as is. The archive will then uncompress itself to a temporary directory and an optional arbitrary command will be executed (for example an installation script). This is pretty similar to archives generated with WinZip Self-Extractor in the Windows world. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender : https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    Google CTF

    Google CTF

    Google CTF

    Google CTF is the public repository that houses most of the challenges from Google’s Capture-the-Flag competitions since 2017 and the infrastructure used to run them. It’s a learning and practice archive: competitors and educators can replay tasks across categories like pwn, reversing, crypto, web, sandboxing, and forensics. The code and binaries intentionally contain vulnerabilities—by design—so users can explore exploit chains and patching in realistic settings. The repo also includes infrastructure components and links to a scoreboard implementation, giving organizers reference material for hosting their own events. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    PHP Desktop

    PHP Desktop

    Develop desktop GUI applications using PHP, HTML5, JavaScript

    ...The process of turning an existing website into a desktop application is basically a matter of copying it to the directory. In a certain sense phpdesktop acts as a PHP to EXE compiler. It embeds a web browser, a multi-threaded web server and a PHP interpreter. All embedded into a single application, a portable folder that you can easily distribute to end-users by packing it to zip archive or by making an installer for your application. The web server embedded is a custom edition of Mongoose.
    Downloads: 150 This Week
    Last Update:
    See Project
  • 9
    LinkAce

    LinkAce

    LinkAce is a self-hosted archive to collect links of your favorite web

    LinkAce is a self-hosted archive to collect links of your favorite websites. Save articles to read them later, tools to use in your next project, or historical content to archive it for the long term. LinkAce comes with a lot of features while keeping a clean and minimal interface. It provides a long-term archive to store links to websites, media files or anything else with a valid URL. The user is able to categorize the added links to be able to find them later, and share lists of links...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    Wayback

    Wayback

    An archiving tool with an IM-style interface that prioritizes privacy

    ...Wayback is an open-source web archiving application written in Go. With a modular and customizable architecture, it is designed to be flexible and adaptable to various use cases and environments. It provides support for multiple storage backends and integration with other services. Whether you need to archive a single web page or a large collection of web sites, Wayback can help you capture and preserve web content for posterity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Media Extractor

    Google Chrome extension designed to intercept and download media files

    # Media Extractor Media Extractor is a Google Chrome extension designed to intercept and download media files directly from websites. The extension monitors network requests in the browser and allows users to download detected media files such as video and audio streams. ## Key Features - Intercepts media files from web pages - Supports video and audio downloads - Works directly inside Google Chrome - Simple and intuitive interface - No external tools required ## Use Cases - Downloading embedded video content - Saving audio streams from websites - Analyzing media network requests - Offline media access ## Download - Chrome Web Store (if published): https://github.com/exxellengames/Media-Extractor/releases ## Official Website - EN: https://exxellengames.great-site.net/en/ - RU: https://exxellengames.great-site.net/ru/ ## Author exxellengames
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    LANraragi

    LANraragi

    Web application for archival and reading of manga/doujinshi

    LANraragi is a web application for archiving and reading manga/doujinshi. Lightweight and Docker-ready, it's designed for NAS and server environments, providing a user-friendly interface for managing comic collections.​
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    SQLite JDBC Driver

    SQLite JDBC Driver

    SQLite JDBC Driver

    SQLite JDBC is a library for accessing and creating SQLite database files in Java. Our SQLiteJDBC library requires no configuration since native libraries for major OSs, including Windows, Mac OS X, Linux etc., are assembled into a single JAR (Java Archive) file. The usage is quite simple; download our sqlite-jdbc library, then append the library (JAR file) to your classpath. SQLite JDBC is a library for accessing SQLite databases through the JDBC API. SQLite supports on-memory database...
    Downloads: 459 This Week
    Last Update:
    See Project
  • 14
    sperm

    sperm

    Collection of reverse engineering articles curated for learning

    sperm is a curated repository that gathers a collection of notable articles related to reverse engineering and software analysis. It primarily acts as a knowledge archive where previously published technical posts are compiled and organized for easier access and long-term reference. These articles originate from multiple technical communities and platforms and are exported into Markdown format to maintain a consistent and readable structure. sperm focuses on educational material that...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    claude-code-transcripts

    claude-code-transcripts

    Tools for publishing transcripts for Claude Code sessions

    claude-code-transcripts is a command-line utility that takes session files exported from Claude Code (in JSON or JSONL format) and turns them into clean, navigable HTML transcripts that can be viewed in any modern web browser. It is designed to make the often dense and verbose outputs from AI coding sessions easier to read, share, and archive by breaking conversations into paginated, annotated pages with navigable timelines of prompts and responses. Users can run this tool locally or fetch sessions from the Claude API, giving flexibility for individual workflows or team documentation practices. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Trafilatura

    Trafilatura

    Python & command-line tool to gather text on the Web

    ...The extractor tries to strike a balance between limiting noise (precision) and including all valid parts (recall). It also has to be robust and reasonably fast, it runs in production on millions of documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    yt-dlp

    yt-dlp

    A youtube-dl fork with additional features and fixes

    yt-dlp is a youtube-dl fork based on the now inactive youtube-dlc. The main focus of this project is adding new features and patches while also keeping up to date with the original project
    Downloads: 706 This Week
    Last Update:
    See Project
  • 18
    bilibili-manga-downloader

    bilibili-manga-downloader

    Download and manage Bilibili Manga chapters with GUI downloader

    ...It also offers multiple output formats, allowing chapters to be saved as image folders or compressed comic archive formats suitable for local manga readers.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 19
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    ...Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data. It can be used to save copies of bookmarks, preserve evidence for legal cases, backup photos from FB/Insta/Flickr or media from YT/Soundcloud/etc., save research papers, and more. ArchiveBox is an open-source, self-hosted web archiving tool for saving websites offline. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Snap Lens File Extractor

    Snap Lens File Extractor

    Online file extractor for the Snapcha lens file format

    Web Browser based JavaScript Online File Extractor, Parser, Unpacker and Zip File Converter. Reads and unpacks the Snap Camera / Snapchat Lens File Format (lens.lns / *.lns) Snap Lens Tool https://snap-lens-tool.sourceforge.io Snap Lens File Format https://snap-lens-file-format.sourceforge.io
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    SimpDL

    SimpDL

    A tool to scrape images from SimpCity

    SimpDL is an open-source media downloading tool designed to retrieve content from subscription-based or creator platforms, focusing on simplicity and ease of use. It enables users to download images, videos, and other media associated with specific creators or accounts, often through authenticated sessions. The project emphasizes a straightforward workflow where users provide login credentials or tokens, and the tool handles the retrieval and storage of content automatically. It is designed...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    ipwb

    ipwb

    A distributed and persistent archive replay system using IPFS

    ...An important aspect of archival replay systems is rewriting various resource references for proper memento reconstruction so that they are dereferenced properly from the archive from around the same datetime as of the root memento and not from the live site (in which case the resource might have changed or gone missing). Many archival replay systems perform server-side rewriting, but it has its limitations when URIs are generated using JavaScript.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    nb

    nb

    CLI and local web plain text note‑taking, bookmarking, and archiving

    nb is a powerful, flexible, and self-contained note-taking and knowledge management application that empowers users to write, organize, search, and archive plain-text notes, bookmarks, and tasks from the command line or through a local web server. Designed to be portable and vendor-independent, it stores all content as plain text files with optional encryption, which means notes can be versioned, diffed, and synced with Git just like code, giving users fine-grained control over their personal information. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Omnom

    Omnom

    A web content preservation service

    Omnom is a self-hosted content preservation and web bookmarking platform that lets individuals and communities save and archive web content in a way that reflects exactly what they saw in their browser at the moment of capture. Unlike simple bookmark lists, Omnom makes full page snapshots — including dynamic content — so that saved pages remain viewable even if the original goes offline or changes later.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Browser Logos

    Browser Logos

    High resolution web browser logos

    You may want these for a presentation, a blog post or for the site featuring your brand new awesome lightbox script (please no!). All logos are high resolution with transparent backgrounds. For logos of browsers that are currently active, check the src/ directory. For older logos, or logos of browsers that are no longer active, check the src/archive/ directory. There are a few options for getting the browser logos. Get individual logos or get all logos. All logos and trademarks are the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB