Offnet is an open source tool for mirroring web pages. It lets you manage several snapshots of web pages in order to retain the newest as well as some elder versions of a web page. It has also file based deduplication features to store content efficiently. The application comes with an integrated web server to navigate directly through your web browser.

Project goals:
- Web page downloads for less experienced users, including easy setup
- Project based page maintanance
- Not too plain functions that include also multiple snapshots per project
- Iterative, understandable and storage efficient data structure to enable more manual control over stored pages (meta files editable with Easy Folder Morpher)
- Retain archived files and query links as original, altering links only during query

Current status:
- Alpha stadium, archivation quality below Heritrix

Project Activity

See All Activity >

Follow Offnet

Offnet Web Site

Other Useful Business Software
Train ML Models With SQL You Already Know Icon
Train ML Models With SQL You Already Know

BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Offnet!

Additional Project Details

Registered

2015-06-06