DocSedoter Icon

DocSedoter

Collecting documents over internet (doing web crawling)

Add a Review
1 Download (This Week)
Last Update:
Download DocSedoter-source.zip
Browse All Files

Description

DocSedoter is to download/collect the documents from the internet. For web documents (HTML/CSS), it also will download all resources (image, script, HTML, CSS etc) linked with them. The process will be done recursively. This process is also called as web crawling process. All links URI will be changed to the local path, so that the collected web documents can be navigated by offline.

This project consists of two sub projects. The first is the library that can be used by different applications whose different UI design. The second one is the example of application whose a simple UI which uses the library mentioned before.

This library provides the interface to set where the collected documents/resources will be saved in. Currently, this library only provides a class to saves those documents in the files. However, you may implement a class to save the documents/resources in the database. So, it will be like something used by a searching engine.

DocSedoter Web Site

Categories

Update Notifications





Write a Review

User Reviews

Be the first to post a review of DocSedoter!

Additional Project Details

Registered

2013-05-28
Screenshots can attract more users to your project.
Features can attract more users to your project.

Icons must be PNG, GIF, or JPEG and less than 1 MiB in size. They will be displayed as 48x48 images.