HITCrawler Code
HITCralwer is a lightweight, easy to use cralwer
Status: Pre-Alpha
Brought to you by:
wangshilianghit
File | Date | Author | Commit |
---|---|---|---|
Linux | 2014-12-20 |
![]() |
[6e7853] Latest change on Dec 19 2014 |
Windows | 2014-12-11 |
![]() |
[727093] Latest version in Dec 11 2014 |
lib | 2014-11-08 |
![]() |
[379d1a] This is the first version of the project |
sourcecode | 2014-11-08 |
![]() |
[379d1a] This is the first version of the project |
Readme.txt | 2014-11-08 |
![]() |
[379d1a] This is the first version of the project |
Instructions: 1. Create a new project with all the .cpp files. 2. You can set the seeds of the cralwer by changing the first_url variable. Set the memory of the software by setting the size of the maxHeap. 3. Build the project and run it. The website will automatically store in the page folder. 4. You can check the current cralwer condition by open the time.txt, visited url.txt, unvisited url.txt.