33 | $abstract not defined in Harvest/Reaper/Summarise/HTML.pm | None | open | | 2003-09-24 | 2003-09-24 | 5 | |
32 | REDIR of robots.txt leads to no fetching | None | open | | 2001-09-06 | 2001-09-06 | 5 | |
31 | "Can't locate object method 'host' via package..." | None | open | | 2000-12-12 | 2000-12-12 | 5 | |
30 | URLs rejected erroneously by "RobotsTxt" filter? | None | open | | 2000-12-11 | 2000-12-11 | 5 | |
29 | NG summariser: no check for ALWAYS: and ALWAYS:: modules | None | open | Simon Wilkinson | 2000-03-03 | 2000-03-07 | 5 | |
28 | Timeout wrappers on RunProg summarisers | None | open | | 2000-02-29 | 2000-02-29 | 5 | |
27 | Support for indexing password-protected sites | None | open | | 2000-02-29 | 2000-02-29 | 7 | |
26 | Multiple URL lists seem not to work in some situations | None | open | | 2000-02-29 | 2000-02-29 | 7 | |
21 | Provide URLLimit per host | None | open | | 2000-02-22 | 2000-02-22 | 5 | |
20 | Provide MD5 hashes on visible content, not entire page | None | open | | 2000-02-22 | 2000-02-22 | 5 | |
19 | Should indexing be breadth-first, rather than depth-first? | None | open | | 2000-02-22 | 2000-02-29 | 1 | |
18 | Debug should support grouping | None | open | | 2000-02-22 | 2000-02-22 | 5 | |
15 | "Page not found" pages clutter index | None | open | | 2000-02-21 | 2000-02-21 | 5 | |
11 | Should be possible to set a size limit on fetches | None | open | | 2000-02-18 | 2000-02-21 | 5 | |
10 | RunProg needs a way of setting file extensions | None | open | | 2000-02-17 | 2000-02-17 | 5 | |
9 | Workload buckets should be alternated | None | open | | 2000-02-17 | 2000-02-17 | 5 | |
7 | ClientServer code should be tidied | None | open | | 2000-02-17 | 2000-02-17 | 5 | |
3 | Allow header should be user-settable | None | open | | 2000-02-17 | 2000-02-17 | 5 | |
2 | Multiple RootNodes should be allowed | None | open | | 2000-02-17 | 2000-02-17 | 5 | |
1 | NewsHeaders doesn't add to workload | None | open | | 2000-02-17 | 2000-02-17 | 5 | |