Using the 1.6.0 release code.
Created a new crawl, added a mid-fetch filter of type
ContentTypeRegExpFilter. In Settings I entered the
regex for it. After running the crawl and noticing it
didn't seem to be working correctly I looked at the
crawl order file and the midfetch filters section
looked like this:
<map name="midfetch-filters">
<newObject name="html-only"
class="org.archive.crawler.filter.ContentTypeRegExpFilter">
<boolean name="enabled">true</boolean>
<boolean name="if-match-return">true</boolean>
<string name="regexp"/>
</newObject>
</map>
Seems like it's not saving the regex to the file.
Rob Eger
Local Matters, Inc.
Denver, CO
Nobody/Anonymous ( nobody ) - 2005-12-12 22:02
6
Closed
Fixed
Gordon Mohr
configuration
1.10.0
Public
|
Date: 2007-03-14 01:03
|
|
Date: 2006-08-21 22:26 Logged In: YES |
|
Date: 2006-04-18 22:34 Logged In: YES |
|
Date: 2006-04-16 13:15 Logged In: YES |
|
Date: 2006-04-16 13:12 Logged In: YES |
|
Date: 2006-04-13 10:11 Logged In: NO |
| Field | Old Value | Date | By |
|---|---|---|---|
| status_id | Open | 2006-08-21 22:26 | gojomo |
| resolution_id | None | 2006-08-21 22:26 | gojomo |
| artifact_group_id | None | 2006-08-21 22:26 | gojomo |
| close_date | - | 2006-08-21 22:26 | gojomo |
| priority | 7 | 2006-04-18 22:34 | gojomo |
| assigned_to | nobody | 2006-04-18 01:54 | gojomo |
| priority | 5 | 2006-04-18 01:54 | gojomo |
Copyright © 2010 Geeknet, Inc. All rights reserved. Terms of Use