Share

Heritrix: Internet Archive Web Crawler

Tracker: Bugs

5 PDFParser URL extraction bug - ID: 1117916
Last Update: Comment added ( karl-ia )

In the PDFParser code is the following

// see if it's the key is a UR[I,L]
if( key.toString().equals("/URI") ||
key.toString().equals("/URI") ){

I assume it should be

// see if it's the key is a UR[I,L]
if( key.toString().equals("/URI") ||
key.toString().equals("/URL") ){

Notice that one is URI the other it URL.

Ben Litchfield


Ben Litchfield ( benlitchfield ) - 2005-02-07 15:09

5

Closed

Fixed

Nobody/Anonymous

Extraction

None

Public


Comments ( 2 )

Date: 2007-03-14 00:20
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-353 -- please add further
comments at that location.


Date: 2005-02-07 16:05
Sender: stack-sfProject Admin

Logged In: YES
user_id=924942

Thanks Ben. Patch applied to HEAD. Closing.


Attached File

No Files Currently Attached

Changes ( 3 )

Field Old Value Date By
status_id Open 2005-02-07 16:05 stack-sf
resolution_id None 2005-02-07 16:05 stack-sf
close_date - 2005-02-07 16:05 stack-sf