Share

Heritrix: Internet Archive Web Crawler

Tracker: Bugs

7 [UURI] >2047 AFTER escaping (Stops crawl) - ID: 1033657
Last Update: Comment added ( karl-ia )

A uuri can be >2047 AFTER its been escaped. This can
be awkward when we later try to use the string version
of such an url to produce a new url -- we'll get a
>2047 in a place that can be awkward to deal with
(Asking CrawlURI to make a name for a new queue; means
item doesn't get queued).


Michael Stack ( stack-sf ) - 2004-09-23 20:58

7

Closed

Fixed

Michael Stack

uri

None

Public


Comments ( 5 )

Date: 2007-03-14 00:16
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-249 -- please add further
comments at that location.


Date: 2004-09-23 21:52
Sender: stack-sfProject Admin

Logged In: YES
user_id=924942

Fixed. Added check after escaping.


Date: 2004-09-23 21:52
Sender: stack-sfProject Admin

Logged In: YES
user_id=924942

Fixed. Added check after escaping.


Date: 2004-09-23 21:00
Sender: stack-sfProject Admin

Logged In: YES
user_id=924942

Fixed. Added check after escaping.


Date: 2004-09-23 21:00
Sender: stack-sfProject Admin

Logged In: YES
user_id=924942

Fixed. Added check after escaping.


Attached File

No Files Currently Attached

Changes ( 3 )

Field Old Value Date By
status_id Open 2004-09-23 21:00 stack-sf
resolution_id None 2004-09-23 21:00 stack-sf
close_date - 2004-09-23 21:00 stack-sf