Share

Heritrix: Internet Archive Web Crawler

Tracker: Bugs

2 converting URI's '\' into '/' character - ID: 913214
Last Update: Comment added ( karl-ia )

Since IE converts '\' to '/' we should probably apply
this conversion on discovered URIs.

Maybe we should crawl both versions of these URIs (one
with '\' converted to '/' and the other with '\'
converted to '%5C') since they are both valid within
the browsers' realms.


Igor Ranitovic ( ia_igor ) - 2004-03-10 02:40

2

Closed

Fixed

Igor Ranitovic

Extraction

0.8.0

Public


Comments ( 2 )

Date: 2007-03-14 00:08
Sender: karl-ia


This issue is now discussed in the new JIRA tracker at
http://webteam.archive.org/jira/browse/HER-90 -- please add further
comments at that location.


Date: 2004-04-05 20:06
Sender: ia_igorProject Admin

Logged In: YES
user_id=715474

I added a fix that only converts backslashes to slashes.
Since URLs with backslashes usually only work with IE and
since IE does the same thing I thought that is right thing
to do.


Attached File

No Files Currently Attached

Changes ( 5 )

Field Old Value Date By
status_id Open 2004-04-05 20:06 ia_igor
resolution_id None 2004-04-05 20:06 ia_igor
close_date - 2004-04-05 20:06 ia_igor
artifact_group_id None 2004-03-31 01:12 gojomo
assigned_to nobody 2004-03-31 01:08 gojomo