Crawling this link http://www.msn.com/robots.txt
generates junk into the arc file. Its probably because
the pages is UTF-16
(<meta http-equiv="Content-Type" content="text/html;
charset=UTF-16">). I'd guess we're not respecting the
page encoding and are mangling it when we write to disk.
Michael Stack
Protocols
0.2.0
Public
|
Date: 2007-03-14 00:06
|
|
Date: 2004-02-18 21:41 Logged In: YES |
|
Date: 2004-02-03 19:18 Logged In: YES |
| Field | Old Value | Date | By |
|---|---|---|---|
| status_id | Open | 2004-02-18 21:41 | stack-sf |
| resolution_id | None | 2004-02-18 21:41 | stack-sf |
| close_date | - | 2004-02-18 21:41 | stack-sf |
| assigned_to | nobody | 2004-02-03 19:18 | gojomo |
Copyright © 2010 Geeknet, Inc. All rights reserved. Terms of Use