From: John A. K. <joh...@us...> - 2005-08-26 22:29:48
Update of /cvsroot/archive-access/archive-access/src/docs/warc
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv6493

Modified Files:
    warc_file_format.xml
Log Message:
tinkered with section 9.3 (GZIP WARC File Extension) for clarity

Index: warc_file_format.xml
===================================================================
RCS file: /cvsroot/archive-access/archive-access/src/docs/warc/warc_file_format.xml,v
retrieving revision 1.8
retrieving revision 1.9
diff -C2 -d -r1.8 -r1.9
*** warc_file_format.xml 24 Aug 2005 01:39:50 -0000 1.8
--- warc_file_format.xml 26 Aug 2005 22:29:40 -0000 1.9
***************
*** 945,956 ****
  </section>
! <section title="GZIP WARC File Extension">
! <t>WARC files compressed with the above conventions remain legal GZIP
! files. Thus, to ensure they are properly recognized by GZIP tools, they
! should only get the customary additional ".gz" file extension suffix,
! making their suffix ".warc.gz". Software which works with WARC files
! compressed using these conventions will detect and exploit them; other
! GZIP software will harmlessly ignore the extensions.</t>
  </section>
--- 945,956 ----
  </section>
! <section title="GZIP WARC File Name Suffix">
! <t>A WARC file compressed with the extra GZIP field conventions described
! in this document is a legal GZIP file. To ensure that it is properly
! recognized by GZIP tools, its name should have the customary ".gz"
! appended to it, making the complete suffix, ".warc.gz".
! GZIP software that does not recognize the extra GZIP fields will
! simply pass over them without benefit or harm.</t>
  </section>
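
The claim in the revised paragraph, that GZIP software which does not recognize the extra GZIP fields simply passes over them, can be checked with a small Python sketch. It builds one GZIP member by hand following the RFC 1952 header layout, sets the FEXTRA flag, and attaches an extra field. The subfield ID "Zz", its contents, the sample payload, and the helper name gzip_member_with_extra are all made up for illustration; they are not the extra-field layout defined in warc_file_format.xml.

```python
import gzip
import struct
import zlib

FEXTRA = 0x04  # RFC 1952 header flag: an "extra field" follows the fixed header


def gzip_member_with_extra(payload: bytes, extra: bytes) -> bytes:
    """Build one GZIP member whose header carries the given extra field."""
    header = struct.pack(
        "<BBBBIBB",
        0x1F, 0x8B,  # GZIP magic bytes
        8,           # CM: deflate
        FEXTRA,      # FLG: only the extra-field bit is set
        0,           # MTIME: not recorded
        0,           # XFL: no extra flags
        255,         # OS: unknown
    )
    header += struct.pack("<H", len(extra)) + extra  # XLEN, then the field
    # Negative wbits yields a raw deflate stream with no zlib/gzip wrapper.
    compressor = zlib.compressobj(9, zlib.DEFLATED, -zlib.MAX_WBITS)
    body = compressor.compress(payload) + compressor.flush()
    trailer = struct.pack("<II", zlib.crc32(payload), len(payload) & 0xFFFFFFFF)
    return header + body + trailer


# Hypothetical subfield: 2-byte ID, 2-byte little-endian length, then data.
subfield_data = b"illustrative only"
extra_field = b"Zz" + struct.pack("<H", len(subfield_data)) + subfield_data

record = b"stand-in bytes for one WARC record\n"
member = gzip_member_with_extra(record, extra_field)

# A stock GZIP reader skips the extra field and recovers the payload.
assert gzip.decompress(member) == record
# Concatenated members are still one legal GZIP stream.
assert gzip.decompress(member + member) == record + record
```

Writing such bytes to a file whose name ends in ".warc.gz" should let ordinary GZIP tools open it, which is the behavior the paragraph describes.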