Update of /cvsroot/archive-access/archive-access/src/docs/warc
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv6493
Modified Files:
warc_file_format.xml
Log Message:
tinkered with section 9.3 (GZIP WARC File Extension) for clarity
Index: warc_file_format.xml
===================================================================
RCS file: /cvsroot/archive-access/archive-access/src/docs/warc/warc_file_format.xml,v
retrieving revision 1.8
retrieving revision 1.9
diff -C2 -d -r1.8 -r1.9
*** warc_file_format.xml 24 Aug 2005 01:39:50 -0000 1.8
--- warc_file_format.xml 26 Aug 2005 22:29:40 -0000 1.9
***************
*** 945,956 ****
</section>
! <section title="GZIP WARC File Extension">
! <t>WARC files compressed with the above conventions remain legal GZIP
! files. Thus, to ensure they are properly recognized by GZIP tools, they
! should only get the customary additional ".gz" file extension suffix,
! making their suffix ".warc.gz". Software which works with WARC files
! compressed using these conventions will detect and exploit them; other
! GZIP software will harmlessly ignore the extensions.</t>
</section>
--- 945,956 ----
</section>
! <section title="GZIP WARC File Name Suffix">
! <t>A WARC file compressed with the extra GZIP field conventions described
! in this document is a legal GZIP file. To ensure that it is properly
! recognized by GZIP tools, its name should have the customary ".gz"
! appended to it, making the complete suffix, ".warc.gz".
! GZIP software that does not recognize the extra GZIP fields will
! simply pass over them without benefit or harm.</t>
</section>
|