From: Michael S. <sta...@us...> - 2005-10-06 19:14:59
Update of /cvsroot/archive-access/archive-access/projects/nutch/xdocs
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv19265/xdocs

Modified Files:
  faq.fml
Log Message:
* xdocs/faq.fml
  Note on querying for mimetypes.

Index: faq.fml
===================================================================
RCS file: /cvsroot/archive-access/archive-access/projects/nutch/xdocs/faq.fml,v
retrieving revision 1.8
retrieving revision 1.9
diff -C2 -d -r1.8 -r1.9
*** faq.fml 6 Oct 2005 17:38:25 -0000 1.8
--- faq.fml 6 Oct 2005 19:14:47 -0000 1.9
***************
*** 201,206 ****
  </faq>
  <faq id="datesort">
! <question>How to sort by date?
  </question>
--- 201,227 ----
  </faq>
+ <faq id="encoding">
+ <question>Why is encoding of non-ascii characters all messed up?
+ </question>
+ <answer>
+ <p>See <i>useBodyEncodingForURI</i> in the <a
+ href="http://jakarta.apache.org/tomcat/tomcat-5.5-doc/config/ajp.html">Tomcat Configuration Reference</a>. Edit <code>$TOMCAT_HOME/conf/server.xml</code>
+ and add <i>useBodyEncodingForURI=true</i>. Here is what it looks like
+ when edit has been added:
+ <pre><!-- Define a non-SSL HTTP/1.1 Connector on port 8080 -->
+ <Connector port="8080" maxHttpHeaderSize="8192"
+ maxThreads="150" minSpareThreads="25" maxSpareThreads="75"
+ enableLookups="false" redirectPort="8443" acceptCount="100"
+ connectionTimeout="20000" disableUploadTimeout="true"
+ useBodyEncodingForURI="true"
+ /></pre>
+ </p>
+ </answer>
+ </faq>
+ </part>
+ <part id="querying">
+ <title>Querying</title>
  <faq id="datesort">
! <question>How to sort results by date?
  </question>
***************
*** 218,229 ****
  </p></answer>
  </faq>
! <faq id="encoding">
! <question>Why is encoding of non-ascii characters all messed up?
! </question>
! <answer>
! <p>See <i>useBodyEncodingForURI</i> in the <a
! href="http://jakarta.apache.org/tomcat/tomcat-5.5-doc/config/ajp.html">Tomcat Configuration Reference</a>.</p>
! </answer>
! </faq>
! </part>
  </faqs>
--- 239,254 ----
  </p></answer>
  </faq>
! <faq>
! <question>How to query for mimetypes?
! </question>
! <answer>
! <p>Use <i>type</i> query field name. Nutchwax -- like nutch -- adds the
! mimetype, the primary type and subtype to a <i>type</i> field. This
! means that you can query for the mimetypes 'text/html' by querying
! <code>type:text/html</code>, or for primary type 'text' by
! querying <code>type:text</code>, or for subtype 'html' by querying
! <code>type:html</code>, etc.</p>
! </answer>
! </faq>
! </part>
  </faqs>
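
Illustration of the new mimetype FAQ entry: the sketch below models, in plain Python, why all three queries mentioned in the answer match a 'text/html' document. It is not Nutch or NutchWAX code. The function names type_field_terms and matches_type_query are hypothetical, and the exact-match comparison is an assumption about how the type field behaves, not something this commit states.

```python
def type_field_terms(mimetype):
    """Return the values the FAQ says are added to the 'type' field:
    the full mimetype, the primary type and the subtype."""
    primary, _, subtype = mimetype.partition("/")
    terms = [mimetype]
    if subtype:
        # Only split when the mimetype really has a "primary/subtype" form.
        terms += [primary, subtype]
    return terms


def matches_type_query(query, mimetype):
    """Hypothetical matcher for queries of the form 'type:<value>'.
    Assumes a plain exact match against the indexed terms."""
    field, _, value = query.partition(":")
    return field == "type" and value in type_field_terms(mimetype)


if __name__ == "__main__":
    for q in ("type:text/html", "type:text", "type:html", "type:pdf"):
        print(q, matches_type_query(q, "text/html"))
```

Under these assumptions, type:text/html, type:text and type:html all match a document whose mimetype is text/html, while type:pdf does not.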