This list is closed, nobody may subscribe to it.
From: <jer...@us...> - 2014-05-09 23:08:38
Revision: 8259
          http://sourceforge.net/p/bigdata/code/8259
Author:   jeremy_carroll
Date:     2014-05-09 23:08:34 +0000 (Fri, 09 May 2014)
Log Message:
-----------
javadoc changes

Modified Paths:
--------------
    branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java
    branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/DefaultAnalyzerFactory.java

Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java
===================================================================
--- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 22:39:19 UTC (rev 8258)
+++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 23:08:34 UTC (rev 8259)
@@ -66,6 +66,7 @@
  * Supported classes included all the natural language specific classes from Lucene, and also:
  * <ul>
  * <li>{@link PatternAnalyzer}
+ * <li>{@link TermCompletionAnalyzer}
  * <li>{@link KeywordAnalyzer}
  * <li>{@link SimpleAnalyzer}
  * <li>{@link StopAnalyzer}
@@ -76,7 +77,6 @@
  * <ul>
  * <li>no arguments
  * <li>{@link Version}
- * <li>{@link Set} (of strings, the stop words)
  * <li>{@link Version}, {@link Set}
  * </ul>
  * is usable. If the class has a static method named <code>getDefaultStopSet()</code> then this is assumed
@@ -89,10 +89,6 @@
  * abbreviate to <code>c.b.s.C</code> in this documentation.
  * Properties from {@link Options} apply to the factory.
  * <p>
- *
- * If there are no such properties at all then the property {@link Options#NATURAL_LANGUAGE_SUPPORT} is set to true,
- * and the behavior of this class is the same as the legacy {@link DefaultAnalyzerFactory}.
- * <p>
  * Other properties, from {@link AnalyzerOptions} start with
  * <code>c.b.s.C.analyzer.<em>language-range</em></code> where <code><em>language-range</em></code> conforms
  * with the extended language range construct from RFC 4647, section 2.2.
@@ -103,7 +99,7 @@
  * If no analyzer is specified for the language range <code>*</code> then the {@link StandardAnalyzer} is used.
  * <p>
  * Given any specific language, then the analyzer matching the longest configured language range,
- * measured in number of subtags is used {@link #getAnalyzer(String, boolean)}
+ * measured in number of subtags is returned by {@link #getAnalyzer(String, boolean)}
  * In the event of a tie, the alphabetically first language range is used.
  * The algorithm to find a match is "Extended Filtering" as defined in section 3.3.2 of RFC 4647.
  * <p>
@@ -132,11 +128,11 @@
 /**
  * This is an implementation of RFC 4647 language range,
- * targetted at some of the context of bigdata, and only
+ * targetted at the specific needs within bigdata, and only
  * supporting the extended filtering specified in section 3.3.2
  * <p>
  * Language ranges are comparable so that
- * sorting an array and then matching a language tage against each
+ * sorting an array and then matching a language tag against each
  * member of the array in sequence will give the longest match.
  * i.e. the longer ranges come first.
  * @author jeremycarroll

Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/DefaultAnalyzerFactory.java
===================================================================
--- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/DefaultAnalyzerFactory.java 2014-05-09 22:39:19 UTC (rev 8258)
+++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/DefaultAnalyzerFactory.java 2014-05-09 23:08:34 UTC (rev 8259)
@@ -51,18 +51,15 @@
 import com.bigdata.btree.keys.KeyBuilder;

 /**
- * This is the default implementation but could be regarded as legacy since
+ * This is the default implementation but should be regarded as legacy since
  * it fails to use the correct {@link Analyzer} for almost all languages (other than
- * English). It uses the correct natural language analyzer for literals tagged with
+ * English). It uses the correct natural language analyzer only for literals tagged with
+ * certain three letter ISO 639 codes:
  * "por", "deu", "ger", "zho", "chi", "jpn", "kor", "ces", "cze", "dut", "nld", "gre", "ell",
- * "fra", "fre", "rus" and "tha".
- * This codes do not work if they are used with subtags, e.g. "ger-AT" is treated as English.
- * No two letter code works correctly: note that the W3C and
+ * "fra", "fre", "rus" and "tha". All other tags are treated as English.
+ * These codes do not work if they are used with subtags, e.g. "ger-AT" is treated as English.
+ * No two letter code, other than "en" works correctly: note that the W3C and
  * IETF recommend the use of the two letter forms instead of the three letter forms.
- * <p>
- * Default implementation registers a bunch of {@link Analyzer}s for various
- * language codes and then serves the appropriate {@link Analyzer} based on
- * the specified language code.
  *
  * @author <a href="mailto:tho...@us...">Bryan Thompson</a>
  * @deprecated Using {@link ConfigurableAnalyzerFactory} with

This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
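The javadoc in this commit repeatedly refers to "Extended Filtering" from RFC 4647, section 3.3.2, as the rule for matching a language tag against a configured language range. The following is a minimal, self-contained sketch of that matching algorithm; it is an illustration only, not the bigdata `LanguageRange` class, and the class and method names are invented for the example:

```java
import java.util.Locale;

public class ExtendedFilter {

    // Sketch of RFC 4647 section 3.3.2 "Extended Filtering":
    // does the language tag fall within the extended language range?
    public static boolean matches(String langTag, String langRange) {
        String[] tag = langTag.toLowerCase(Locale.ROOT).split("-");
        String[] range = langRange.toLowerCase(Locale.ROOT).split("-");
        // Step 2: the first subtags must match, unless the range begins with '*'.
        if (!range[0].equals("*") && !range[0].equals(tag[0])) {
            return false;
        }
        int r = 1, t = 1;
        while (r < range.length) {
            if (range[r].equals("*")) {
                r++;                       // '*' matches any sequence of subtags
            } else if (t >= tag.length) {
                return false;              // range has subtags left, the tag does not
            } else if (range[r].equals(tag[t])) {
                r++;                       // subtags match: advance both
                t++;
            } else if (tag[t].length() == 1) {
                return false;              // a singleton in the tag blocks further matching
            } else {
                t++;                       // skip a non-matching, non-singleton tag subtag
            }
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(matches("de-DE", "de-*-de"));      // true
        System.out.println(matches("de-Latn-DE", "de-*-de")); // true
        System.out.println(matches("de-x-DE", "de-*-de"));    // false: 'x' is a singleton
    }
}
```

The singleton check is what makes, e.g., `de-x-DE` fail to match `de-*-de`, mirroring the examples in the RFC.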
From: <jer...@us...> - 2014-05-09 22:39:23
Revision: 8258
          http://sourceforge.net/p/bigdata/code/8258
Author:   jeremy_carroll
Date:     2014-05-09 22:39:19 +0000 (Fri, 09 May 2014)
Log Message:
-----------
Documentation and formatting etc.

Modified Paths:
--------------
    branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestUnconfiguredAnalyzerFactory.java

Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java
===================================================================
--- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 22:39:10 UTC (rev 8257)
+++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 22:39:19 UTC (rev 8258)
@@ -95,8 +95,10 @@
  * <p>
  * Other properties, from {@link AnalyzerOptions} start with
  * <code>c.b.s.C.analyzer.<em>language-range</em></code> where <code><em>language-range</em></code> conforms
- * with the extended language range construct from RFC 4647, section 2.2. These are used to specify
- * an analyzer for the given language range.
+ * with the extended language range construct from RFC 4647, section 2.2.
+ * There is an issue that bigdata does not allow '*' in property names, and we use the character '_' to
+ * substitute for '*' in extended language ranges in property names.
+ * These are used to specify an analyzer for the given language range.
  * <p>
  * If no analyzer is specified for the language range <code>*</code> then the {@link StandardAnalyzer} is used.
* <p> @@ -113,6 +115,8 @@ * <dd>This uses whitespace to tokenize</dd> * <dt>{@link PatternAnalyzer}</dt> * <dd>This uses a regular expression to tokenize</dd> + * <dt>{@link TermCompletionAnalyzer}</dt> + * <dd>This uses up to three regular expressions to specify multiple tokens for each word, to address term completion use cases.</dd> * <dt>{@link EmptyAnalyzer}</dt> * <dd>This suppresses the functionality, by treating every expression as a stop word.</dd> * </dl> @@ -126,11 +130,26 @@ public class ConfigurableAnalyzerFactory implements IAnalyzerFactory { final private static transient Logger log = Logger.getLogger(ConfigurableAnalyzerFactory.class); - static class LanguageRange implements Comparable<LanguageRange> { + /** + * This is an implementation of RFC 4647 language range, + * targetted at some of the context of bigdata, and only + * supporting the extended filtering specified in section 3.3.2 + * <p> + * Language ranges are comparable so that + * sorting an array and then matching a language tage against each + * member of the array in sequence will give the longest match. + * i.e. the longer ranges come first. + * @author jeremycarroll + * + */ + public static class LanguageRange implements Comparable<LanguageRange> { private final String range[]; private final String full; - + /** + * Note range must be in lower case, this is not verified. + * @param range + */ public LanguageRange(String range) { this.range = range.split("-"); full = range; @@ -174,12 +193,22 @@ return full.hashCode(); } + /** + * This implements the algoirthm of section 3.3.2 of RFC 4647 + * as modified with the observation about private use tags + * in <a href="http://lists.w3.org/Archives/Public/www-international/2014AprJun/0084"> + * this message</a>. 
+ * + * + * @param langTag The RFC 5646 Language tag in lower case + * @return The result of the algorithm + */ public boolean extendedFilterMatch(String langTag) { return extendedFilterMatch(langTag.toLowerCase(Locale.ROOT).split("-")); } // See RFC 4647, 3.3.2 - public boolean extendedFilterMatch(String[] language) { + boolean extendedFilterMatch(String[] language) { // RFC 4647 step 2 if (!matchSubTag(language[0], range[0])) { return false; @@ -227,13 +256,14 @@ */ public interface Options { /** - * By setting this option to true, then the behavior of the legacy {@link DefaultAnalyzerFactory} - * is added, and may be overridden by the settings of the user. + * By setting this option to true, then all the known Lucene Analyzers for natural + * languages are used for a range of language tags. + * These settings may then be overridden by the settings of the user. * Specifically the following properties are loaded, prior to loading the * user's specification (with <code>c.b.s.C</code> expanding to * <code>com.bigdata.search.ConfigurableAnalyzerFactory</code>) <pre> -c.b.s.C.analyzer.*.like=eng +c.b.s.C.analyzer._.like=eng c.b.s.C.analyzer.por.analyzerClass=org.apache.lucene.analysis.br.BrazilianAnalyzer c.b.s.C.analyzer.pt.like=por c.b.s.C.analyzer.zho.analyzerClass=org.apache.lucene.analysis.cn.ChineseAnalyzer @@ -281,7 +311,9 @@ /** * If specified this is the fully qualified name of a subclass of {@link Analyzer} * that has appropriate constructors. - * Either this or {@link #LIKE} or {@link #PATTERN} must be specified for each language range. + * This is set implicitly if some of the options below are selected (for example {@link #PATTERN}). + * For each configured language range, if it is not set, either explicitly or implicitly, then + * {@link #LIKE} must be specified. 
*/ String ANALYZER_CLASS = "analyzerClass"; @@ -399,24 +431,64 @@ private static final String LUCENE_STANDARD_ANALYZER = "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.*.analyzerClass=org.apache.lucene.analysis.standard.StandardAnalyzer\n"; + /** + * This comment describes the implementation of {@link ConfigurableAnalyzerFactory}. + * The only method in the interface is {@link ConfigurableAnalyzerFactory#getAnalyzer(String, boolean)}, + * a map is used from language tag to {@link AnalyzerPair}, where the pair contains + * an {@link Analyzer} both with and without stopwords configured (some times these two analyzers are identical, + * if, for example, stop words are not supported or not required). + * <p> + * If there is no entry for the language tag in the map {@link ConfigurableAnalyzerFactory#langTag2AnalyzerPair}, + * then one is created, by walking down the array {@link ConfigurableAnalyzerFactory#config} of AnalyzerPairs + * until a matching one is found. + * <p> + * The bulk of the code in this class is invoked from the constructor in order to set up this + * {@link ConfigurableAnalyzerFactory#config} array. For example, all of the subclasses of {@link AnalyzerPair}s, + * are simply to call the appropriate constructor in the appropriate way: the difficulty is that many subclasses + * of {@link Analyzer} have constructors with different signatures, and our code needs to navigate each sort. + * @author jeremycarroll + * + */ private static class AnalyzerPair implements Comparable<AnalyzerPair>{ - private final LanguageRange range; + final LanguageRange range; private final Analyzer withStopWords; private final Analyzer withoutStopWords; + public Analyzer getAnalyzer(boolean filterStopwords) { + return filterStopwords ? 
withStopWords : withoutStopWords; + } + + public boolean extendedFilterMatch(String[] language) { + return range.extendedFilterMatch(language); + } + AnalyzerPair(String range, Analyzer withStopWords, Analyzer withOutStopWords) { this.range = new LanguageRange(range); this.withStopWords = withStopWords; this.withoutStopWords = withOutStopWords; } + /** + * This clone constructor implements {@link AnalyzerOptions#LIKE}. + * @param range + * @param copyMe + */ AnalyzerPair(String range, AnalyzerPair copyMe) { this.range = new LanguageRange(range); this.withStopWords = copyMe.withStopWords; this.withoutStopWords = copyMe.withoutStopWords; - } + /** + * If we have a constructor, with arguments including a populated + * stop word set, then we can use it to make both the withStopWords + * analyzer, and the withoutStopWords analyzer. + * @param range + * @param cons A Constructor including a {@link java.util.Set} argument + * for the stop words. + * @param params The arguments to pass to the constructor including a populated stopword set. + * @throws Exception + */ AnalyzerPair(String range, Constructor<? extends Analyzer> cons, Object ... params) throws Exception { this(range, cons.newInstance(params), cons.newInstance(useEmptyStopWordSet(params))); } @@ -435,9 +507,6 @@ return rslt; } - public Analyzer getAnalyzer(boolean filterStopwords) { - return filterStopwords ? withStopWords : withoutStopWords; - } @Override public String toString() { return range.full + "=(" + withStopWords.getClass().getSimpleName() +")"; @@ -447,30 +516,38 @@ public int compareTo(AnalyzerPair o) { return range.compareTo(o.range); } - - public boolean extendedFilterMatch(String[] language) { - return range.extendedFilterMatch(language); - } } + /** + * Used for Analyzer classes with a constructor with signature (Version, Set). + * @author jeremycarroll + * + */ private static class VersionSetAnalyzerPair extends AnalyzerPair { public VersionSetAnalyzerPair(ConfigOptionsToAnalyzer lro, Class<? 
extends Analyzer> cls) throws Exception { super(lro.languageRange, getConstructor(cls, Version.class, Set.class), Version.LUCENE_CURRENT, lro.getStopWords()); } } - + + /** + * Used for Analyzer classes which do not support stopwords and have a constructor with signature (Version). + * @author jeremycarroll + * + */ private static class VersionAnalyzerPair extends AnalyzerPair { - public VersionAnalyzerPair(String range, Class<? extends Analyzer> cls) throws Exception { super(range, getConstructor(cls, Version.class).newInstance(Version.LUCENE_CURRENT)); } } - + /** + * Special case code for {@link PatternAnalyzer} + * @author jeremycarroll + * + */ private static class PatternAnalyzerPair extends AnalyzerPair { - public PatternAnalyzerPair(ConfigOptionsToAnalyzer lro, Pattern pattern) throws Exception { super(lro.languageRange, getConstructor(PatternAnalyzer.class,Version.class,Pattern.class,Boolean.TYPE,Set.class), Version.LUCENE_CURRENT, @@ -485,6 +562,16 @@ * This class is initialized with the config options, using the {@link #setProperty(String, String)} * method, for a particular language range and works out which pair of {@link Analyzer}s * to use for that language range. + * <p> + * Instances of this class are only alive during the execution of + * {@link ConfigurableAnalyzerFactory#ConfigurableAnalyzerFactory(FullTextIndex)}, + * the life-cycle is: + * <ol> + * <li>The relveant config properties are applied, and are used to populate the fields. + * <li>The fields are validated + * <li>An {@link AnalyzerPair} is constructed + * </ol> + * * @author jeremycarroll * */ @@ -545,6 +632,10 @@ return ( stopwords == null && pattern == null ) || AnalyzerOptions.STOPWORDS_VALUE_DEFAULT.equals(stopwords); } + /** + * The first step in the life-cycle, used to initialize the fields. + * @return true if the property was recognized. 
+ */ public boolean setProperty(String shortProperty, String value) { if (shortProperty.equals(AnalyzerOptions.LIKE) ) { like = value; @@ -568,6 +659,9 @@ return true; } + /** + * The second phase of the life-cycle, used for sanity checking. + */ public void validate() { if (pattern != null ) { if ( className != null && className != PatternAnalyzer.class.getName()) { @@ -608,6 +702,10 @@ } + /** + * The third and final phase of the life-cyle used for identifying + * the AnalyzerPair. + */ private AnalyzerPair construct() throws Exception { if (className == null) { return null; @@ -660,6 +758,29 @@ throw new RuntimeException("Bad option: cannot find constructor for class " + className + " for language range " + languageRange); } + /** + * Also part of the third phase of the life-cycle, following the {@link AnalyzerOptions#LIKE} + * properties. + * @param depth + * @param max + * @param analyzers + * @return + */ + AnalyzerPair followLikesToAnalyzerPair(int depth, int max, + Map<String, ConfigOptionsToAnalyzer> analyzers) { + if (result == null) { + if (depth == max) { + throw new RuntimeException("Bad configuration: - 'like' loop for language range " + languageRange); + } + ConfigOptionsToAnalyzer next = analyzers.get(like); + if (next == null) { + throw new RuntimeException("Bad option: - 'like' not found for language range " + languageRange+ " (not found: '"+ like +"')"); + } + result = new AnalyzerPair(languageRange, next.followLikesToAnalyzerPair(depth+1, max, analyzers)); + } + return result; + } + protected Class<? 
extends Analyzer> getAnalyzerClass() { return getAnalyzerClass(className); } @@ -678,22 +799,6 @@ void setAnalyzerPair(AnalyzerPair ap) { result = ap; } - - AnalyzerPair followLikesToAnalyzerPair(int depth, int max, - Map<String, ConfigOptionsToAnalyzer> analyzers) { - if (result == null) { - if (depth == max) { - throw new RuntimeException("Bad configuration: - 'like' loop for language range " + languageRange); - } - ConfigOptionsToAnalyzer next = analyzers.get(like); - if (next == null) { - throw new RuntimeException("Bad option: - 'like' not found for language range " + languageRange+ " (not found: '"+ like +"')"); - } - result = new AnalyzerPair(languageRange, next.followLikesToAnalyzerPair(depth+1, max, analyzers)); - } - return result; - } - } private final AnalyzerPair config[]; @@ -712,7 +817,13 @@ private final FullTextIndex<?> fullTextIndex; + /** + * Builds a new ConfigurableAnalyzerFactory. + * @param fullTextIndex + */ public ConfigurableAnalyzerFactory(final FullTextIndex<?> fullTextIndex) { + // A description of the operation of this method is found on AnalyzerPair and + // ConfigOptionsToAnalyzer. // despite our name, we actually make all the analyzers now, and getAnalyzer method is merely a lookup. 
if (fullTextIndex == null) @@ -837,9 +948,18 @@ protected Properties initProperties() { final Properties parentProperties = fullTextIndex.getProperties(); Properties myProps; - if (Boolean.valueOf(parentProperties.getProperty(Options.NATURAL_LANGUAGE_SUPPORT, Options.DEFAULT_NATURAL_LAMGUAGE_SUPPORT))) { + if (Boolean.valueOf(parentProperties.getProperty( + Options.NATURAL_LANGUAGE_SUPPORT, + Options.DEFAULT_NATURAL_LAMGUAGE_SUPPORT))) { + myProps = loadPropertyString(ALL_LUCENE_NATURAL_LANGUAGES); + + } else if (hasPropertiesForStarLanguageRange(parentProperties)){ + + myProps = new Properties(); + } else { + myProps = loadPropertyString(LUCENE_STANDARD_ANALYZER); } @@ -867,6 +987,17 @@ } } + private boolean hasPropertiesForStarLanguageRange(Properties from) { + Enumeration<?> en = from.propertyNames(); + while (en.hasMoreElements()) { + String prop = (String)en.nextElement(); + if (prop.startsWith(Options.ANALYZER+"_.") + || prop.startsWith(Options.ANALYZER+"*.")) { + return true; + } + } + return false; + } @Override public Analyzer getAnalyzer(String languageCode, boolean filterStopwords) { Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java 2014-05-09 22:39:10 UTC (rev 8257) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java 2014-05-09 22:39:19 UTC (rev 8258) @@ -1,3 +1,29 @@ +/** + +Copyright (C) SYSTAP, LLC 2006-2014. All rights reserved. + +Contact: + SYSTAP, LLC + 4501 Tower Road + Greensboro, NC 27410 + lic...@bi... + +This program is free software; you can redistribute it and/or modify +it under the terms of the GNU General Public License as published by +the Free Software Foundation; version 2 of the License. 
+ +This program is distributed in the hope that it will be useful, +but WITHOUT ANY WARRANTY; without even the implied warranty of +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +GNU General Public License for more details. + +You should have received a copy of the GNU General Public License +along with this program; if not, write to the Free Software +Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA +*/ +/* + * Created on May 9, 2014 + */ package com.bigdata.search; public abstract class AbstractAnalyzerFactoryTest extends AbstractSearchTest { Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 22:39:10 UTC (rev 8257) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 22:39:19 UTC (rev 8258) @@ -59,7 +59,8 @@ String analyzer = ConfigurableAnalyzerFactory.Options.ANALYZER; return new String[]{ FullTextIndex.Options.ANALYZER_FACTORY_CLASS, ConfigurableAnalyzerFactory.class.getName(), - analyzer+"*."+AnalyzerOptions.ANALYZER_CLASS, EmptyAnalyzer.class.getName(), + analyzer+"_."+AnalyzerOptions.LIKE, "x-empty", + analyzer+"x-empty."+AnalyzerOptions.ANALYZER_CLASS, EmptyAnalyzer.class.getName(), analyzer+"x-terms."+AnalyzerOptions.PATTERN, "\\W+", analyzer+"x-splits."+AnalyzerOptions.ANALYZER_CLASS, TermCompletionAnalyzer.class.getName(), analyzer+"x-splits."+AnalyzerOptions.STOPWORDS, AnalyzerOptions.STOPWORDS_VALUE_NONE, Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestUnconfiguredAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestUnconfiguredAnalyzerFactory.java 2014-05-09 22:39:10 UTC (rev 8257) +++ 
branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestUnconfiguredAnalyzerFactory.java 2014-05-09 22:39:19 UTC (rev 8258) @@ -1,3 +1,29 @@ +/** + +Copyright (C) SYSTAP, LLC 2006-2014. All rights reserved. + +Contact: + SYSTAP, LLC + 4501 Tower Road + Greensboro, NC 27410 + lic...@bi... + +This program is free software; you can redistribute it and/or modify +it under the terms of the GNU General Public License as published by +the Free Software Foundation; version 2 of the License. + +This program is distributed in the hope that it will be useful, +but WITHOUT ANY WARRANTY; without even the implied warranty of +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +GNU General Public License for more details. + +You should have received a copy of the GNU General Public License +along with this program; if not, write to the Free Software +Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA +*/ +/* + * Created on May 7, 2014 + */ package com.bigdata.search; public class TestUnconfiguredAnalyzerFactory extends AbstractAnalyzerFactoryTest { This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
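Revision 8258 above introduces a `_`-for-`*` substitution because bigdata property names cannot contain `*`, and rev 8257's `hasPropertiesForStarLanguageRange` method checks for either spelling of the catch-all language range. A stand-alone sketch of that check follows; the prefix constant is an assumption standing in for `Options.ANALYZER`, whose value is inferred from the property strings shown in the diffs:

```java
import java.util.Enumeration;
import java.util.Properties;

public class StarRangeCheck {

    // Assumed value of Options.ANALYZER, inferred from the property
    // strings in the commit; the real constant is not reproduced here.
    static final String ANALYZER =
            "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.";

    // True if the user configured the '*' language range, written either
    // literally or via its '_' substitute (bigdata property names may
    // not contain '*').
    static boolean hasPropertiesForStarLanguageRange(Properties from) {
        Enumeration<?> en = from.propertyNames();
        while (en.hasMoreElements()) {
            String prop = (String) en.nextElement();
            if (prop.startsWith(ANALYZER + "_.")
                    || prop.startsWith(ANALYZER + "*.")) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        Properties p = new Properties();
        System.out.println(hasPropertiesForStarLanguageRange(p)); // false
        // Configure the catch-all range using the '_' substitute, as in
        // the TestConfigurableAnalyzerFactory change above.
        p.setProperty(ANALYZER + "_.like", "x-empty");
        System.out.println(hasPropertiesForStarLanguageRange(p)); // true
    }
}
```

This is why the test setup in `TestConfigurableAnalyzerFactory` switches from `analyzer+"*."` to `analyzer+"_."` property keys.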
From: <jer...@us...> - 2014-05-09 22:39:13
Revision: 8257
          http://sourceforge.net/p/bigdata/code/8257
Author:   jeremy_carroll
Date:     2014-05-09 22:39:10 +0000 (Fri, 09 May 2014)
Log Message:
-----------
Added extra test to check that by default we use StandardAnalyzer for everything; refactored a bit

Modified Paths:
--------------
    branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestAll.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAsDefaultAnalyzerFactory.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestDefaultAnalyzerFactory.java

Added Paths:
-----------
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractDefaultAnalyzerFactoryTest.java
    branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestUnconfiguredAnalyzerFactory.java

Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java
===================================================================
--- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 19:07:09 UTC (rev 8256)
+++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 22:39:10 UTC (rev 8257)
@@ -366,7 +366,7 @@
 }

-    private static final String DEFAULT_PROPERTIES =
+    private static final String ALL_LUCENE_NATURAL_LANGUAGES =
         "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.*.like=eng\n" +
         "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.por.analyzerClass=org.apache.lucene.analysis.br.BrazilianAnalyzer\n" +
         "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.pt.like=por\n" +
         "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.zho.analyzerClass=org.apache.lucene.analysis.cn.ChineseAnalyzer\n" +
@@
-396,6 +396,9 @@ "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.eng.analyzerClass=org.apache.lucene.analysis.standard.StandardAnalyzer\n" + "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.en.like=eng\n"; + private static final String LUCENE_STANDARD_ANALYZER = + "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.*.analyzerClass=org.apache.lucene.analysis.standard.StandardAnalyzer\n"; + private static class AnalyzerPair implements Comparable<AnalyzerPair>{ private final LanguageRange range; private final Analyzer withStopWords; @@ -703,6 +706,7 @@ * strategy so the code will still work on the {@link #MAX_LANG_CACHE_SIZE}+1 th entry. */ private static final int MAX_LANG_CACHE_SIZE = 500; + private String defaultLanguage; private final FullTextIndex<?> fullTextIndex; @@ -833,25 +837,20 @@ protected Properties initProperties() { final Properties parentProperties = fullTextIndex.getProperties(); Properties myProps; - if (Boolean.getBoolean(parentProperties.getProperty(Options.NATURAL_LANGUAGE_SUPPORT, Options.DEFAULT_NATURAL_LAMGUAGE_SUPPORT))) { - myProps = defaultProperties(); + if (Boolean.valueOf(parentProperties.getProperty(Options.NATURAL_LANGUAGE_SUPPORT, Options.DEFAULT_NATURAL_LAMGUAGE_SUPPORT))) { + myProps = loadPropertyString(ALL_LUCENE_NATURAL_LANGUAGES); } else { - myProps = new Properties(); + myProps = loadPropertyString(LUCENE_STANDARD_ANALYZER); } copyRelevantProperties(fullTextIndex.getProperties(), myProps); - - if (myProps.isEmpty()) { - return defaultProperties(); - } else { - return myProps; - } + return myProps; } - protected Properties defaultProperties() { + Properties loadPropertyString(String props) { Properties rslt = new Properties(); try { - rslt.load(new StringReader(DEFAULT_PROPERTIES)); + rslt.load(new StringReader(props)); } catch (IOException e) { throw new RuntimeException("Impossible - well clearly not!", e); } Modified: 
branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java 2014-05-09 19:07:09 UTC (rev 8256) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java 2014-05-09 22:39:10 UTC (rev 8257) @@ -1,153 +1,20 @@ -/** - -Copyright (C) SYSTAP, LLC 2006-2014. All rights reserved. - -Contact: - SYSTAP, LLC - 4501 Tower Road - Greensboro, NC 27410 - lic...@bi... - -This program is free software; you can redistribute it and/or modify -it under the terms of the GNU General Public License as published by -the Free Software Foundation; version 2 of the License. - -This program is distributed in the hope that it will be useful, -but WITHOUT ANY WARRANTY; without even the implied warranty of -MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the -GNU General Public License for more details. - -You should have received a copy of the GNU General Public License -along with this program; if not, write to the Free Software -Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA -*/ -/* - * Created on May 7, 2014 - */ package com.bigdata.search; -import java.io.IOException; - - public abstract class AbstractAnalyzerFactoryTest extends AbstractSearchTest { - public AbstractAnalyzerFactoryTest() { + public AbstractAnalyzerFactoryTest() { } - - public AbstractAnalyzerFactoryTest(String arg0) { - super(arg0); + + public AbstractAnalyzerFactoryTest(String arg0) { + super(arg0); } - - @Override - public void setUp() throws Exception { - super.setUp(); - init(getExtraProperties()); - } - - - abstract String[] getExtraProperties(); - - public void testEnglishFilterStopWords() throws IOException { - for (String lang: new String[]{ "eng", null, "" }) { //$NON-NLS-1$ //$NON-NLS-2$ - comparisonTest(lang, - true, - "The test to end all tests! 
Forever.", //$NON-NLS-1$ - "test end all tests forever" //$NON-NLS-1$ - ); - } - } - public void testEnglishNoFilter() throws IOException { - for (String lang: new String[]{ "eng", null, "" }) { //$NON-NLS-1$ //$NON-NLS-2$ - comparisonTest(lang, - false, - "The test to end all tests! Forever.", //$NON-NLS-1$ - "the test to end all tests forever" //$NON-NLS-1$ - ); - } - } - - // Note we careful use a three letter language code for german. - // 'de' is more standard, but the DefaultAnalyzerFactory does not - // implement 'de' correctly. - public void testGermanFilterStopWords() throws IOException { - comparisonTest("ger", //$NON-NLS-1$ - true, - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.10") + //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.11"), //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.12") //$NON-NLS-1$ - ); - - } - // Note we careful use a three letter language code for Russian. - // 'ru' is more standard, but the DefaultAnalyzerFactory does not - // implement 'ru' correctly. - public void testRussianFilterStopWords() throws IOException { - comparisonTest("rus", //$NON-NLS-1$ - true, - // I hope this is not offensive text. 
- NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.14") + //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.15"), //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.16") //$NON-NLS-1$ - ); - - } - public void testGermanNoStopWords() throws IOException { - comparisonTest("ger", //$NON-NLS-1$ - false, - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.18") + //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.19"), //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.20") //$NON-NLS-1$ - ); - - } - public void testRussianNoStopWords() throws IOException { - comparisonTest("rus", //$NON-NLS-1$ - false, - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.22") + //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.23"), //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.24") //$NON-NLS-1$ - ); - - } - public void testJapanese() throws IOException { - for (boolean filterStopWords: new Boolean[]{true, false}) { - comparisonTest("jpn", //$NON-NLS-1$ - filterStopWords, - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.26"), //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.27") + //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.28") + //$NON-NLS-1$ - NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.29")); //$NON-NLS-1$ - } - } - public void testConfiguredLanguages() { - checkConfig("BrazilianAnalyzer", "por", "pt"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ - checkConfig("ChineseAnalyzer", "zho", "chi", "zh"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ - checkConfig("CJKAnalyzer", "jpn", "ja", "kor", "ko"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ //$NON-NLS-5$ - checkConfig("CzechAnalyzer", "ces", "cze", "cs"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ - checkConfig("DutchAnalyzer", "dut", "nld", "nl"); 
//$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ - checkConfig("GermanAnalyzer", "deu", "ger", "de"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ - checkConfig("GreekAnalyzer", "gre", "ell", "el"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ - checkConfig("RussianAnalyzer", "rus", "ru"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ - checkConfig("ThaiAnalyzer", "th", "tha"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ - checkConfig("StandardAnalyzer", "en", "eng", "", null); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ - } - - private void checkConfig(String classname, String ...langs) { - checkConfig(isBroken(), classname, langs); - + @Override + public void setUp() throws Exception { + super.setUp(); + init(getExtraProperties()); } - protected void checkConfig(boolean threeLetterOnly, String classname, String ...langs) { - for (String lang:langs) { - // The DefaultAnalyzerFactory only works for language tags of length exactly three. - if ((!threeLetterOnly) || (lang != null && lang.length()==3)) - { - assertEquals(classname, getAnalyzer(lang,true).getClass().getSimpleName()); - if (!threeLetterOnly) assertEquals(classname, getAnalyzer(lang+"-x-foobar",true).getClass().getSimpleName()); //$NON-NLS-1$ - } - } - - } - abstract boolean isBroken() ; + abstract String[] getExtraProperties(); + } Copied: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractDefaultAnalyzerFactoryTest.java (from rev 8256, branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java) =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractDefaultAnalyzerFactoryTest.java (rev 0) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractDefaultAnalyzerFactoryTest.java 2014-05-09 22:39:10 UTC (rev 8257) @@ -0,0 +1,133 @@ +/** + +Copyright (C) SYSTAP, LLC 2006-2014. All rights reserved. 
+ +Contact: + SYSTAP, LLC + 4501 Tower Road + Greensboro, NC 27410 + lic...@bi... + +This program is free software; you can redistribute it and/or modify +it under the terms of the GNU General Public License as published by +the Free Software Foundation; version 2 of the License. + +This program is distributed in the hope that it will be useful, +but WITHOUT ANY WARRANTY; without even the implied warranty of +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +GNU General Public License for more details. + +You should have received a copy of the GNU General Public License +along with this program; if not, write to the Free Software +Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA +*/ +/* + * Created on May 7, 2014 + */ +package com.bigdata.search; + +import java.io.IOException; + + +public abstract class AbstractDefaultAnalyzerFactoryTest extends AbstractAnalyzerFactoryTest { + + public AbstractDefaultAnalyzerFactoryTest() { + } + + public AbstractDefaultAnalyzerFactoryTest(String arg0) { + super(arg0); + } + + public void testEnglishFilterStopWords() throws IOException { + for (String lang: new String[]{ "eng", null, "" }) { //$NON-NLS-1$ //$NON-NLS-2$ + comparisonTest(lang, + true, + "The test to end all tests! Forever.", //$NON-NLS-1$ + "test end all tests forever" //$NON-NLS-1$ + ); + } + } + public void testEnglishNoFilter() throws IOException { + for (String lang: new String[]{ "eng", null, "" }) { //$NON-NLS-1$ //$NON-NLS-2$ + comparisonTest(lang, + false, + "The test to end all tests! Forever.", //$NON-NLS-1$ + "the test to end all tests forever" //$NON-NLS-1$ + ); + } + } + + // Note we careful use a three letter language code for german. + // 'de' is more standard, but the DefaultAnalyzerFactory does not + // implement 'de' correctly. 
+ public void testGermanFilterStopWords() throws IOException { + comparisonTest("ger", //$NON-NLS-1$ + true, + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.10") + //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.11"), //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.12") //$NON-NLS-1$ + ); + + } + + // Note we careful use a three letter language code for Russian. + // 'ru' is more standard, but the DefaultAnalyzerFactory does not + // implement 'ru' correctly. + public void testRussianFilterStopWords() throws IOException { + comparisonTest("rus", //$NON-NLS-1$ + true, + // I hope this is not offensive text. + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.14") + //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.15"), //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.16") //$NON-NLS-1$ + ); + + } + public void testGermanNoStopWords() throws IOException { + comparisonTest("ger", //$NON-NLS-1$ + false, + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.18") + //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.19"), //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.20") //$NON-NLS-1$ + ); + + } + public void testRussianNoStopWords() throws IOException { + comparisonTest("rus", //$NON-NLS-1$ + false, + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.22") + //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.23"), //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.24") //$NON-NLS-1$ + ); + + } + public void testJapanese() throws IOException { + for (boolean filterStopWords: new Boolean[]{true, false}) { + comparisonTest("jpn", //$NON-NLS-1$ + filterStopWords, + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.26"), //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.27") + //$NON-NLS-1$ + 
NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.28") + //$NON-NLS-1$ + NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.29")); //$NON-NLS-1$ + } + } + public void testConfiguredLanguages() { + checkConfig("BrazilianAnalyzer", "por", "pt"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ + checkConfig("ChineseAnalyzer", "zho", "chi", "zh"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ + checkConfig("CJKAnalyzer", "jpn", "ja", "kor", "ko"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ //$NON-NLS-5$ + checkConfig("CzechAnalyzer", "ces", "cze", "cs"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ + checkConfig("DutchAnalyzer", "dut", "nld", "nl"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ + checkConfig("GermanAnalyzer", "deu", "ger", "de"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ + checkConfig("GreekAnalyzer", "gre", "ell", "el"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ + checkConfig("RussianAnalyzer", "rus", "ru"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ + checkConfig("ThaiAnalyzer", "th", "tha"); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ + checkConfig("StandardAnalyzer", "en", "eng", "", null); //$NON-NLS-1$ //$NON-NLS-2$ //$NON-NLS-3$ //$NON-NLS-4$ + } + + @Override + protected void checkConfig(String classname, String ...langs) { + checkConfig(isBroken(), classname, langs); + + } + abstract boolean isBroken() ; +} Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 19:07:09 UTC (rev 8256) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 22:39:10 UTC (rev 8257) @@ -135,13 +135,28 @@ private void compareTokenStream(Analyzer a, String text, String expected[]) throws IOException { TokenStream s = a.tokenStream(null, new 
StringReader(text)); int ix = 0; - while (s.incrementToken()) { - final TermAttribute term = s.getAttribute(TermAttribute.class); - final String word = term.term(); - assertTrue(ix < expected.length); - assertEquals(expected[ix++], word); - } - assertEquals(ix, expected.length); + while (s.incrementToken()) { + final TermAttribute term = s.getAttribute(TermAttribute.class); + final String word = term.term(); + assertTrue(ix < expected.length); + assertEquals(expected[ix++], word); + } + assertEquals(ix, expected.length); } + protected void checkConfig(boolean threeLetterOnly, String classname, String ...langs) { + for (String lang:langs) { + // The DefaultAnalyzerFactory only works for language tags of length exactly three. + if ((!threeLetterOnly) || (lang != null && lang.length()==3)) { + assertEquals(classname, getAnalyzer(lang,true).getClass().getSimpleName()); + if (!threeLetterOnly) { + assertEquals(classname, getAnalyzer(lang+"-x-foobar",true).getClass().getSimpleName()); //$NON-NLS-1$ + } + } + } + } + protected void checkConfig(String classname, String ...langs) { + checkConfig(false, classname, langs); + } + } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestAll.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestAll.java 2014-05-09 19:07:09 UTC (rev 8256) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestAll.java 2014-05-09 22:39:10 UTC (rev 8257) @@ -115,6 +115,7 @@ // behavior of DefaultAnalyzerFactory suite.addTestSuite(TestConfigurableAsDefaultAnalyzerFactory.class); suite.addTestSuite(TestConfigurableAnalyzerFactory.class); + suite.addTestSuite(TestUnconfiguredAnalyzerFactory.class); return suite; } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java =================================================================== --- 
branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 19:07:09 UTC (rev 8256) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 22:39:10 UTC (rev 8257) @@ -45,7 +45,7 @@ * @author jeremycarroll * */ -public class TestConfigurableAnalyzerFactory extends AbstractSearchTest { +public class TestConfigurableAnalyzerFactory extends AbstractAnalyzerFactoryTest { public TestConfigurableAnalyzerFactory() { } @@ -54,12 +54,8 @@ super(arg0); } - public void setUp() throws Exception { - super.setUp(); - init(getExtraProperties()); - } - - private String[] getExtraProperties() { + @Override + String[] getExtraProperties() { String analyzer = ConfigurableAnalyzerFactory.Options.ANALYZER; return new String[]{ FullTextIndex.Options.ANALYZER_FACTORY_CLASS, ConfigurableAnalyzerFactory.class.getName(), Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAsDefaultAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAsDefaultAnalyzerFactory.java 2014-05-09 19:07:09 UTC (rev 8256) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAsDefaultAnalyzerFactory.java 2014-05-09 22:39:10 UTC (rev 8257) @@ -26,7 +26,7 @@ */ package com.bigdata.search; -public class TestConfigurableAsDefaultAnalyzerFactory extends AbstractAnalyzerFactoryTest { +public class TestConfigurableAsDefaultAnalyzerFactory extends AbstractDefaultAnalyzerFactoryTest { public TestConfigurableAsDefaultAnalyzerFactory() { } @@ -37,7 +37,9 @@ @Override String[] getExtraProperties() { - return new String[]{FullTextIndex.Options.ANALYZER_FACTORY_CLASS, ConfigurableAnalyzerFactory.class.getName()}; + return new String[]{FullTextIndex.Options.ANALYZER_FACTORY_CLASS, ConfigurableAnalyzerFactory.class.getName(), + 
ConfigurableAnalyzerFactory.Options.NATURAL_LANGUAGE_SUPPORT, "true" + }; } @Override Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestDefaultAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestDefaultAnalyzerFactory.java 2014-05-09 19:07:09 UTC (rev 8256) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestDefaultAnalyzerFactory.java 2014-05-09 22:39:10 UTC (rev 8257) @@ -26,7 +26,7 @@ */ package com.bigdata.search; -public class TestDefaultAnalyzerFactory extends AbstractAnalyzerFactoryTest { +public class TestDefaultAnalyzerFactory extends AbstractDefaultAnalyzerFactoryTest { public TestDefaultAnalyzerFactory() { } Added: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestUnconfiguredAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestUnconfiguredAnalyzerFactory.java (rev 0) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestUnconfiguredAnalyzerFactory.java 2014-05-09 22:39:10 UTC (rev 8257) @@ -0,0 +1,24 @@ +package com.bigdata.search; + +public class TestUnconfiguredAnalyzerFactory extends AbstractAnalyzerFactoryTest { + + public TestUnconfiguredAnalyzerFactory() { + } + + public TestUnconfiguredAnalyzerFactory(String arg0) { + super(arg0); + } + + @Override + String[] getExtraProperties() { + return new String[]{ + FullTextIndex.Options.ANALYZER_FACTORY_CLASS, ConfigurableAnalyzerFactory.class.getName(), + }; + } + + public void testConfiguredLanguages() { + checkConfig("StandardAnalyzer", "por", "pt", "zho", "chi", "zh", "jpn", "ja", "kor", "ko", "ces", "cze", "cs", "dut", "nld", "nl", + "deu", "ger", "de", "gre", "ell", "el", "rus", "ru", "th", "tha", "en", "eng", "", null); + } + +} This was sent by the SourceForge.net collaborative development platform, the world's largest Open 
Source development site. |
From: <jer...@us...> - 2014-05-09 19:07:12
|
Revision: 8256 http://sourceforge.net/p/bigdata/code/8256 Author: jeremy_carroll Date: 2014-05-09 19:07:09 +0000 (Fri, 09 May 2014) Log Message: ----------- Addressing trac 915 by documenting the current behavior and deprecating DefaultAnalyzerFactory and suggesting the use of ConfigurableAnalyzerFactory instead Modified Paths: -------------- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/DefaultAnalyzerFactory.java Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 19:07:02 UTC (rev 8255) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 19:07:09 UTC (rev 8256) @@ -90,7 +90,7 @@ * Properties from {@link Options} apply to the factory. * <p> * - * If there are no such properties at all then the property {@link Options#INCLUDE_DEFAULTS} is set to true, + * If there are no such properties at all then the property {@link Options#NATURAL_LANGUAGE_SUPPORT} is set to true, * and the behavior of this class is the same as the legacy {@link DefaultAnalyzerFactory}. 
* <p> * Other properties, from {@link AnalyzerOptions} start with @@ -117,7 +117,7 @@ * <dd>This suppresses the functionality, by treating every expression as a stop word.</dd> * </dl> * there are in addition the language specific analyzers that are included - * by using the option {@link Options#INCLUDE_DEFAULTS} + * by using the option {@link Options#NATURAL_LANGUAGE_SUPPORT} * * * @author jeremycarroll @@ -265,18 +265,13 @@ * * */ - String INCLUDE_DEFAULTS = ConfigurableAnalyzerFactory.class.getName() + ".includeDefaults"; + String NATURAL_LANGUAGE_SUPPORT = ConfigurableAnalyzerFactory.class.getName() + ".includeDefaults"; /** * This is the prefix to all properties configuring the individual analyzers. */ String ANALYZER = ConfigurableAnalyzerFactory.class.getName() + ".analyzer."; -/** - * If there is no configuration at all, then the defaults are included, - * but any configuration at all totally replaces the defaults, unless - * {@link #INCLUDE_DEFAULTS} - * is explicitly set to true. - */ - String DEFAULT_INCLUDE_DEFAULTS = "false"; + + String DEFAULT_NATURAL_LAMGUAGE_SUPPORT = "false"; } /** * Options understood by analyzers created by {@link ConfigurableAnalyzerFactory}. 
@@ -810,7 +805,7 @@ while (en.hasMoreElements()) { String prop = (String)en.nextElement(); - if (prop.equals(Options.INCLUDE_DEFAULTS)) continue; + if (prop.equals(Options.NATURAL_LANGUAGE_SUPPORT)) continue; if (prop.startsWith(Options.ANALYZER)) { String languageRangeAndProperty[] = prop.substring(Options.ANALYZER.length()).replaceAll("_","*").split("[.]"); if (languageRangeAndProperty.length == 2) { @@ -838,7 +833,7 @@ protected Properties initProperties() { final Properties parentProperties = fullTextIndex.getProperties(); Properties myProps; - if (Boolean.getBoolean(parentProperties.getProperty(Options.INCLUDE_DEFAULTS, Options.DEFAULT_INCLUDE_DEFAULTS))) { + if (Boolean.getBoolean(parentProperties.getProperty(Options.NATURAL_LANGUAGE_SUPPORT, Options.DEFAULT_NATURAL_LAMGUAGE_SUPPORT))) { myProps = defaultProperties(); } else { myProps = new Properties(); Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/DefaultAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/DefaultAnalyzerFactory.java 2014-05-09 19:07:02 UTC (rev 8255) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/DefaultAnalyzerFactory.java 2014-05-09 19:07:09 UTC (rev 8256) @@ -29,7 +29,6 @@ import java.util.Collections; import java.util.HashMap; -import java.util.HashSet; import java.util.Locale; import java.util.Map; import java.util.Set; @@ -52,11 +51,24 @@ import com.bigdata.btree.keys.KeyBuilder; /** + * This is the default implementation but could be regarded as legacy since + * it fails to use the correct {@link Analyzer} for almost all languages (other than + * English). It uses the correct natural language analyzer for literals tagged with + * "por", "deu", "ger", "zho", "chi", "jpn", "kor", "ces", "cze", "dut", "nld", "gre", "ell", + * "fra", "fre", "rus" and "tha". + * These codes do not work if they are used with subtags, e.g. 
"ger-AT" is treated as English. + * No two-letter code works correctly: note that the W3C and + * IETF recommend the use of the two letter forms instead of the three letter forms. + * <p> * Default implementation registers a bunch of {@link Analyzer}s for various * language codes and then serves the appropriate {@link Analyzer} based on * the specified language code. * * @author <a href="mailto:tho...@us...">Bryan Thompson</a> + * @deprecated Using {@link ConfigurableAnalyzerFactory} with + * the {@link ConfigurableAnalyzerFactory.Options#NATURAL_LANGUAGE_SUPPORT} option + * uses the appropriate natural language analyzers for the two letter codes + * and for tags which include sub-tags. * @version $Id$ */ public class DefaultAnalyzerFactory implements IAnalyzerFactory {
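For readers following along, the renamed option is enabled through ordinary index properties. The sketch below is illustrative only: the `c.b.s.C` abbreviation follows the class javadoc, and the exact key spelling for the analyzer-factory selection is an assumption, not taken from this commit.

```properties
# Select ConfigurableAnalyzerFactory as the IAnalyzerFactory implementation
# (key string for FullTextIndex.Options.ANALYZER_FACTORY_CLASS is assumed here)
com.bigdata.search.FullTextIndex.analyzerFactoryClass=com.bigdata.search.ConfigurableAnalyzerFactory

# NATURAL_LANGUAGE_SUPPORT: note that the constant still resolves to the
# legacy ".includeDefaults" suffix, so the on-disk property key is unchanged
com.bigdata.search.ConfigurableAnalyzerFactory.includeDefaults=true
```

With this set, the factory installs the per-language analyzers checked by testConfiguredLanguages earlier in this thread, instead of the unconfigured StandardAnalyzer-only behavior.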
From: <jer...@us...> - 2014-05-09 19:07:05
|
Revision: 8255 http://sourceforge.net/p/bigdata/code/8255 Author: jeremy_carroll Date: 2014-05-09 19:07:02 +0000 (Fri, 09 May 2014) Log Message: ----------- minor polishing, a few more tests Modified Paths: -------------- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 18:10:14 UTC (rev 8254) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 19:07:02 UTC (rev 8255) @@ -144,11 +144,10 @@ * * - the subword boundaries are identified in {@link #next()} * We then set up {@link #found} to contain the most - * recently found subword, with afterDiscard containing - * the same word as found with the {@link #discard} pattern - * applied. {@link #afterDiscard} is not equal to found; if there - * is nothing to discard then it is null. + * recently found subword. * + * - the soft hyphen discarding is processed in {@link #maybeDiscardHyphens()} + * * - if we are not {@link #alwaysDiscard}ing then {@link #afterDiscard} * can be set to null to return the non-discarded version on the next cycle. 
* @@ -216,14 +215,14 @@ afterDiscard = null; if (charPos + 1 < currentWord.length && softMatcher.find(charPos+1)) { charPos = softMatcher.end(); - considerMatch(); + maybeDiscardHyphens(); return true; } else { return nextWord(); } } - void considerMatch() { + void maybeDiscardHyphens() { found = CharBuffer.wrap(currentWord, charPos, currentWord.length - charPos); Matcher discarding = discard.matcher(found); if (discarding.find()) { @@ -240,7 +239,7 @@ termAtt.resizeTermBuffer(currentWord.length); charPos = 0; softMatcher = subWordBoundary.matcher(words[currentWordIx]); - considerMatch(); + maybeDiscardHyphens(); return true; } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 18:10:14 UTC (rev 8254) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 19:07:02 UTC (rev 8255) @@ -102,13 +102,13 @@ return getNdx().getAnalyzer(lang, filterStopWords); } - protected void comparisonTest(String lang, boolean stopWordsSignificant, String text, String spaceSeparated) + protected void comparisonTest(String lang, boolean filterStopWords, String text, String spaceSeparated) throws IOException { if (spaceSeparated == null) { - String rslt = getTokenStream(getAnalyzer(lang, stopWordsSignificant), text); + String rslt = getTokenStream(getAnalyzer(lang, filterStopWords), text); throw new RuntimeException("Got \"" + rslt+ "\""); } - compareTokenStream(getAnalyzer(lang, stopWordsSignificant), text, + compareTokenStream(getAnalyzer(lang, filterStopWords), text, split(spaceSeparated)); //$NON-NLS-1$ } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java =================================================================== --- 
branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 18:10:14 UTC (rev 8254) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 19:07:02 UTC (rev 8255) @@ -35,6 +35,16 @@ import com.bigdata.search.ConfigurableAnalyzerFactory.AnalyzerOptions; +/** + * Unit tests for {@link ConfigurableAnalyzerFactory}. + * We use the same setup, as defined in {@link #getExtraProperties()} + * for all the tests. Some of the tests check whether bad combinations + * of options are detected and reported correctly. + * Others check that some input, in a particular language is + * tokenized as expected. + * @author jeremycarroll + * + */ public class TestConfigurableAnalyzerFactory extends AbstractSearchTest { public TestConfigurableAnalyzerFactory() { @@ -68,8 +78,8 @@ analyzer+"x-hyphen2."+AnalyzerOptions.WORD_BOUNDARY, " ", analyzer+"x-hyphen2."+AnalyzerOptions.ALWAYS_REMOVE_SOFT_HYPHENS, "true", analyzer+"x-keywords."+AnalyzerOptions.ANALYZER_CLASS, KeywordAnalyzer.class.getName(), - analyzer+"ru-x-de."+AnalyzerOptions.ANALYZER_CLASS, RussianAnalyzer.class.getName(), - analyzer+"ru-x-de."+AnalyzerOptions.STOPWORDS, GermanAnalyzer.class.getName(), + analyzer+"en-x-de."+AnalyzerOptions.ANALYZER_CLASS, StandardAnalyzer.class.getName(), + analyzer+"en-x-de."+AnalyzerOptions.STOPWORDS, GermanAnalyzer.class.getName(), }; } @@ -142,6 +152,25 @@ ); } + + public void testStopWordSwitch() throws IOException { + // en-x-de is an English Analyzer using german stopwords! 
+ comparisonTest("en-x-de", + true, + "The fast car arrived slowly.", + "the fast car arrived slowly" + ); + comparisonTest("en-x-de", + true, + "The fast car die arrived slowly.", + "the fast car arrived slowly" + ); + comparisonTest("en-x-de", + false, + "The fast car die arrived slowly.", + "the fast car die arrived slowly" + ); + } public void testSyapseExample1() throws IOException { comparisonTest("x-splits", true,
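The `en-x-de` range exercised by testStopWordSwitch pairs an English analyzer with German stopwords. As property settings it might look like the following; the key suffixes follow `AnalyzerOptions`, but the exact spellings `analyzerClass` and `stopwords` are assumptions:

```properties
# c.b.s.C abbreviates com.bigdata.search.ConfigurableAnalyzerFactory
c.b.s.C.analyzer.en-x-de.analyzerClass=org.apache.lucene.analysis.standard.StandardAnalyzer
# naming an Analyzer class here borrows that class's default stop set
c.b.s.C.analyzer.en-x-de.stopwords=org.apache.lucene.analysis.de.GermanAnalyzer
```

That pairing is why, in the test, the German stop word "die" is dropped when stopword filtering is on while the English "the" survives.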
From: <jer...@us...> - 2014-05-09 18:10:17
|
Revision: 8254 http://sourceforge.net/p/bigdata/code/8254 Author: jeremy_carroll Date: 2014-05-09 18:10:14 +0000 (Fri, 09 May 2014) Log Message: ----------- Added test for term completion, with bug fix! Modified Paths: -------------- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 17:44:11 UTC (rev 8253) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 18:10:14 UTC (rev 8254) @@ -216,16 +216,20 @@ afterDiscard = null; if (charPos + 1 < currentWord.length && softMatcher.find(charPos+1)) { charPos = softMatcher.end(); - found = CharBuffer.wrap(currentWord, charPos, currentWord.length - charPos); - Matcher discarding = discard.matcher(found); - if (discarding.find()) { - afterDiscard = discarding.replaceAll(""); - } + considerMatch(); return true; } else { return nextWord(); } } + + void considerMatch() { + found = CharBuffer.wrap(currentWord, charPos, currentWord.length - charPos); + Matcher discarding = discard.matcher(found); + if (discarding.find()) { + afterDiscard = discarding.replaceAll(""); + } + } private boolean nextWord() { currentWordIx++; @@ -235,8 +239,8 @@ currentWord = words[currentWordIx].toCharArray(); termAtt.resizeTermBuffer(currentWord.length); charPos = 0; - found = CharBuffer.wrap(currentWord, charPos, currentWord.length - charPos); softMatcher = subWordBoundary.matcher(words[currentWordIx]); + considerMatch(); return true; } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 
=================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 17:44:11 UTC (rev 8253) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 18:10:14 UTC (rev 8254) @@ -59,6 +59,14 @@ analyzer+"x-splits."+AnalyzerOptions.STOPWORDS, AnalyzerOptions.STOPWORDS_VALUE_NONE, analyzer+"x-splits."+AnalyzerOptions.WORD_BOUNDARY, " ", analyzer+"x-splits."+AnalyzerOptions.SUB_WORD_BOUNDARY, "(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})", + analyzer+"x-hyphen."+AnalyzerOptions.SUB_WORD_BOUNDARY, "[-.]", + analyzer+"x-hyphen."+AnalyzerOptions.SOFT_HYPHENS, "-", + analyzer+"x-hyphen."+AnalyzerOptions.WORD_BOUNDARY, " ", + analyzer+"x-hyphen."+AnalyzerOptions.ALWAYS_REMOVE_SOFT_HYPHENS, "false", + analyzer+"x-hyphen2."+AnalyzerOptions.SUB_WORD_BOUNDARY, "[-.]", + analyzer+"x-hyphen2."+AnalyzerOptions.SOFT_HYPHENS, "-", + analyzer+"x-hyphen2."+AnalyzerOptions.WORD_BOUNDARY, " ", + analyzer+"x-hyphen2."+AnalyzerOptions.ALWAYS_REMOVE_SOFT_HYPHENS, "true", analyzer+"x-keywords."+AnalyzerOptions.ANALYZER_CLASS, KeywordAnalyzer.class.getName(), analyzer+"ru-x-de."+AnalyzerOptions.ANALYZER_CLASS, RussianAnalyzer.class.getName(), analyzer+"ru-x-de."+AnalyzerOptions.STOPWORDS, GermanAnalyzer.class.getName(), @@ -190,5 +198,21 @@ ); } + public void testSyapseExample8() throws IOException { + comparisonTest("x-hyphen", + true, + "\u00b1-ACE3.1.1 ab-bc.cd-de", + "\u00b1ACE3.1.1 \u00b1-ACE3.1.1 ACE3.1.1 1.1 1 abbc.cdde ab-bc.cd-de bc.cdde bc.cd-de cdde cd-de de" + ); + + } + public void testSyapseExample9() throws IOException { + comparisonTest("x-hyphen2", + true, + "\u00b1-ACE3.1.1 ab-bc.cd-de", + "\u00b1ACE3.1.1 ACE3.1.1 1.1 1 abbc.cdde bc.cdde cdde de" + ); + + } }
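The `x-splits` sub-word boundary used in these tests is an ordinary `java.util.regex` pattern made of zero-width assertions, so its behavior can be checked outside the analyzer. A minimal sketch follows; the class and method names are illustrative and not part of the codebase:

```java
import java.util.Arrays;
import java.util.regex.Pattern;

public class SubWordBoundaryDemo {

    // Boundary before any letter/digit run, before an uppercase letter,
    // and between a digit and a following letter (as in the x-splits config).
    private static final Pattern SUB_WORD_BOUNDARY = Pattern.compile(
            "(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})");

    // Split a single word at each zero-width boundary match.
    public static String[] splitSubWords(String word) {
        return SUB_WORD_BOUNDARY.split(word);
    }

    public static void main(String[] args) {
        // camel-case and digit-to-letter transitions become sub-word starts
        System.out.println(Arrays.toString(splitSubWords("fastCar2go"))); // [fast, Car2, go]
    }
}
```

A term-completion analyzer then emits each suffix starting at such a boundary, which is what lets a query prefix such as `Car` match inside `fastCar2go`.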
From: <jer...@us...> - 2014-05-09 17:44:13
|
Revision: 8253 http://sourceforge.net/p/bigdata/code/8253 Author: jeremy_carroll Date: 2014-05-09 17:44:11 +0000 (Fri, 09 May 2014) Log Message: ----------- Fix to trac 874 - JoinFilter placement and Unions Modified Paths: -------------- branches/BIGDATA_RELEASE_1_3_0/bigdata-rdf/src/java/com/bigdata/rdf/sparql/ast/StaticAnalysis.java branches/BIGDATA_RELEASE_1_3_0/bigdata-rdf/src/java/com/bigdata/rdf/sparql/ast/StaticAnalysis_CanJoin.java Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-rdf/src/java/com/bigdata/rdf/sparql/ast/StaticAnalysis.java =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-rdf/src/java/com/bigdata/rdf/sparql/ast/StaticAnalysis.java 2014-05-09 17:44:02 UTC (rev 8252) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-rdf/src/java/com/bigdata/rdf/sparql/ast/StaticAnalysis.java 2014-05-09 17:44:11 UTC (rev 8253) @@ -1081,7 +1081,7 @@ */ // MUST : JOIN GROUP - private Set<IVariable<?>> getDefinitelyProducedBindings( + Set<IVariable<?>> getDefinitelyProducedBindings( final JoinGroupNode node, final Set<IVariable<?>> vars, final boolean recursive) { // Note: always report what is bound when we enter a group. 
The caller Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-rdf/src/java/com/bigdata/rdf/sparql/ast/StaticAnalysis_CanJoin.java =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-rdf/src/java/com/bigdata/rdf/sparql/ast/StaticAnalysis_CanJoin.java 2014-05-09 17:44:02 UTC (rev 8252) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-rdf/src/java/com/bigdata/rdf/sparql/ast/StaticAnalysis_CanJoin.java 2014-05-09 17:44:11 UTC (rev 8253) @@ -51,7 +51,7 @@ * @version $Id: StaticAnalysis_CanJoin.java 5378 2011-10-20 20:37:08Z * thompsonbry $ */ -public class StaticAnalysis_CanJoin extends StaticAnalysisBase { +public abstract class StaticAnalysis_CanJoin extends StaticAnalysisBase { private static final Logger log = Logger.getLogger(StaticAnalysis.class); @@ -392,12 +392,16 @@ // the constraints for the current predicate in the join path. final List<FilterNode> constraints = new LinkedList<FilterNode>(); - /* - * Visit the variables used by the predicate (and bound by it since - * it is not an optional predicate) and add them into the total set - * of variables which are bound at this point in the join path. - */ - getSpannedVariables((BOp) p, boundVars); + +// /* +// * Visit the variables used by the predicate (and bound by it since +// * it is not an optional predicate) and add them into the total set +// * of variables which are bound at this point in the join path. +// */ +// getSpannedVariables((BOp) p, boundVars); + // above does not work if p is a Union nor, I suspect, a Minus - jjc + // tring this next line as an alternative - jjc. + getDefinitelyProducedBindings(p, boundVars, true); if (joinGraphConstraints != null) { @@ -479,5 +483,43 @@ return ret; } + + /** + * Return the set of variables which MUST be bound for solutions after the + * evaluation of this group. 
A group will produce "MUST" bindings for + * variables from its statement patterns and a LET based on an expression + * whose variables are known bound. + * <p> + * The returned collection reflects "bottom-up" evaluation semantics. This + * method does NOT consider variables which are already bound on entry to + * the group. + * <p> + * Note: When invoked for an OPTIONAL or MINUS join group, the variables + * which would become bound during the evaluation of the join group are + * reported. Caller's who wish to NOT have variables reported for OPTIONAL + * or MINUS groups MUST NOT invoke this method for those groups. + * <p> + * Note: The recursive analysis does not throw out variables when part of + * the tree will provably fail to bind anything. It is the role of query + * optimizers to identify those situations and prune the AST appropriately. + * <p> + * The class hierarchy is a little untidy at this point. + * This method is defined in the only subclass of this abstract class. + * Initially it was thought to not be needed here. + * + * @param node + * The node to be analyzed. + * @param vars + * Where to store the "MUST" bound variables. + * @param recursive + * When <code>true</code>, the child groups will be recursively + * analyzed. When <code>false</code>, only <i>this</i> group will + * be analyzed. + * + * @return The argument. + */ + public abstract Set<IVariable<?>> getDefinitelyProducedBindings( + final IBindingProducerNode node, final Set<IVariable<?>> vars, + final boolean recursive); } This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
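The replaced call in `StaticAnalysis_CanJoin` matters because, as the in-line comment notes, `getSpannedVariables` reports every variable that appears anywhere under a node, which over-reports for a UNION: a variable is only *definitely* produced if every branch binds it. The toy model below illustrates that distinction with an intersection over branches; the class names are illustrative and are not bigdata's actual AST API.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class DefinitelyBoundDemo {

    interface Node { Set<String> definitelyBound(); }

    /** A statement pattern definitely binds all of its variables. */
    static class StatementPattern implements Node {
        final Set<String> vars;
        StatementPattern(String... vars) { this.vars = new HashSet<>(Arrays.asList(vars)); }
        public Set<String> definitelyBound() { return new HashSet<>(vars); }
    }

    /** A UNION definitely binds only variables bound in EVERY branch. */
    static class Union implements Node {
        final List<Node> branches;
        Union(Node... branches) { this.branches = Arrays.asList(branches); }
        public Set<String> definitelyBound() {
            Set<String> out = null;
            for (Node b : branches) {
                Set<String> bb = b.definitelyBound();
                if (out == null) out = bb; else out.retainAll(bb); // intersection
            }
            return out == null ? new HashSet<>() : out;
        }
    }

    public static void main(String[] args) {
        // A "spanned variables" analysis would report {s, p, q};
        // only s is bound by both branches, so only s is definite.
        Node union = new Union(
                new StatementPattern("s", "p"),
                new StatementPattern("s", "q"));
        System.out.println(union.definitelyBound()); // [s]
    }
}
```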
From: <jer...@us...> - 2014-05-09 17:44:05
Revision: 8252 http://sourceforge.net/p/bigdata/code/8252 Author: jeremy_carroll Date: 2014-05-09 17:44:02 +0000 (Fri, 09 May 2014) Log Message: ----------- organized imports Modified Paths: -------------- branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestKeyBuilder.java branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestLanguageRange.java branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestPrefixSearch.java branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestSearch.java branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestSearchRestartSafe.java Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestKeyBuilder.java =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestKeyBuilder.java 2014-05-09 17:43:16 UTC (rev 8251) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestKeyBuilder.java 2014-05-09 17:44:02 UTC (rev 8252) @@ -33,13 +33,9 @@ import com.bigdata.btree.BytesUtil; import com.bigdata.btree.ITupleSerializer; import com.bigdata.btree.IndexMetadata; -import com.bigdata.btree.keys.DefaultKeyBuilderFactory; import com.bigdata.btree.keys.IKeyBuilder; import com.bigdata.btree.keys.KeyBuilder; import com.bigdata.btree.keys.StrengthEnum; -import com.bigdata.journal.IIndexManager; -import com.bigdata.journal.ITx; -import com.bigdata.journal.ProxyTestCase; import com.bigdata.search.FullTextIndex.Options; /** Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestLanguageRange.java =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestLanguageRange.java 2014-05-09 17:43:16 UTC (rev 8251) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestLanguageRange.java 2014-05-09 17:44:02 UTC (rev 8252) @@ -26,10 
+26,10 @@ */ package com.bigdata.search; +import junit.framework.TestCase2; + import com.bigdata.search.ConfigurableAnalyzerFactory.LanguageRange; -import junit.framework.TestCase2; - public class TestLanguageRange extends TestCase2 { public TestLanguageRange() { Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestPrefixSearch.java =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestPrefixSearch.java 2014-05-09 17:43:16 UTC (rev 8251) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestPrefixSearch.java 2014-05-09 17:44:02 UTC (rev 8252) @@ -29,12 +29,8 @@ package com.bigdata.search; import java.io.StringReader; -import java.util.Properties; import java.util.concurrent.TimeUnit; -import com.bigdata.journal.IIndexManager; -import com.bigdata.journal.ITx; -import com.bigdata.journal.ProxyTestCase; import com.bigdata.rdf.lexicon.ITextIndexer.FullTextQuery; /** Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestSearch.java =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestSearch.java 2014-05-09 17:43:16 UTC (rev 8251) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestSearch.java 2014-05-09 17:44:02 UTC (rev 8252) @@ -33,9 +33,6 @@ import java.util.Properties; import java.util.concurrent.TimeUnit; -import com.bigdata.journal.IIndexManager; -import com.bigdata.journal.ITx; -import com.bigdata.journal.ProxyTestCase; import com.bigdata.rdf.lexicon.ITextIndexer.FullTextQuery; /** Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestSearchRestartSafe.java =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestSearchRestartSafe.java 2014-05-09 17:43:16 UTC (rev 8251) +++ 
branches/BIGDATA_RELEASE_1_3_0/bigdata/src/test/com/bigdata/search/TestSearchRestartSafe.java 2014-05-09 17:44:02 UTC (rev 8252) @@ -29,12 +29,9 @@ package com.bigdata.search; import java.io.StringReader; -import java.util.Properties; import java.util.concurrent.TimeUnit; -import com.bigdata.journal.IIndexManager; import com.bigdata.journal.ITx; -import com.bigdata.journal.ProxyTestCase; import com.bigdata.rdf.lexicon.ITextIndexer.FullTextQuery; /** This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <jer...@us...> - 2014-05-09 17:43:20
Revision: 8251 http://sourceforge.net/p/bigdata/code/8251 Author: jeremy_carroll Date: 2014-05-09 17:43:16 +0000 (Fri, 09 May 2014) Log Message: ----------- Got tests working again, and cleaned up somewhat Modified Paths: -------------- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 17:43:05 UTC (rev 8250) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 17:43:16 UTC (rev 8251) @@ -326,12 +326,48 @@ String STOPWORDS_VALUE_NONE = "none"; /** - * If this property is present then the analyzer being used is a - * {@link PatternAnalyzer} and the value is the pattern to use. + * The value of the pattern parameter to + * {@link PatternAnalyzer#PatternAnalyzer(Version, Pattern, boolean, Set)} * (Note the {@link Pattern#UNICODE_CHARACTER_CLASS} flag is enabled). * It is an error if a different analyzer class is specified. */ String PATTERN = "pattern"; + /** + * The value of the wordBoundary parameter to + * {@link TermCompletionAnalyzer#TermCompletionAnalyzer(Pattern, Pattern, Pattern, boolean)} + * (Note the {@link Pattern#UNICODE_CHARACTER_CLASS} flag is enabled). + * It is an error if a different analyzer class is specified. 
+ */ + String WORD_BOUNDARY = "wordBoundary"; + /** + * The value of the subWordBoundary parameter to + * {@link TermCompletionAnalyzer#TermCompletionAnalyzer(Pattern, Pattern, Pattern, boolean)} + * (Note the {@link Pattern#UNICODE_CHARACTER_CLASS} flag is enabled). + * It is an error if a different analyzer class is specified. + */ + String SUB_WORD_BOUNDARY = "subWordBoundary"; + /** + * The value of the softHyphens parameter to + * {@link TermCompletionAnalyzer#TermCompletionAnalyzer(Pattern, Pattern, Pattern, boolean)} + * (Note the {@link Pattern#UNICODE_CHARACTER_CLASS} flag is enabled). + * It is an error if a different analyzer class is specified. + */ + String SOFT_HYPHENS = "softHypens"; + /** + * The value of the alwaysRemoveSoftHypens parameter to + * {@link TermCompletionAnalyzer#TermCompletionAnalyzer(Pattern, Pattern, Pattern, boolean)} + * (Note the {@link Pattern#UNICODE_CHARACTER_CLASS} flag is enabled). + * It is an error if a different analyzer class is specified. + */ + String ALWAYS_REMOVE_SOFT_HYPHENS = "alwaysRemoveSoftHypens"; + + boolean DEFAULT_ALWAYS_REMOVE_SOFT_HYPHENS = false; + + /** + * The default sub-word boundary is a pattern that never matches, + * i.e. there are no sub-word boundaries. + */ + Pattern DEFAULT_SUB_WORD_BOUNDARY = Pattern.compile("(?!)"); } @@ -382,16 +418,7 @@ this.withoutStopWords = copyMe.withoutStopWords; } - - public Analyzer getAnalyzer(boolean filterStopwords) { - return filterStopwords ? withStopWords : withoutStopWords; - } - @Override - public String toString() { - return range.full + "=(" + withStopWords.getClass().getSimpleName() +")"; - } - AnalyzerPair(String range, Constructor<? extends Analyzer> cons, Object ... params) throws Exception { this(range, cons.newInstance(params), cons.newInstance(useEmptyStopWordSet(params))); } @@ -409,7 +436,16 @@ } return rslt; } + + public Analyzer getAnalyzer(boolean filterStopwords) { + return filterStopwords ? 
withStopWords : withoutStopWords; + } @Override + public String toString() { + return range.full + "=(" + withStopWords.getClass().getSimpleName() +")"; + } + + @Override public int compareTo(AnalyzerPair o) { return range.compareTo(o.range); } @@ -437,10 +473,10 @@ private static class PatternAnalyzerPair extends AnalyzerPair { - public PatternAnalyzerPair(ConfigOptionsToAnalyzer lro, String pattern) throws Exception { + public PatternAnalyzerPair(ConfigOptionsToAnalyzer lro, Pattern pattern) throws Exception { super(lro.languageRange, getConstructor(PatternAnalyzer.class,Version.class,Pattern.class,Boolean.TYPE,Set.class), Version.LUCENE_CURRENT, - Pattern.compile(pattern, Pattern.UNICODE_CHARACTER_CLASS), + pattern, true, lro.getStopWords()); } @@ -459,9 +495,13 @@ String like; String className; String stopwords; - String pattern; + Pattern pattern; final String languageRange; AnalyzerPair result; + Pattern wordBoundary; + Pattern subWordBoundary; + Pattern softHyphens; + Boolean alwaysRemoveSoftHyphens; public ConfigOptionsToAnalyzer(String languageRange) { this.languageRange = languageRange; @@ -515,7 +555,15 @@ } else if (shortProperty.equals(AnalyzerOptions.STOPWORDS) ) { stopwords = value; } else if (shortProperty.equals(AnalyzerOptions.PATTERN) ) { - pattern = value; + pattern = Pattern.compile(value,Pattern.UNICODE_CHARACTER_CLASS); + } else if (shortProperty.equals(AnalyzerOptions.WORD_BOUNDARY) ) { + wordBoundary = Pattern.compile(value,Pattern.UNICODE_CHARACTER_CLASS); + } else if (shortProperty.equals(AnalyzerOptions.SUB_WORD_BOUNDARY) ) { + subWordBoundary = Pattern.compile(value,Pattern.UNICODE_CHARACTER_CLASS); + } else if (shortProperty.equals(AnalyzerOptions.SOFT_HYPHENS) ) { + softHyphens = Pattern.compile(value,Pattern.UNICODE_CHARACTER_CLASS); + } else if (shortProperty.equals(AnalyzerOptions.ALWAYS_REMOVE_SOFT_HYPHENS) ) { + alwaysRemoveSoftHyphens = Boolean.valueOf(value); } else { return false; } @@ -529,6 +577,27 @@ } className = 
PatternAnalyzer.class.getName(); } + if (this.wordBoundary != null ) { + if ( className != null && className != TermCompletionAnalyzer.class.getName()) { + throw new RuntimeException("Bad Option: Language range "+languageRange + " with pattern propety for class "+ className); + } + className = TermCompletionAnalyzer.class.getName(); + + if ( subWordBoundary == null ) { + subWordBoundary = AnalyzerOptions.DEFAULT_SUB_WORD_BOUNDARY; + } + if ( alwaysRemoveSoftHyphens != null && softHyphens == null ) { + throw new RuntimeException("Bad option: Language range "+languageRange + ": must specify softHypens when setting alwaysRemoveSoftHyphens"); + } + if (softHyphens != null && alwaysRemoveSoftHyphens == null) { + alwaysRemoveSoftHyphens = AnalyzerOptions.DEFAULT_ALWAYS_REMOVE_SOFT_HYPHENS; + } + + } else if ( subWordBoundary != null || softHyphens != null || alwaysRemoveSoftHyphens != null || + TermCompletionAnalyzer.class.getName().equals(className) ) { + throw new RuntimeException("Bad option: Language range "+languageRange + ": must specify wordBoundary for TermCompletionAnalyzer"); + } + if (PatternAnalyzer.class.getName().equals(className) && pattern == null ) { throw new RuntimeException("Bad Option: Language range "+languageRange + " must specify pattern for PatternAnalyzer."); } @@ -547,8 +616,23 @@ } if (pattern != null) { return new PatternAnalyzerPair(this, pattern); - - } + } + if (softHyphens != null) { + return new AnalyzerPair( + languageRange, + new TermCompletionAnalyzer( + wordBoundary, + subWordBoundary, + softHyphens, + alwaysRemoveSoftHyphens)); + } + if (wordBoundary != null) { + return new AnalyzerPair( + languageRange, + new TermCompletionAnalyzer( + wordBoundary, + subWordBoundary)); + } final Class<? 
extends Analyzer> cls = getAnalyzerClass(); if (hasConstructor(cls, Version.class, Set.class)) { Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 17:43:05 UTC (rev 8250) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 17:43:16 UTC (rev 8251) @@ -81,8 +81,8 @@ */ public class TermCompletionAnalyzer extends Analyzer { - private final Pattern wordBoundary; // = Pattern.compile(" ", Pattern.UNICODE_CHARACTER_CLASS); - private final Pattern subWordBoundary; // = Pattern.compile("(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})", Pattern.UNICODE_CHARACTER_CLASS); + private final Pattern wordBoundary; + private final Pattern subWordBoundary; private final Pattern discard; private final boolean alwaysDiscard; @@ -90,24 +90,25 @@ /** * Divide the input into words and short tokens * as with {@link #TermCompletionAnalyzer(Pattern, Pattern)}. - * If alsoWithSoftHypens is true then output each token, - * and in any case output each token with every - * match to softHyphenEtc deleted. + * Each term is generated, and then an additional term + * is generated with softHypens (defined by the pattern), + * removed. If the alwaysRemoveSoftHypens flag is true, + * then the first term (before the removal) is suppressed. * * @param wordBoundary The definition of space (e.g. " ") * @param subWordBoundary Also index after matches to this (e.g. "-") - * @param softHyphenEtc Discard these characters from matches - * @param alsoWithSoftHyphens If true the discard step is optional. + * @param softHyphens Discard these characters from matches + * @param alwaysRemoveSoftHypens If false the discard step is optional. 
*/ public TermCompletionAnalyzer(Pattern wordBoundary, Pattern subWordBoundary, - Pattern softHyphenEtc, - boolean alsoWithSoftHyphens) { + Pattern softHyphens, + boolean alwaysRemoveSoftHypens) { this.wordBoundary = wordBoundary; this.subWordBoundary = subWordBoundary; - if (softHyphenEtc != null) { - discard = softHyphenEtc; - alwaysDiscard = !alsoWithSoftHyphens; + if (softHyphens != null) { + discard = softHyphens; + alwaysDiscard = alwaysRemoveSoftHypens; } else { discard = Pattern.compile("(?!)"); // never matches alwaysDiscard = true; @@ -115,9 +116,10 @@ } /** * Divide the input into words, separated by the wordBoundary, - * and return a token for the whole word, and then for the - * remainder of the word after each successive match of the - * subWordBoundary. + * and return a token for each whole word, and then + * generate further tokens for each word by removing prefixes + * up to and including each successive match of + * subWordBoundary * @param wordBoundary * @param subWordBoundary */ @@ -189,8 +191,9 @@ afterDiscard.getChars(0, lg, termAtt.termBuffer(), 0); termAtt.setTermLength(lg); } else { - found.get(termAtt.termBuffer()); - termAtt.setTermLength(found.length()); + int lg = found.length(); + found.get(termAtt.termBuffer(), 0, lg); + termAtt.setTermLength(lg); } return true; } else { @@ -211,7 +214,7 @@ } } afterDiscard = null; - if (charPos +1 < currentWord.length && softMatcher.find(charPos+1)) { + if (charPos + 1 < currentWord.length && softMatcher.find(charPos+1)) { charPos = softMatcher.end(); found = CharBuffer.wrap(currentWord, charPos, currentWord.length - charPos); Matcher discarding = discard.matcher(found); @@ -232,6 +235,7 @@ currentWord = words[currentWordIx].toCharArray(); termAtt.resizeTermBuffer(currentWord.length); charPos = 0; + found = CharBuffer.wrap(currentWord, charPos, currentWord.length - charPos); softMatcher = subWordBoundary.matcher(words[currentWordIx]); return true; } Modified: 
branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 17:43:05 UTC (rev 8250) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 17:43:16 UTC (rev 8251) @@ -139,7 +139,7 @@ final TermAttribute term = s.getAttribute(TermAttribute.class); final String word = term.term(); assertTrue(ix < expected.length); - assertEquals(word, expected[ix++]); + assertEquals(expected[ix++], word); } assertEquals(ix, expected.length); } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 17:43:05 UTC (rev 8250) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 17:43:16 UTC (rev 8251) @@ -57,6 +57,8 @@ analyzer+"x-terms."+AnalyzerOptions.PATTERN, "\\W+", analyzer+"x-splits."+AnalyzerOptions.ANALYZER_CLASS, TermCompletionAnalyzer.class.getName(), analyzer+"x-splits."+AnalyzerOptions.STOPWORDS, AnalyzerOptions.STOPWORDS_VALUE_NONE, + analyzer+"x-splits."+AnalyzerOptions.WORD_BOUNDARY, " ", + analyzer+"x-splits."+AnalyzerOptions.SUB_WORD_BOUNDARY, "(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})", analyzer+"x-keywords."+AnalyzerOptions.ANALYZER_CLASS, KeywordAnalyzer.class.getName(), analyzer+"ru-x-de."+AnalyzerOptions.ANALYZER_CLASS, RussianAnalyzer.class.getName(), analyzer+"ru-x-de."+AnalyzerOptions.STOPWORDS, GermanAnalyzer.class.getName(), This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
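The configuration wiring above infers `TermCompletionAnalyzer` whenever a `wordBoundary` property is supplied, and validates the soft-hyphen options against it. A sketch of such a configuration with plain `java.util.Properties` follows; the full property prefix is an assumption expanded from the `c.b.s.C` abbreviation in the javadoc, and note that the committed constant values really are spelled `softHypens` and `alwaysRemoveSoftHypens`.

```java
import java.util.Properties;

public class AnalyzerConfigDemo {

    // Assumed expansion of the "c.b.s.C.analyzer." prefix from the javadoc.
    static final String PREFIX =
            "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.";

    /** Properties for a term-completion analyzer on the x-hyphen range. */
    static Properties termCompletionConfig() {
        Properties p = new Properties();
        String a = PREFIX + "x-hyphen.";
        // Presence of wordBoundary selects TermCompletionAnalyzer implicitly.
        p.setProperty(a + "wordBoundary", " ");
        p.setProperty(a + "subWordBoundary", "[-.]");
        // Spellings below match AnalyzerOptions.SOFT_HYPHENS and
        // AnalyzerOptions.ALWAYS_REMOVE_SOFT_HYPHENS as committed.
        p.setProperty(a + "softHypens", "-");
        p.setProperty(a + "alwaysRemoveSoftHypens", "false");
        return p;
    }

    public static void main(String[] args) {
        System.out.println(termCompletionConfig());
    }
}
```

Per the validation logic in `buildAnalyzer`, setting `alwaysRemoveSoftHypens` without `softHypens`, or any of these options without `wordBoundary`, is rejected with a `RuntimeException`.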
From: <jer...@us...> - 2014-05-09 17:43:08
Revision: 8250 http://sourceforge.net/p/bigdata/code/8250 Author: jeremy_carroll Date: 2014-05-09 17:43:05 +0000 (Fri, 09 May 2014) Log Message: ----------- Documented and cleaned up interfaces. Modified Paths: -------------- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 17:42:56 UTC (rev 8249) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 17:43:05 UTC (rev 8250) @@ -29,66 +29,201 @@ import java.io.IOException; import java.io.Reader; import java.io.StringReader; +import java.nio.CharBuffer; import java.util.regex.Matcher; import java.util.regex.Pattern; import org.apache.lucene.analysis.Analyzer; import org.apache.lucene.analysis.TokenStream; +import org.apache.lucene.analysis.KeywordAnalyzer; import org.apache.lucene.analysis.tokenattributes.TermAttribute; +/** + * An analyzer intended for the term-completion use case; particularly + * for technical vocabularies and concept schemes. + * + * <p> + * This analyzer generates several index terms for each word in the input. + * These are intended to match short sequences (e.g. three or more) characters + * of user-input, to then give the user a drop-down list of matching terms. + * <p> + * This can be set up to address issues like matching <q>half-time</q> when the user types + * <q>tim</q> or if the user types <q>halft</q> (treating the hyphen as a soft hyphen); or + * to match <q>TermCompletionAnalyzer</q> when the user types <q>Ana</q> + * <p> + * In contrast, the Lucene Analyzers are mainly geared around the free text search use + * case. 
+ * <p> + * The intended use cases will typical involve a prefix query of the form: + * <pre> + * ?t bds:search "prefix*" . + * </pre> + * to find all literals in the selected graphs, which are indexed by a term starting in <q>prefix</q>, + * so the problem this class addresses is finding the appropriate index terms to allow + * matching, at sensible points, mid-way through words (such as at hyphens). + * <p> + * To get maximum effectiveness it maybe best to use private language subtags (see RFC 5647), + * e.g. <code>"x-term"</code> + * which are mapped to this class by {@link ConfigurableAnalyzerFactory} for + * the data being loaded into the store, and linked to some very simple process + * like {@link KeywordAnalyzer} for queries which are tagged with a different language tag + * that is only used for <code>bds:search</code>, e.g. <code>"x-query"</code>. + * The above prefix query then becomes: + * <pre> + * ?t bds:search "prefix*"@x-query . + * </pre> + * + * + * + * @author jeremycarroll + * + */ public class TermCompletionAnalyzer extends Analyzer { - Pattern hard = Pattern.compile(" ", Pattern.UNICODE_CHARACTER_CLASS); - Pattern soft = Pattern.compile("(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})", Pattern.UNICODE_CHARACTER_CLASS); + private final Pattern wordBoundary; // = Pattern.compile(" ", Pattern.UNICODE_CHARACTER_CLASS); + private final Pattern subWordBoundary; // = Pattern.compile("(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})", Pattern.UNICODE_CHARACTER_CLASS); - public TermCompletionAnalyzer() { + private final Pattern discard; + private final boolean alwaysDiscard; + + /** + * Divide the input into words and short tokens + * as with {@link #TermCompletionAnalyzer(Pattern, Pattern)}. + * If alsoWithSoftHypens is true then output each token, + * and in any case output each token with every + * match to softHyphenEtc deleted. 
+ * + * @param wordBoundary The definition of space (e.g. " ") + * @param subWordBoundary Also index after matches to this (e.g. "-") + * @param softHyphenEtc Discard these characters from matches + * @param alsoWithSoftHyphens If true the discard step is optional. + */ + public TermCompletionAnalyzer(Pattern wordBoundary, + Pattern subWordBoundary, + Pattern softHyphenEtc, + boolean alsoWithSoftHyphens) { + this.wordBoundary = wordBoundary; + this.subWordBoundary = subWordBoundary; + if (softHyphenEtc != null) { + discard = softHyphenEtc; + alwaysDiscard = !alsoWithSoftHyphens; + } else { + discard = Pattern.compile("(?!)"); // never matches + alwaysDiscard = true; + } } + /** + * Divide the input into words, separated by the wordBoundary, + * and return a token for the whole word, and then for the + * remainder of the word after each successive match of the + * subWordBoundary. + * @param wordBoundary + * @param subWordBoundary + */ + public TermCompletionAnalyzer(Pattern wordBoundary, + Pattern subWordBoundary) { + this(wordBoundary, subWordBoundary, null, true); + } + + @Override + public TokenStream tokenStream(String ignoredFieldName, Reader reader) { + return new TermCompletionTokenStream((StringReader)reader); + } + + /** + * This classes has three processes going on + * all driven from the {@link #increment()} method. + * + * One process is that of iterating over the words in the input: + * - the words are identified in the constructor, and the iteration + * is performed by {@link #nextWord()} + * + * - the subword boundaries are identified in {@link #next()} + * We then set up {@link #found} to contain the most + * recently found subword, with afterDiscard containing + * the same word as found with the {@link #discard} pattern + * applied. {@link #afterDiscard} is not equal to found; if there + * is nothing to discard then it is null. 
+ * + * - if we are not {@link #alwaysDiscard}ing then {@link #afterDiscard} + * can be set to null to return the non-discarded version on the next cycle. + * + */ private class TermCompletionTokenStream extends TokenStream { - final int length; final String[] words; + final TermAttribute termAtt; + + + char currentWord[] = new char[]{}; Matcher softMatcher; int currentWordIx = -1; + + int charPos = 0; - final TermAttribute termAtt; + private String afterDiscard; + private CharBuffer found; + public TermCompletionTokenStream(StringReader reader) { termAtt = addAttribute(TermAttribute.class); try { reader.mark(Integer.MAX_VALUE); - length = (int) reader.skip(Integer.MAX_VALUE); + int length = (int) reader.skip(Integer.MAX_VALUE); reader.reset(); char fileContent[] = new char[length]; reader.read(fileContent); - words = hard.split(new String(fileContent)); + words = wordBoundary.split(new String(fileContent)); } catch (IOException e) { throw new RuntimeException("Impossible",e); } } + @Override public boolean incrementToken() throws IOException { if ( next() ) { - int lg = currentWord.length - charPos; - System.arraycopy(currentWord, charPos, termAtt.termBuffer(), 0, lg ); - termAtt.setTermLength(lg); + if (afterDiscard != null) { + int lg = afterDiscard.length(); + afterDiscard.getChars(0, lg, termAtt.termBuffer(), 0); + termAtt.setTermLength(lg); + } else { + found.get(termAtt.termBuffer()); + termAtt.setTermLength(found.length()); + } return true; } else { return false; } } + private boolean next() { if (currentWordIx >= words.length) { return false; } + if (!alwaysDiscard) { + // Last match was the discarded version, + // now do the non-discard version. 
+ if (afterDiscard != null) { + afterDiscard = null; + return true; + } + } + afterDiscard = null; if (charPos +1 < currentWord.length && softMatcher.find(charPos+1)) { charPos = softMatcher.end(); + found = CharBuffer.wrap(currentWord, charPos, currentWord.length - charPos); + Matcher discarding = discard.matcher(found); + if (discarding.find()) { + afterDiscard = discarding.replaceAll(""); + } return true; } else { return nextWord(); } } + private boolean nextWord() { currentWordIx++; if (currentWordIx >= words.length) { @@ -97,15 +232,10 @@ currentWord = words[currentWordIx].toCharArray(); termAtt.resizeTermBuffer(currentWord.length); charPos = 0; - softMatcher = soft.matcher(words[currentWordIx]); + softMatcher = subWordBoundary.matcher(words[currentWordIx]); return true; } } - - @Override - public TokenStream tokenStream(String ignoredFieldName, Reader reader) { - return new TermCompletionTokenStream((StringReader)reader); - } } This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
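The sub-word boundary pattern introduced above — `(?<!\p{L}|\p{N})(?=\p{L}|\p{N})|(?<!\p{Lu})(?=\p{Lu})|(?<=\p{N})(?=\p{L})` — is made of zero-width lookaround assertions, so it marks split points without consuming characters. The suffix loop below mirrors the `softMatcher.find(charPos + 1)` logic of `TermCompletionTokenStream` using only `java.util.regex` (no Lucene); the helper names are illustrative.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class TermCompletionDemo {

    // Matches at letter/digit starts, lowercase-to-uppercase transitions,
    // and digit-to-letter transitions; every branch is zero-width.
    static final Pattern SUB_WORD = Pattern.compile(
            "(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})",
            Pattern.UNICODE_CHARACTER_CLASS);

    /** The whole word, then each suffix starting after a sub-word boundary. */
    static List<String> tokens(String word) {
        List<String> out = new ArrayList<>();
        out.add(word);
        Matcher m = SUB_WORD.matcher(word);
        int pos = 0;
        // find(pos + 1) skips the boundary that trivially matches at offset 0
        while (pos + 1 < word.length() && m.find(pos + 1)) {
            pos = m.end();
            out.add(word.substring(pos));
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(tokens("TermCompletionAnalyzer"));
        // [TermCompletionAnalyzer, CompletionAnalyzer, Analyzer]
        System.out.println(tokens("half-time"));
        // [half-time, time]
    }
}
```

This is what lets a prefix query such as `bds:search "Ana*"` reach `TermCompletionAnalyzer`, or `"tim*"` reach `half-time`, via the mid-word index terms.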
From: <jer...@us...> - 2014-05-09 17:43:00
Revision: 8249 http://sourceforge.net/p/bigdata/code/8249 Author: jeremy_carroll Date: 2014-05-09 17:42:56 +0000 (Fri, 09 May 2014) Log Message: ----------- copyright and tidying up Modified Paths: -------------- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/NonEnglishExamples.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 17:42:44 UTC (rev 8248) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 17:42:56 UTC (rev 8249) @@ -1,3 +1,29 @@ +/** + +Copyright (C) SYSTAP, LLC 2006-2014. All rights reserved. + +Contact: + SYSTAP, LLC + 4501 Tower Road + Greensboro, NC 27410 + lic...@bi... + +This program is free software; you can redistribute it and/or modify +it under the terms of the GNU General Public License as published by +the Free Software Foundation; version 2 of the License. + +This program is distributed in the hope that it will be useful, +but WITHOUT ANY WARRANTY; without even the implied warranty of +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +GNU General Public License for more details. + +You should have received a copy of the GNU General Public License +along with this program; if not, write to the Free Software +Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA +*/ +/* + * Created on May 8, 2014 by Jeremy J. Carroll, Syapse Inc. 
+ */ package com.bigdata.search; import java.io.IOException; @@ -3,5 +29,4 @@ import java.io.Reader; import java.io.StringReader; -import java.nio.CharBuffer; import java.util.regex.Matcher; import java.util.regex.Pattern; @@ -10,7 +35,6 @@ import org.apache.lucene.analysis.Analyzer; import org.apache.lucene.analysis.TokenStream; import org.apache.lucene.analysis.tokenattributes.TermAttribute; -import org.apache.lucene.util.Attribute; public class TermCompletionAnalyzer extends Analyzer { @@ -19,7 +43,6 @@ Pattern soft = Pattern.compile("(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})", Pattern.UNICODE_CHARACTER_CLASS); public TermCompletionAnalyzer() { - // TODO Auto-generated constructor stub } private class TermCompletionTokenStream extends TokenStream { Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 17:42:44 UTC (rev 8248) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 17:42:56 UTC (rev 8249) @@ -122,7 +122,6 @@ protected String getTokenStream(Analyzer a, String text) throws IOException { StringBuffer sb = new StringBuffer(); TokenStream s = a.tokenStream(null, new StringReader(text)); - int ix = 0; while (s.incrementToken()) { final TermAttribute term = s.getAttribute(TermAttribute.class); if (sb.length()!=0) { Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/NonEnglishExamples.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/NonEnglishExamples.java 2014-05-09 17:42:44 UTC (rev 8248) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/NonEnglishExamples.java 2014-05-09 17:42:56 UTC (rev 8249) @@ -1,3 +1,29 @@ +/** + +Copyright (C) SYSTAP, LLC 
2006-2014. All rights reserved. + +Contact: + SYSTAP, LLC + 4501 Tower Road + Greensboro, NC 27410 + lic...@bi... + +This program is free software; you can redistribute it and/or modify +it under the terms of the GNU General Public License as published by +the Free Software Foundation; version 2 of the License. + +This program is distributed in the hope that it will be useful, +but WITHOUT ANY WARRANTY; without even the implied warranty of +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +GNU General Public License for more details. + +You should have received a copy of the GNU General Public License +along with this program; if not, write to the Free Software +Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA +*/ +/* + * Created on May 7, 2014 by Jeremy J. Carroll, Syapse Inc. + */ package com.bigdata.search; import java.util.MissingResourceException; Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 17:42:44 UTC (rev 8248) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 17:42:56 UTC (rev 8249) @@ -28,13 +28,10 @@ import java.io.IOException; -import org.apache.lucene.analysis.Analyzer; import org.apache.lucene.analysis.KeywordAnalyzer; -import org.apache.lucene.analysis.cjk.CJKAnalyzer; import org.apache.lucene.analysis.de.GermanAnalyzer; import org.apache.lucene.analysis.ru.RussianAnalyzer; import org.apache.lucene.analysis.standard.StandardAnalyzer; -import org.apache.lucene.util.Version; import com.bigdata.search.ConfigurableAnalyzerFactory.AnalyzerOptions; This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
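The soft-boundary regex that rev 8249 tidies (and rev 8248 introduced) in `TermCompletionAnalyzer` can be exercised on its own. The sketch below is illustrative, not part of the commit: the class name `SoftSplitDemo` and the helper `suffixes` are hypothetical, but the pattern and the suffix-emission loop mirror the `TermCompletionTokenStream.next()` logic from the diff, reproducing one of the Syapse examples from `TestConfigurableAnalyzerFactory`:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SoftSplitDemo {

    // The soft-boundary pattern from TermCompletionAnalyzer (rev 8248): a match
    // marks a position that starts a new alphanumeric run, starts an upper-case
    // run, or sits on a digit-to-letter transition.
    static final Pattern SOFT = Pattern.compile(
            "(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})",
            Pattern.UNICODE_CHARACTER_CLASS);

    // Mirrors the analyzer's token loop: emit the whole word first, then the
    // suffix starting at each successive soft boundary (Matcher.find(int)
    // returns a zero-width match, so m.end() is the boundary position).
    static List<String> suffixes(String word) {
        List<String> out = new ArrayList<String>();
        out.add(word);
        Matcher m = SOFT.matcher(word);
        int pos = 0;
        while (pos + 1 < word.length() && m.find(pos + 1)) {
            pos = m.end();
            out.add(word.substring(pos));
        }
        return out;
    }

    public static void main(String[] args) {
        // Same tokens as testSyapseExample3 expects:
        // [2,2,3-trimethylbutane, 2,3-trimethylbutane, 3-trimethylbutane, trimethylbutane]
        System.out.println(suffixes("2,2,3-trimethylbutane"));
    }
}
```

Note that only suffixes are emitted: the boundaries are zero-width lookarounds, so no characters are consumed, which is what makes term-completion-style prefix queries against these tokens work.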
From: <jer...@us...> - 2014-05-09 17:42:49
|
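Rev 8248 also changes how `ConfigurableAnalyzerFactory` parses its per-language property keys: the diff replaces `split("[.]")` with `replaceAll("_","*").split("[.]")`, so an underscore in a key stands in for the wildcard language range `*`. The fragment below is a sketch of that parsing, not library code: the class name `LanguageRangeKeyDemo` is hypothetical and the full-length prefix is an assumed expansion of the abbreviated `c.b.s.C.analyzer.` prefix from the javadoc; `pattern` is the option name the diff assigns to `AnalyzerOptions.PATTERN`:

```java
public class LanguageRangeKeyDemo {
    public static void main(String[] args) {
        // Assumed expansion of the "c.b.s.C.analyzer." prefix from the javadoc.
        String prefix = "com.bigdata.search.ConfigurableAnalyzerFactory.analyzer.";

        // "_" in a property key denotes the wildcard language range "*" (rev 8248).
        String prop = prefix + "_.pattern";

        // Same parsing step as the diff: strip the prefix, map "_" back to "*",
        // then split into <language-range> and <option>.
        String[] rangeAndOption =
                prop.substring(prefix.length()).replaceAll("_", "*").split("[.]");

        // The real code additionally lower-cases the range with Locale.US,
        // noting that Turkish "I" could otherwise create a problem.
        System.out.println(rangeAndOption[0]); // *
        System.out.println(rangeAndOption[1]); // pattern
    }
}
```

Using `_` sidesteps putting `*` directly in a `Properties` key, while still letting the factory address the catch-all language range.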
Revision: 8248 http://sourceforge.net/p/bigdata/code/8248 Author: jeremy_carroll Date: 2014-05-09 17:42:44 +0000 (Fri, 09 May 2014) Log Message: ----------- First version of TermCompletionAnalyzer, and also tests now passing Modified Paths: -------------- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestAll.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAsDefaultAnalyzerFactory.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestDefaultAnalyzerFactory.java Added Paths: ----------- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java Modified: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 17:07:05 UTC (rev 8247) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/ConfigurableAnalyzerFactory.java 2014-05-09 17:42:44 UTC (rev 8248) @@ -331,7 +331,7 @@ * (Note the {@link Pattern#UNICODE_CHARACTER_CLASS} flag is enabled). * It is an error if a different analyzer class is specified. 
*/ - String PATTERN = ".pattern"; + String PATTERN = "pattern"; } @@ -474,7 +474,7 @@ */ public Set<?> getStopWords() { - if (AnalyzerOptions.STOPWORDS_VALUE_NONE.equals(stopwords)) + if (doNotUseStopWords()) return Collections.EMPTY_SET; if (useDefaultStopWords()) { @@ -484,6 +484,10 @@ return getStopWordsForClass(stopwords); } + boolean doNotUseStopWords() { + return AnalyzerOptions.STOPWORDS_VALUE_NONE.equals(stopwords) || (stopwords == null && pattern != null); + } + protected Set<?> getStopWordsForClass(String clazzName) { Class<? extends Analyzer> analyzerClass = getAnalyzerClass(clazzName); try { @@ -500,7 +504,7 @@ } protected boolean useDefaultStopWords() { - return stopwords == null || AnalyzerOptions.STOPWORDS_VALUE_DEFAULT.equals(stopwords); + return ( stopwords == null && pattern == null ) || AnalyzerOptions.STOPWORDS_VALUE_DEFAULT.equals(stopwords); } public boolean setProperty(String shortProperty, String value) { @@ -550,8 +554,13 @@ if (hasConstructor(cls, Version.class, Set.class)) { // RussianAnalyzer is missing any way to access stop words. 
- if (RussianAnalyzer.class.equals(cls) && useDefaultStopWords()) { - return new AnalyzerPair(languageRange, new RussianAnalyzer(Version.LUCENE_CURRENT), new RussianAnalyzer(Version.LUCENE_CURRENT, Collections.EMPTY_SET)); + if (RussianAnalyzer.class.equals(cls)) { + if (useDefaultStopWords()) { + return new AnalyzerPair(languageRange, new RussianAnalyzer(Version.LUCENE_CURRENT), new RussianAnalyzer(Version.LUCENE_CURRENT, Collections.EMPTY_SET)); + } + if (doNotUseStopWords()) { + return new AnalyzerPair(languageRange, new RussianAnalyzer(Version.LUCENE_CURRENT, Collections.EMPTY_SET)); + } } return new VersionSetAnalyzerPair(this, cls); } @@ -719,7 +728,7 @@ String prop = (String)en.nextElement(); if (prop.equals(Options.INCLUDE_DEFAULTS)) continue; if (prop.startsWith(Options.ANALYZER)) { - String languageRangeAndProperty[] = prop.substring(Options.ANALYZER.length()).split("[.]"); + String languageRangeAndProperty[] = prop.substring(Options.ANALYZER.length()).replaceAll("_","*").split("[.]"); if (languageRangeAndProperty.length == 2) { String languageRange = languageRangeAndProperty[0].toLowerCase(Locale.US); // Turkish "I" could create a problem Added: branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java (rev 0) +++ branches/TEXT_ANALYZERS/bigdata/src/java/com/bigdata/search/TermCompletionAnalyzer.java 2014-05-09 17:42:44 UTC (rev 8248) @@ -0,0 +1,88 @@ +package com.bigdata.search; + +import java.io.IOException; +import java.io.Reader; +import java.io.StringReader; +import java.nio.CharBuffer; +import java.util.regex.Matcher; +import java.util.regex.Pattern; + +import org.apache.lucene.analysis.Analyzer; +import org.apache.lucene.analysis.TokenStream; +import org.apache.lucene.analysis.tokenattributes.TermAttribute; +import org.apache.lucene.util.Attribute; + + +public 
class TermCompletionAnalyzer extends Analyzer { + + Pattern hard = Pattern.compile(" ", Pattern.UNICODE_CHARACTER_CLASS); + Pattern soft = Pattern.compile("(?<!\\p{L}|\\p{N})(?=\\p{L}|\\p{N})|(?<!\\p{Lu})(?=\\p{Lu})|(?<=\\p{N})(?=\\p{L})", Pattern.UNICODE_CHARACTER_CLASS); + + public TermCompletionAnalyzer() { + // TODO Auto-generated constructor stub + } + + private class TermCompletionTokenStream extends TokenStream { + + final int length; + final String[] words; + char currentWord[] = new char[]{}; + Matcher softMatcher; + int currentWordIx = -1; + int charPos = 0; + final TermAttribute termAtt; + public TermCompletionTokenStream(StringReader reader) { + termAtt = addAttribute(TermAttribute.class); + try { + reader.mark(Integer.MAX_VALUE); + length = (int) reader.skip(Integer.MAX_VALUE); + reader.reset(); + char fileContent[] = new char[length]; + reader.read(fileContent); + words = hard.split(new String(fileContent)); + } catch (IOException e) { + throw new RuntimeException("Impossible",e); + } + } + @Override + public boolean incrementToken() throws IOException { + if ( next() ) { + int lg = currentWord.length - charPos; + System.arraycopy(currentWord, charPos, termAtt.termBuffer(), 0, lg ); + termAtt.setTermLength(lg); + return true; + } else { + return false; + } + } + private boolean next() { + if (currentWordIx >= words.length) { + return false; + } + if (charPos +1 < currentWord.length && softMatcher.find(charPos+1)) { + charPos = softMatcher.end(); + return true; + } else { + return nextWord(); + } + } + private boolean nextWord() { + currentWordIx++; + if (currentWordIx >= words.length) { + return false; + } + currentWord = words[currentWordIx].toCharArray(); + termAtt.resizeTermBuffer(currentWord.length); + charPos = 0; + softMatcher = soft.matcher(words[currentWordIx]); + return true; + } + + } + + + @Override + public TokenStream tokenStream(String ignoredFieldName, Reader reader) { + return new TermCompletionTokenStream((StringReader)reader); + } +} 
Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java 2014-05-09 17:07:05 UTC (rev 8247) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractAnalyzerFactoryTest.java 2014-05-09 17:42:44 UTC (rev 8248) @@ -27,11 +27,7 @@ package com.bigdata.search; import java.io.IOException; -import java.io.StringReader; -import org.apache.lucene.analysis.Analyzer; -import org.apache.lucene.analysis.TokenStream; -import org.apache.lucene.analysis.tokenattributes.TermAttribute; public abstract class AbstractAnalyzerFactoryTest extends AbstractSearchTest { @@ -42,37 +38,16 @@ super(arg0); } + @Override public void setUp() throws Exception { super.setUp(); - init(getExtraProperties()); + init(getExtraProperties()); } + + abstract String[] getExtraProperties(); - private Analyzer getAnalyzer(String lang, boolean filterStopWords) { - return getNdx().getAnalyzer(lang, filterStopWords); - } - - private void comparisonTest(String lang, - boolean stopWordsSignificant, - String text, - String spaceSeparated) throws IOException { - compareTokenStream(getAnalyzer(lang, stopWordsSignificant), text, - spaceSeparated.split(" ")); //$NON-NLS-1$ - } - private void compareTokenStream(Analyzer a, String text, String expected[]) throws IOException { - TokenStream s = a.tokenStream(null, new StringReader(text)); - int ix = 0; - while (s.incrementToken()) { - final TermAttribute term = s.getAttribute(TermAttribute.class); - final String word = term.term(); - assertTrue(ix < expected.length); - assertEquals(word, expected[ix++]); - } - assertEquals(ix, expected.length); - } - - - public void testEnglishFilterStopWords() throws IOException { + public void testEnglishFilterStopWords() throws IOException { for (String lang: new String[]{ "eng", null, "" }) { //$NON-NLS-1$ 
//$NON-NLS-2$ comparisonTest(lang, true, @@ -159,14 +134,20 @@ } private void checkConfig(String classname, String ...langs) { + checkConfig(isBroken(), classname, langs); + + } + protected void checkConfig(boolean threeLetterOnly, String classname, String ...langs) { for (String lang:langs) { // The DefaultAnalyzerFactory only works for language tags of length exactly three. -// if (lang != null && lang.length()==3) + if ((!threeLetterOnly) || (lang != null && lang.length()==3)) { assertEquals(classname, getAnalyzer(lang,true).getClass().getSimpleName()); - assertEquals(classname, getAnalyzer(lang+NonEnglishExamples.getString("AbstractAnalyzerFactoryTest.0"),true).getClass().getSimpleName()); //$NON-NLS-1$ + if (!threeLetterOnly) assertEquals(classname, getAnalyzer(lang+"-x-foobar",true).getClass().getSimpleName()); //$NON-NLS-1$ } } } + + abstract boolean isBroken() ; } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 17:07:05 UTC (rev 8247) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/AbstractSearchTest.java 2014-05-09 17:42:44 UTC (rev 8248) @@ -26,8 +26,14 @@ */ package com.bigdata.search; +import java.io.IOException; +import java.io.StringReader; import java.util.Properties; +import org.apache.lucene.analysis.Analyzer; +import org.apache.lucene.analysis.TokenStream; +import org.apache.lucene.analysis.tokenattributes.TermAttribute; + import com.bigdata.journal.IIndexManager; import com.bigdata.journal.ITx; import com.bigdata.journal.ProxyTestCase; @@ -62,7 +68,7 @@ } FullTextIndex<Long> createFullTextIndex(String namespace, String ...propertyValuePairs) { - return createFullTextIndex(namespace, getProperties(), propertyValuePairs); + return createFullTextIndex(namespace, (Properties)getProperties().clone(), propertyValuePairs); } 
public void tearDown() throws Exception { @@ -92,4 +98,51 @@ return properties; } + protected Analyzer getAnalyzer(String lang, boolean filterStopWords) { + return getNdx().getAnalyzer(lang, filterStopWords); + } + + protected void comparisonTest(String lang, boolean stopWordsSignificant, String text, String spaceSeparated) + throws IOException { + if (spaceSeparated == null) { + String rslt = getTokenStream(getAnalyzer(lang, stopWordsSignificant), text); + throw new RuntimeException("Got \"" + rslt+ "\""); + } + compareTokenStream(getAnalyzer(lang, stopWordsSignificant), text, + split(spaceSeparated)); //$NON-NLS-1$ + } + + private String[] split(String spaceSeparated) { + if (spaceSeparated.length()==0) { + return new String[0]; + } + return spaceSeparated.split(" "); + } + + protected String getTokenStream(Analyzer a, String text) throws IOException { + StringBuffer sb = new StringBuffer(); + TokenStream s = a.tokenStream(null, new StringReader(text)); + int ix = 0; + while (s.incrementToken()) { + final TermAttribute term = s.getAttribute(TermAttribute.class); + if (sb.length()!=0) { + sb.append(" "); + } + sb.append(term.term()); + } + return sb.toString(); + } + + private void compareTokenStream(Analyzer a, String text, String expected[]) throws IOException { + TokenStream s = a.tokenStream(null, new StringReader(text)); + int ix = 0; + while (s.incrementToken()) { + final TermAttribute term = s.getAttribute(TermAttribute.class); + final String word = term.term(); + assertTrue(ix < expected.length); + assertEquals(word, expected[ix++]); + } + assertEquals(ix, expected.length); + } + } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestAll.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestAll.java 2014-05-09 17:07:05 UTC (rev 8247) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestAll.java 2014-05-09 17:42:44 UTC (rev 8248) @@ 
-114,6 +114,7 @@ // which is intended to be the same as the intended // behavior of DefaultAnalyzerFactory suite.addTestSuite(TestConfigurableAsDefaultAnalyzerFactory.class); + suite.addTestSuite(TestConfigurableAnalyzerFactory.class); return suite; } Added: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java (rev 0) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAnalyzerFactory.java 2014-05-09 17:42:44 UTC (rev 8248) @@ -0,0 +1,195 @@ +/** + +Copyright (C) SYSTAP, LLC 2006-2014. All rights reserved. + +Contact: + SYSTAP, LLC + 4501 Tower Road + Greensboro, NC 27410 + lic...@bi... + +This program is free software; you can redistribute it and/or modify +it under the terms of the GNU General Public License as published by +the Free Software Foundation; version 2 of the License. + +This program is distributed in the hope that it will be useful, +but WITHOUT ANY WARRANTY; without even the implied warranty of +MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +GNU General Public License for more details. 
+ +You should have received a copy of the GNU General Public License +along with this program; if not, write to the Free Software +Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA +*/ +/* + * Created on May 7, 2014 + */ +package com.bigdata.search; + +import java.io.IOException; + +import org.apache.lucene.analysis.Analyzer; +import org.apache.lucene.analysis.KeywordAnalyzer; +import org.apache.lucene.analysis.cjk.CJKAnalyzer; +import org.apache.lucene.analysis.de.GermanAnalyzer; +import org.apache.lucene.analysis.ru.RussianAnalyzer; +import org.apache.lucene.analysis.standard.StandardAnalyzer; +import org.apache.lucene.util.Version; + +import com.bigdata.search.ConfigurableAnalyzerFactory.AnalyzerOptions; + +public class TestConfigurableAnalyzerFactory extends AbstractSearchTest { + + public TestConfigurableAnalyzerFactory() { + } + + public TestConfigurableAnalyzerFactory(String arg0) { + super(arg0); + } + + public void setUp() throws Exception { + super.setUp(); + init(getExtraProperties()); + } + + private String[] getExtraProperties() { + String analyzer = ConfigurableAnalyzerFactory.Options.ANALYZER; + return new String[]{ + FullTextIndex.Options.ANALYZER_FACTORY_CLASS, ConfigurableAnalyzerFactory.class.getName(), + analyzer+"*."+AnalyzerOptions.ANALYZER_CLASS, EmptyAnalyzer.class.getName(), + analyzer+"x-terms."+AnalyzerOptions.PATTERN, "\\W+", + analyzer+"x-splits."+AnalyzerOptions.ANALYZER_CLASS, TermCompletionAnalyzer.class.getName(), + analyzer+"x-splits."+AnalyzerOptions.STOPWORDS, AnalyzerOptions.STOPWORDS_VALUE_NONE, + analyzer+"x-keywords."+AnalyzerOptions.ANALYZER_CLASS, KeywordAnalyzer.class.getName(), + analyzer+"ru-x-de."+AnalyzerOptions.ANALYZER_CLASS, RussianAnalyzer.class.getName(), + analyzer+"ru-x-de."+AnalyzerOptions.STOPWORDS, GermanAnalyzer.class.getName(), + }; + } + + private void badCombo(String errorMessage, String ... 
props) { + // Check that some combination of properties on a language create an error + String myProps[] = new String[props.length+4]; + int i=0; + for (; i<props.length;i+=2) { + myProps[i] = ConfigurableAnalyzerFactory.Options.ANALYZER + "x-testme." + props[i]; + myProps[i+1] = props[i+1]; + } + myProps[i] = ConfigurableAnalyzerFactory.Options.ANALYZER + "_." + AnalyzerOptions.ANALYZER_CLASS; + myProps[i+1] = EmptyAnalyzer.class.getName(); + myProps[i+2] = FullTextIndex.Options.ANALYZER_FACTORY_CLASS; + myProps[i+3] = ConfigurableAnalyzerFactory.class.getName(); + try { + this.createFullTextIndex("test-in-error"+getName(), myProps); + } + catch (RuntimeException e) { + Throwable t = e; + while (t.getCause() != null) { + t = t.getCause(); + } + assertTrue(t.getMessage(),t.getMessage().contains(errorMessage)); + return; + } + fail("No error detected"); + } + public void testBadLike() { + badCombo("en-us-x-banana",AnalyzerOptions.LIKE,"en-us-x-banana"); + } + public void testMissingClass() { + badCombo("exactly one",AnalyzerOptions.STOPWORDS,AnalyzerOptions.STOPWORDS_VALUE_DEFAULT); + + } + public void testLikeAndClass() { + badCombo("exactly one",AnalyzerOptions.LIKE,"*", AnalyzerOptions.ANALYZER_CLASS, EmptyAnalyzer.class.getName()); + } + public void testLikeAndStopwords() { + badCombo("stopwords",AnalyzerOptions.LIKE,"*", AnalyzerOptions.STOPWORDS,AnalyzerOptions.STOPWORDS_VALUE_DEFAULT); + } + public void testCantAlwaysHaveStopWords() { + badCombo("not supported", + AnalyzerOptions.ANALYZER_CLASS, EmptyAnalyzer.class.getName(), + AnalyzerOptions.STOPWORDS,StandardAnalyzer.class.getName() + ); + + } + public void testCantAlwaysHaveDefaultStopWords() { + badCombo("not supported", + AnalyzerOptions.ANALYZER_CLASS, EmptyAnalyzer.class.getName(), + AnalyzerOptions.STOPWORDS,AnalyzerOptions.STOPWORDS_VALUE_DEFAULT + ); + + } + public void testCantFindRussianStopWords() { + badCombo("find", + AnalyzerOptions.ANALYZER_CLASS, GermanAnalyzer.class.getName(), + 
AnalyzerOptions.STOPWORDS,RussianAnalyzer.class.getName() + ); + + } + + + public void testEmptyAnalyzer() throws IOException { + comparisonTest("en", + false, + "The fast car arrived slowly.", + "" + ); + + } + public void testSyapseExample1() throws IOException { + comparisonTest("x-splits", + true, + "ADENOCARCINOMA OF LUNG, SOMATIC [ERBB2, INS/DUP, NT2322]", + "ADENOCARCINOMA OF LUNG, SOMATIC [ERBB2, ERBB2, INS/DUP, DUP, NT2322]" + ); + + } + public void testSyapseExample2() throws IOException { + comparisonTest("x-splits", + true, + "\u2265\u2265\u22653-11.13-11.1", + "\u2265\u2265\u22653-11.13-11.1 3-11.13-11.1 11.13-11.1 13-11.1 11.1 1" + ); + + } + public void testSyapseExample4() throws IOException { + comparisonTest("x-splits", + true, + "\u00b1-ACE3.1.1", + "\u00b1-ACE3.1.1 ACE3.1.1 1.1 1" + ); + + } + public void testSyapseExample3() throws IOException { + comparisonTest("x-splits", + true, + "2,2,3-trimethylbutane", + "2,2,3-trimethylbutane 2,3-trimethylbutane 3-trimethylbutane trimethylbutane" + ); + + } + public void testSyapseExample5() throws IOException { + comparisonTest("x-splits", + true, + "CD8_alpha-low Langerhans cell", + "CD8_alpha-low alpha-low low Langerhans cell" + ); + + } + public void testSyapseExample6() throws IOException { + comparisonTest("x-splits", + true, + "6-Monoacetylmorphine:Mass Content:Point in time:Meconium:Quantitative", + "6-Monoacetylmorphine:Mass Monoacetylmorphine:Mass Mass Content:Point Point in time:Meconium:Quantitative Meconium:Quantitative Quantitative" + ); + + } + public void testSyapseExample7() throws IOException { + comparisonTest("x-splits", + true, + "N,N-dimethyl", + "N,N-dimethyl N-dimethyl dimethyl" + ); + + } + +} Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAsDefaultAnalyzerFactory.java =================================================================== --- 
branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAsDefaultAnalyzerFactory.java 2014-05-09 17:07:05 UTC (rev 8247) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestConfigurableAsDefaultAnalyzerFactory.java 2014-05-09 17:42:44 UTC (rev 8248) @@ -40,4 +40,9 @@ return new String[]{FullTextIndex.Options.ANALYZER_FACTORY_CLASS, ConfigurableAnalyzerFactory.class.getName()}; } + @Override + boolean isBroken() { + return false; + } + } Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestDefaultAnalyzerFactory.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestDefaultAnalyzerFactory.java 2014-05-09 17:07:05 UTC (rev 8247) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestDefaultAnalyzerFactory.java 2014-05-09 17:42:44 UTC (rev 8248) @@ -40,4 +40,27 @@ return new String[0]; } + /** + * The DefaultAnalyzerFactory has bizarre behavior concerning + * language specific settings. + * The three letter ISO 639-2 language tags for the languages + * for which Lucene has Analyzers use those Analyzers; whereas the two letter ISO + * language tags, which are the ones recommended by the IETF and the W3C, + * all use the StandardAnalyzer (English). Also a language tag with a subtag + * uses the StandardAnalyzer, even if it is a recognized three letter ISO code. + */ + @Override + boolean isBroken() { + return true; + } + + /** + * Given legacy concerns, we should preserve the incorrect behavior! + */ + public void testIsBroken() { + checkConfig(false, "StandardAnalyzer", + "en", "eng", "", null, "ru", + "pt", "zh", "por-br", "cs", "dut-za", "nl", "de", "gre-at", "el", "th"); + } + }
From: <mrp...@us...> - 2014-05-09 17:07:09
Revision: 8247 http://sourceforge.net/p/bigdata/code/8247 Author: mrpersonick Date: 2014-05-09 17:07:05 +0000 (Fri, 09 May 2014) Log Message: ----------- commented out refs to bigdata/blueprints integration Modified Paths: -------------- branches/BLUEPRINTS/bigdata-sails/src/java/com/bigdata/rdf/sail/webapp/BlueprintsServlet.java Modified: branches/BLUEPRINTS/bigdata-sails/src/java/com/bigdata/rdf/sail/webapp/BlueprintsServlet.java =================================================================== --- branches/BLUEPRINTS/bigdata-sails/src/java/com/bigdata/rdf/sail/webapp/BlueprintsServlet.java 2014-05-09 16:41:58 UTC (rev 8246) +++ branches/BLUEPRINTS/bigdata-sails/src/java/com/bigdata/rdf/sail/webapp/BlueprintsServlet.java 2014-05-09 17:07:05 UTC (rev 8247) @@ -31,11 +31,9 @@ import org.apache.log4j.Logger; -import com.bigdata.blueprints.BigdataGraphBulkLoad; import com.bigdata.rdf.sail.BigdataSailRepositoryConnection; import com.bigdata.rdf.sail.webapp.client.MiniMime; import com.bigdata.rdf.store.AbstractTripleStore; -import com.tinkerpop.blueprints.util.io.graphml.GraphMLReader; /** * Helper servlet for the blueprints layer. @@ -117,14 +115,15 @@ conn = getBigdataRDFContext() .getUnisolatedConnection(namespace); - final BigdataGraphBulkLoad graph = new BigdataGraphBulkLoad(conn); +// final BigdataGraphBulkLoad graph = new BigdataGraphBulkLoad(conn); +// +// GraphMLReader.inputGraph(graph, req.getInputStream()); +// +// graph.commit(); +// +// final long nmodified = graph.getMutationCountLastCommit(); + final long nmodified = 0; - GraphMLReader.inputGraph(graph, req.getInputStream()); - - graph.commit(); - - final long nmodified = graph.getMutationCountLastCommit(); - final long elapsed = System.currentTimeMillis() - begin; reportModifiedCount(resp, nmodified, elapsed);
From: <mrp...@us...> - 2014-05-09 16:42:00
Revision: 8246 http://sourceforge.net/p/bigdata/code/8246 Author: mrpersonick Date: 2014-05-09 16:41:58 +0000 (Fri, 09 May 2014) Log Message: ----------- trying to find a clean build.xml starting point Modified Paths: -------------- branches/BLUEPRINTS/build.xml Modified: branches/BLUEPRINTS/build.xml =================================================================== --- branches/BLUEPRINTS/build.xml 2014-05-09 11:55:20 UTC (rev 8245) +++ branches/BLUEPRINTS/build.xml 2014-05-09 16:41:58 UTC (rev 8246) @@ -62,9 +62,6 @@ <fileset dir="${bigdata.dir}/bigdata-sails/lib"> <include name="**/*.jar" /> </fileset> - <fileset dir="${bigdata.dir}/bigdata-blueprints/lib"> - <include name="blueprints-core-${blueprints.version}.jar" /> - </fileset> <fileset dir="${bigdata.dir}/bigdata-gom/lib"> <include name="**/*.jar" /> </fileset> @@ -232,7 +229,6 @@ <src path="${bigdata.dir}/bigdata-jini/src/java" /> <src path="${bigdata.dir}/bigdata-rdf/src/java" /> <src path="${bigdata.dir}/bigdata-sails/src/java" /> - <src path="${bigdata.dir}/bigdata-blueprints/src/java" /> <src path="${bigdata.dir}/bigdata-gom/src/java" /> <src path="${bigdata.dir}/bigdata-ganglia/src/java" /> <src path="${bigdata.dir}/bigdata-gas/src/java" /> @@ -264,10 +260,6 @@ <exclude name="**/*.java" /> <exclude name="**/package.html" /> </fileset> - <fileset dir="${bigdata.dir}/bigdata-blueprints/src/java"> - <exclude name="**/*.java" /> - <exclude name="**/package.html" /> - </fileset> <fileset dir="${bigdata.dir}/bigdata-gom/src/java"> <exclude name="**/*.java" /> <exclude name="**/package.html" /> @@ -318,7 +310,6 @@ <fileset dir="${bigdata.dir}/bigdata-rdf/src/samples" /> <fileset dir="${bigdata.dir}/bigdata-sails/src/java" /> <fileset dir="${bigdata.dir}/bigdata-sails/src/samples" /> - <fileset dir="${bigdata.dir}/bigdata-blueprints/src/java" /> <fileset dir="${bigdata.dir}/bigdata-gom/src/java" /> <fileset dir="${bigdata.dir}/bigdata-gom/src/samples" /> <fileset 
dir="${bigdata.dir}/ctc-striterators/src/java" /> @@ -376,7 +367,6 @@ <fileset dir="bigdata-jini/src/java" /> <fileset dir="bigdata-rdf/src/java" /> <fileset dir="bigdata-sails/src/java" /> - <fileset dir="bigdata-blueprints/src/java" /> <fileset dir="bigdata-gom/src/java" /> </jar> <bnd output="${build.dir}/bundles/com.bigata-${osgi.version}.jar" classpath="${build.dir}/classes" eclipse="false" failok="false" exceptions="true" files="${basedir}/osgi/bigdata.bnd" /> @@ -416,7 +406,6 @@ <packageset dir="${bigdata.dir}/bigdata-rdf/src/samples" /> <packageset dir="${bigdata.dir}/bigdata-sails/src/java" /> <packageset dir="${bigdata.dir}/bigdata-sails/src/samples" /> - <packageset dir="${bigdata.dir}/bigdata-blueprints/src/java" /> <packageset dir="${bigdata.dir}/bigdata-gom/src/java" /> <packageset dir="${bigdata.dir}/bigdata-gom/src/samples" /> <packageset dir="${bigdata.dir}/bigdata-gas/src/java" /> @@ -457,9 +446,6 @@ <fileset dir="${bigdata.dir}/bigdata-sails/lib"> <include name="**/*.jar" /> </fileset> - <fileset dir="${bigdata.dir}/bigdata-blueprints/lib"> - <include name="blueprints-core-${blueprints.version}.jar" /> - </fileset> <fileset dir="${bigdata.dir}/bigdata-gom/lib"> <include name="**/*.jar" /> </fileset> @@ -572,7 +558,6 @@ <fileset dir="${bigdata.dir}/bigdata" includes="LEGAL/*"/> <fileset dir="${bigdata.dir}/bigdata-rdf" includes="LEGAL/*"/> <fileset dir="${bigdata.dir}/bigdata-sails" includes="LEGAL/*"/> - <fileset dir="${bigdata.dir}/bigdata-blueprints" includes="LEGAL/*"/> <fileset dir="${bigdata.dir}/bigdata-gom" includes="LEGAL/*"/> <fileset dir="${bigdata.dir}/bigdata-jini" includes="LEGAL/*"/> <!-- bigdata jar plus some dependencies as filtered by autojar. 
@@ -682,7 +667,6 @@ <include name="bigdata-jini/LEGAL/*" /> <include name="bigdata-rdf/LEGAL/*" /> <include name="bigdata-sails/LEGAL/*" /> - <include name="bigdata-blueprints/LEGAL/*" /> <include name="bigdata-gom/LEGAL/*" /> </fileset> </copy> @@ -949,7 +933,6 @@ <property name="bigdata-jini.lib" location="${bigdata.dir}/bigdata-jini/lib/jini/lib" /> <property name="bigdata-rdf.lib" location="${bigdata.dir}/bigdata-rdf/lib" /> <property name="bigdata-sails.lib" location="${bigdata.dir}/bigdata-sails/lib" /> - <property name="bigdata-blueprints.lib" location="${bigdata.dir}/bigdata-blueprints/lib" /> <property name="bigdata-gom.lib" location="${bigdata.dir}/bigdata-gom/lib" /> <property name="bigdata-jetty.lib" location="${bigdata.dir}/bigdata/lib/jetty" /> <property name="bigdata-http.lib" location="${bigdata.dir}/bigdata-sails/lib/httpcomponents" /> @@ -993,11 +976,6 @@ <!-- GOM library --> <!-- Note: Nothing yet for GOM --> - <!-- Blueprints library --> - - <copy file="${bigdata-blueprints.lib}/blueprints-core-${blueprints.version}.jar" - tofile="${dist.lib}/blueprints-core.jar" /> - <!-- jetty library --> <copy file="${bigdata-jetty.lib}/jetty-continuation-${jetty.version}.jar" tofile="${dist.lib}/jetty-continuation.jar" /> @@ -1406,9 +1384,6 @@ <copy toDir="${build.dir}/bigdata-sails/src"> <fileset dir="${bigdata.dir}/bigdata-sails/src" /> </copy> - <copy toDir="${build.dir}/bigdata-blueprints/src"> - <fileset dir="${bigdata.dir}/bigdata-blueprints/src" /> - </copy> <copy toDir="${build.dir}/bigdata-gom/src"> <fileset dir="${bigdata.dir}/bigdata-gom/src" /> </copy> @@ -1449,10 +1424,6 @@ <copy toDir="${build.dir}/bigdata-sails/lib"> <fileset dir="${bigdata.dir}/bigdata-sails/lib" /> </copy> - <mkdir dir="${build.dir}/bigdata-blueprints/lib" /> - <copy toDir="${build.dir}/bigdata-blueprints/lib"> - <fileset dir="${bigdata.dir}/bigdata-blueprints/lib" /> - </copy> <mkdir dir="${build.dir}/src" /> <mkdir dir="${build.dir}/src/resources" /> @@ -1506,7 +1477,6 @@ 
<include name="bigdata-jini/src/**" /> <include name="bigdata-rdf/src/**" /> <include name="bigdata-sails/src/**" /> - <include name="bigdata-blueprints/src/**" /> <include name="bigdata-gom/src/**" /> <include name="bigdata-war/src/**" /> <include name="ctc-striterators/src/**" /> @@ -1578,7 +1548,6 @@ <include name="bigdata-jini/LEGAL/*" /> <include name="bigdata-rdf/LEGAL/*" /> <include name="bigdata-sails/LEGAL/*" /> - <include name="bigdata-blueprints/LEGAL/*" /> <include name="bigdata-gom/LEGAL/*" /> </fileset> </copy>
From: <tob...@us...> - 2014-05-09 11:55:24
Revision: 8245 http://sourceforge.net/p/bigdata/code/8245 Author: tobycraig Date: 2014-05-09 11:55:20 +0000 (Fri, 09 May 2014) Log Message: ----------- Added support for syntax highlighting to update panel, switching mode according to selected type/format Modified Paths: -------------- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js Added Paths: ----------- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/javascript.js branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/ntriples.js branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/turtle.js branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/xml.js Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html 2014-05-09 11:33:52 UTC (rev 8244) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html 2014-05-09 11:55:20 UTC (rev 8245) @@ -221,7 +221,11 @@ <script src="/bigdata/html/js/vendor/jquery.hotkeys.js"></script> <script src="/bigdata/html/js/vendor/codemirror.js"></script> <script src="/bigdata/html/js/vendor/cm-addons/placeholder.js"></script> + <script src="/bigdata/html/js/vendor/cm-modes/javascript.js"></script> + <script src="/bigdata/html/js/vendor/cm-modes/ntriples.js"></script> <script src="/bigdata/html/js/vendor/cm-modes/sparql.js"></script> + <script src="/bigdata/html/js/vendor/cm-modes/turtle.js"></script> + <script src="/bigdata/html/js/vendor/cm-modes/xml.js"></script> <script src="/bigdata/html/js/workbench.js"></script> </body> </html> Added: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/javascript.js =================================================================== --- 
branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/javascript.js (rev 0) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/javascript.js 2014-05-09 11:55:20 UTC (rev 8245) @@ -0,0 +1,660 @@ +// TODO actually recognize syntax of TypeScript constructs + +(function(mod) { + if (typeof exports == "object" && typeof module == "object") // CommonJS + mod(require("../../lib/codemirror")); + else if (typeof define == "function" && define.amd) // AMD + define(["../../lib/codemirror"], mod); + else // Plain browser env + mod(CodeMirror); +})(function(CodeMirror) { +"use strict"; + +CodeMirror.defineMode("javascript", function(config, parserConfig) { + var indentUnit = config.indentUnit; + var statementIndent = parserConfig.statementIndent; + var jsonldMode = parserConfig.jsonld; + var jsonMode = parserConfig.json || jsonldMode; + var isTS = parserConfig.typescript; + + // Tokenizer + + var keywords = function(){ + function kw(type) {return {type: type, style: "keyword"};} + var A = kw("keyword a"), B = kw("keyword b"), C = kw("keyword c"); + var operator = kw("operator"), atom = {type: "atom", style: "atom"}; + + var jsKeywords = { + "if": kw("if"), "while": A, "with": A, "else": B, "do": B, "try": B, "finally": B, + "return": C, "break": C, "continue": C, "new": C, "delete": C, "throw": C, "debugger": C, + "var": kw("var"), "const": kw("var"), "let": kw("var"), + "function": kw("function"), "catch": kw("catch"), + "for": kw("for"), "switch": kw("switch"), "case": kw("case"), "default": kw("default"), + "in": operator, "typeof": operator, "instanceof": operator, + "true": atom, "false": atom, "null": atom, "undefined": atom, "NaN": atom, "Infinity": atom, + "this": kw("this"), "module": kw("module"), "class": kw("class"), "super": kw("atom"), + "yield": C, "export": kw("export"), "import": kw("import"), "extends": C + }; + + // Extend the 'normal' keywords with the TypeScript language extensions + if (isTS) { + var type = {type: 
"variable", style: "variable-3"}; + var tsKeywords = { + // object-like things + "interface": kw("interface"), + "extends": kw("extends"), + "constructor": kw("constructor"), + + // scope modifiers + "public": kw("public"), + "private": kw("private"), + "protected": kw("protected"), + "static": kw("static"), + + // types + "string": type, "number": type, "bool": type, "any": type + }; + + for (var attr in tsKeywords) { + jsKeywords[attr] = tsKeywords[attr]; + } + } + + return jsKeywords; + }(); + + var isOperatorChar = /[+\-*&%=<>!?|~^]/; + var isJsonldKeyword = /^@(context|id|value|language|type|container|list|set|reverse|index|base|vocab|graph)"/; + + function readRegexp(stream) { + var escaped = false, next, inSet = false; + while ((next = stream.next()) != null) { + if (!escaped) { + if (next == "/" && !inSet) return; + if (next == "[") inSet = true; + else if (inSet && next == "]") inSet = false; + } + escaped = !escaped && next == "\\"; + } + } + + // Used as scratch variables to communicate multiple values without + // consing up tons of objects. + var type, content; + function ret(tp, style, cont) { + type = tp; content = cont; + return style; + } + function tokenBase(stream, state) { + var ch = stream.next(); + if (ch == '"' || ch == "'") { + state.tokenize = tokenString(ch); + return state.tokenize(stream, state); + } else if (ch == "." && stream.match(/^\d+(?:[eE][+\-]?\d+)?/)) { + return ret("number", "number"); + } else if (ch == "." 
&& stream.match("..")) { + return ret("spread", "meta"); + } else if (/[\[\]{}\(\),;\:\.]/.test(ch)) { + return ret(ch); + } else if (ch == "=" && stream.eat(">")) { + return ret("=>", "operator"); + } else if (ch == "0" && stream.eat(/x/i)) { + stream.eatWhile(/[\da-f]/i); + return ret("number", "number"); + } else if (/\d/.test(ch)) { + stream.match(/^\d*(?:\.\d*)?(?:[eE][+\-]?\d+)?/); + return ret("number", "number"); + } else if (ch == "/") { + if (stream.eat("*")) { + state.tokenize = tokenComment; + return tokenComment(stream, state); + } else if (stream.eat("/")) { + stream.skipToEnd(); + return ret("comment", "comment"); + } else if (state.lastType == "operator" || state.lastType == "keyword c" || + state.lastType == "sof" || /^[\[{}\(,;:]$/.test(state.lastType)) { + readRegexp(stream); + stream.eatWhile(/[gimy]/); // 'y' is "sticky" option in Mozilla + return ret("regexp", "string-2"); + } else { + stream.eatWhile(isOperatorChar); + return ret("operator", "operator", stream.current()); + } + } else if (ch == "`") { + state.tokenize = tokenQuasi; + return tokenQuasi(stream, state); + } else if (ch == "#") { + stream.skipToEnd(); + return ret("error", "error"); + } else if (isOperatorChar.test(ch)) { + stream.eatWhile(isOperatorChar); + return ret("operator", "operator", stream.current()); + } else { + stream.eatWhile(/[\w\$_]/); + var word = stream.current(), known = keywords.propertyIsEnumerable(word) && keywords[word]; + return (known && state.lastType != ".") ? 
ret(known.type, known.style, word) : + ret("variable", "variable", word); + } + } + + function tokenString(quote) { + return function(stream, state) { + var escaped = false, next; + if (jsonldMode && stream.peek() == "@" && stream.match(isJsonldKeyword)){ + state.tokenize = tokenBase; + return ret("jsonld-keyword", "meta"); + } + while ((next = stream.next()) != null) { + if (next == quote && !escaped) break; + escaped = !escaped && next == "\\"; + } + if (!escaped) state.tokenize = tokenBase; + return ret("string", "string"); + }; + } + + function tokenComment(stream, state) { + var maybeEnd = false, ch; + while (ch = stream.next()) { + if (ch == "/" && maybeEnd) { + state.tokenize = tokenBase; + break; + } + maybeEnd = (ch == "*"); + } + return ret("comment", "comment"); + } + + function tokenQuasi(stream, state) { + var escaped = false, next; + while ((next = stream.next()) != null) { + if (!escaped && (next == "`" || next == "$" && stream.eat("{"))) { + state.tokenize = tokenBase; + break; + } + escaped = !escaped && next == "\\"; + } + return ret("quasi", "string-2", stream.current()); + } + + var brackets = "([{}])"; + // This is a crude lookahead trick to try and notice that we're + // parsing the argument patterns for a fat-arrow function before we + // actually hit the arrow token. It only works if the arrow is on + // the same line as the arguments and there's no strange noise + // (comments) in between. Fallback is to only notice when we hit the + // arrow, and not declare the arguments as locals for the arrow + // body. 
+ function findFatArrow(stream, state) { + if (state.fatArrowAt) state.fatArrowAt = null; + var arrow = stream.string.indexOf("=>", stream.start); + if (arrow < 0) return; + + var depth = 0, sawSomething = false; + for (var pos = arrow - 1; pos >= 0; --pos) { + var ch = stream.string.charAt(pos); + var bracket = brackets.indexOf(ch); + if (bracket >= 0 && bracket < 3) { + if (!depth) { ++pos; break; } + if (--depth == 0) break; + } else if (bracket >= 3 && bracket < 6) { + ++depth; + } else if (/[$\w]/.test(ch)) { + sawSomething = true; + } else if (sawSomething && !depth) { + ++pos; + break; + } + } + if (sawSomething && !depth) state.fatArrowAt = pos; + } + + // Parser + + var atomicTypes = {"atom": true, "number": true, "variable": true, "string": true, "regexp": true, "this": true, "jsonld-keyword": true}; + + function JSLexical(indented, column, type, align, prev, info) { + this.indented = indented; + this.column = column; + this.type = type; + this.prev = prev; + this.info = info; + if (align != null) this.align = align; + } + + function inScope(state, varname) { + for (var v = state.localVars; v; v = v.next) + if (v.name == varname) return true; + for (var cx = state.context; cx; cx = cx.prev) { + for (var v = cx.vars; v; v = v.next) + if (v.name == varname) return true; + } + } + + function parseJS(state, style, type, content, stream) { + var cc = state.cc; + // Communicate our context to the combinators. + // (Less wasteful than consing up a hundred closures on every call.) + cx.state = state; cx.stream = stream; cx.marked = null, cx.cc = cc; + + if (!state.lexical.hasOwnProperty("align")) + state.lexical.align = true; + + while(true) { + var combinator = cc.length ? cc.pop() : jsonMode ? 
expression : statement; + if (combinator(type, content)) { + while(cc.length && cc[cc.length - 1].lex) + cc.pop()(); + if (cx.marked) return cx.marked; + if (type == "variable" && inScope(state, content)) return "variable-2"; + return style; + } + } + } + + // Combinator utils + + var cx = {state: null, column: null, marked: null, cc: null}; + function pass() { + for (var i = arguments.length - 1; i >= 0; i--) cx.cc.push(arguments[i]); + } + function cont() { + pass.apply(null, arguments); + return true; + } + function register(varname) { + function inList(list) { + for (var v = list; v; v = v.next) + if (v.name == varname) return true; + return false; + } + var state = cx.state; + if (state.context) { + cx.marked = "def"; + if (inList(state.localVars)) return; + state.localVars = {name: varname, next: state.localVars}; + } else { + if (inList(state.globalVars)) return; + if (parserConfig.globalVars) + state.globalVars = {name: varname, next: state.globalVars}; + } + } + + // Combinators + + var defaultVars = {name: "this", next: {name: "arguments"}}; + function pushcontext() { + cx.state.context = {prev: cx.state.context, vars: cx.state.localVars}; + cx.state.localVars = defaultVars; + } + function popcontext() { + cx.state.localVars = cx.state.context.vars; + cx.state.context = cx.state.context.prev; + } + function pushlex(type, info) { + var result = function() { + var state = cx.state, indent = state.indented; + if (state.lexical.type == "stat") indent = state.lexical.indented; + state.lexical = new JSLexical(indent, cx.stream.column(), type, null, state.lexical, info); + }; + result.lex = true; + return result; + } + function poplex() { + var state = cx.state; + if (state.lexical.prev) { + if (state.lexical.type == ")") + state.indented = state.lexical.indented; + state.lexical = state.lexical.prev; + } + } + poplex.lex = true; + + function expect(wanted) { + function exp(type) { + if (type == wanted) return cont(); + else if (wanted == ";") return pass(); + 
else return cont(exp); + }; + return exp; + } + + function statement(type, value) { + if (type == "var") return cont(pushlex("vardef", value.length), vardef, expect(";"), poplex); + if (type == "keyword a") return cont(pushlex("form"), expression, statement, poplex); + if (type == "keyword b") return cont(pushlex("form"), statement, poplex); + if (type == "{") return cont(pushlex("}"), block, poplex); + if (type == ";") return cont(); + if (type == "if") { + if (cx.state.lexical.info == "else" && cx.state.cc[cx.state.cc.length - 1] == poplex) + cx.state.cc.pop()(); + return cont(pushlex("form"), expression, statement, poplex, maybeelse); + } + if (type == "function") return cont(functiondef); + if (type == "for") return cont(pushlex("form"), forspec, statement, poplex); + if (type == "variable") return cont(pushlex("stat"), maybelabel); + if (type == "switch") return cont(pushlex("form"), expression, pushlex("}", "switch"), expect("{"), + block, poplex, poplex); + if (type == "case") return cont(expression, expect(":")); + if (type == "default") return cont(expect(":")); + if (type == "catch") return cont(pushlex("form"), pushcontext, expect("("), funarg, expect(")"), + statement, poplex, popcontext); + if (type == "module") return cont(pushlex("form"), pushcontext, afterModule, popcontext, poplex); + if (type == "class") return cont(pushlex("form"), className, objlit, poplex); + if (type == "export") return cont(pushlex("form"), afterExport, poplex); + if (type == "import") return cont(pushlex("form"), afterImport, poplex); + return pass(pushlex("stat"), expression, expect(";"), poplex); + } + function expression(type) { + return expressionInner(type, false); + } + function expressionNoComma(type) { + return expressionInner(type, true); + } + function expressionInner(type, noComma) { + if (cx.state.fatArrowAt == cx.stream.start) { + var body = noComma ? 
arrowBodyNoComma : arrowBody; + if (type == "(") return cont(pushcontext, pushlex(")"), commasep(pattern, ")"), poplex, expect("=>"), body, popcontext); + else if (type == "variable") return pass(pushcontext, pattern, expect("=>"), body, popcontext); + } + + var maybeop = noComma ? maybeoperatorNoComma : maybeoperatorComma; + if (atomicTypes.hasOwnProperty(type)) return cont(maybeop); + if (type == "function") return cont(functiondef, maybeop); + if (type == "keyword c") return cont(noComma ? maybeexpressionNoComma : maybeexpression); + if (type == "(") return cont(pushlex(")"), maybeexpression, comprehension, expect(")"), poplex, maybeop); + if (type == "operator" || type == "spread") return cont(noComma ? expressionNoComma : expression); + if (type == "[") return cont(pushlex("]"), arrayLiteral, poplex, maybeop); + if (type == "{") return contCommasep(objprop, "}", null, maybeop); + if (type == "quasi") { return pass(quasi, maybeop); } + return cont(); + } + function maybeexpression(type) { + if (type.match(/[;\}\)\],]/)) return pass(); + return pass(expression); + } + function maybeexpressionNoComma(type) { + if (type.match(/[;\}\)\],]/)) return pass(); + return pass(expressionNoComma); + } + + function maybeoperatorComma(type, value) { + if (type == ",") return cont(expression); + return maybeoperatorNoComma(type, value, false); + } + function maybeoperatorNoComma(type, value, noComma) { + var me = noComma == false ? maybeoperatorComma : maybeoperatorNoComma; + var expr = noComma == false ? expression : expressionNoComma; + if (value == "=>") return cont(pushcontext, noComma ? 
arrowBodyNoComma : arrowBody, popcontext); + if (type == "operator") { + if (/\+\+|--/.test(value)) return cont(me); + if (value == "?") return cont(expression, expect(":"), expr); + return cont(expr); + } + if (type == "quasi") { return pass(quasi, me); } + if (type == ";") return; + if (type == "(") return contCommasep(expressionNoComma, ")", "call", me); + if (type == ".") return cont(property, me); + if (type == "[") return cont(pushlex("]"), maybeexpression, expect("]"), poplex, me); + } + function quasi(type, value) { + if (type != "quasi") return pass(); + if (value.slice(value.length - 2) != "${") return cont(quasi); + return cont(expression, continueQuasi); + } + function continueQuasi(type) { + if (type == "}") { + cx.marked = "string-2"; + cx.state.tokenize = tokenQuasi; + return cont(quasi); + } + } + function arrowBody(type) { + findFatArrow(cx.stream, cx.state); + if (type == "{") return pass(statement); + return pass(expression); + } + function arrowBodyNoComma(type) { + findFatArrow(cx.stream, cx.state); + if (type == "{") return pass(statement); + return pass(expressionNoComma); + } + function maybelabel(type) { + if (type == ":") return cont(poplex, statement); + return pass(maybeoperatorComma, expect(";"), poplex); + } + function property(type) { + if (type == "variable") {cx.marked = "property"; return cont();} + } + function objprop(type, value) { + if (type == "variable") { + cx.marked = "property"; + if (value == "get" || value == "set") return cont(getterSetter); + } else if (type == "number" || type == "string") { + cx.marked = jsonldMode ? 
"property" : (type + " property"); + } else if (type == "[") { + return cont(expression, expect("]"), afterprop); + } + if (atomicTypes.hasOwnProperty(type)) return cont(afterprop); + } + function getterSetter(type) { + if (type != "variable") return pass(afterprop); + cx.marked = "property"; + return cont(functiondef); + } + function afterprop(type) { + if (type == ":") return cont(expressionNoComma); + if (type == "(") return pass(functiondef); + } + function commasep(what, end) { + function proceed(type) { + if (type == ",") { + var lex = cx.state.lexical; + if (lex.info == "call") lex.pos = (lex.pos || 0) + 1; + return cont(what, proceed); + } + if (type == end) return cont(); + return cont(expect(end)); + } + return function(type) { + if (type == end) return cont(); + return pass(what, proceed); + }; + } + function contCommasep(what, end, info) { + for (var i = 3; i < arguments.length; i++) + cx.cc.push(arguments[i]); + return cont(pushlex(end, info), commasep(what, end), poplex); + } + function block(type) { + if (type == "}") return cont(); + return pass(statement, block); + } + function maybetype(type) { + if (isTS && type == ":") return cont(typedef); + } + function typedef(type) { + if (type == "variable"){cx.marked = "variable-3"; return cont();} + } + function vardef() { + return pass(pattern, maybetype, maybeAssign, vardefCont); + } + function pattern(type, value) { + if (type == "variable") { register(value); return cont(); } + if (type == "[") return contCommasep(pattern, "]"); + if (type == "{") return contCommasep(proppattern, "}"); + } + function proppattern(type, value) { + if (type == "variable" && !cx.stream.match(/^\s*:/, false)) { + register(value); + return cont(maybeAssign); + } + if (type == "variable") cx.marked = "property"; + return cont(expect(":"), pattern, maybeAssign); + } + function maybeAssign(_type, value) { + if (value == "=") return cont(expressionNoComma); + } + function vardefCont(type) { + if (type == ",") return 
cont(vardef); + } + function maybeelse(type, value) { + if (type == "keyword b" && value == "else") return cont(pushlex("form", "else"), statement, poplex); + } + function forspec(type) { + if (type == "(") return cont(pushlex(")"), forspec1, expect(")"), poplex); + } + function forspec1(type) { + if (type == "var") return cont(vardef, expect(";"), forspec2); + if (type == ";") return cont(forspec2); + if (type == "variable") return cont(formaybeinof); + return pass(expression, expect(";"), forspec2); + } + function formaybeinof(_type, value) { + if (value == "in" || value == "of") { cx.marked = "keyword"; return cont(expression); } + return cont(maybeoperatorComma, forspec2); + } + function forspec2(type, value) { + if (type == ";") return cont(forspec3); + if (value == "in" || value == "of") { cx.marked = "keyword"; return cont(expression); } + return pass(expression, expect(";"), forspec3); + } + function forspec3(type) { + if (type != ")") cont(expression); + } + function functiondef(type, value) { + if (value == "*") {cx.marked = "keyword"; return cont(functiondef);} + if (type == "variable") {register(value); return cont(functiondef);} + if (type == "(") return cont(pushcontext, pushlex(")"), commasep(funarg, ")"), poplex, statement, popcontext); + } + function funarg(type) { + if (type == "spread") return cont(funarg); + return pass(pattern, maybetype); + } + function className(type, value) { + if (type == "variable") {register(value); return cont(classNameAfter);} + } + function classNameAfter(_type, value) { + if (value == "extends") return cont(expression); + } + function objlit(type) { + if (type == "{") return contCommasep(objprop, "}"); + } + function afterModule(type, value) { + if (type == "string") return cont(statement); + if (type == "variable") { register(value); return cont(maybeFrom); } + } + function afterExport(_type, value) { + if (value == "*") { cx.marked = "keyword"; return cont(maybeFrom, expect(";")); } + if (value == "default") { 
cx.marked = "keyword"; return cont(expression, expect(";")); } + return pass(statement); + } + function afterImport(type) { + if (type == "string") return cont(); + return pass(importSpec, maybeFrom); + } + function importSpec(type, value) { + if (type == "{") return contCommasep(importSpec, "}"); + if (type == "variable") register(value); + return cont(); + } + function maybeFrom(_type, value) { + if (value == "from") { cx.marked = "keyword"; return cont(expression); } + } + function arrayLiteral(type) { + if (type == "]") return cont(); + return pass(expressionNoComma, maybeArrayComprehension); + } + function maybeArrayComprehension(type) { + if (type == "for") return pass(comprehension, expect("]")); + if (type == ",") return cont(commasep(expressionNoComma, "]")); + return pass(commasep(expressionNoComma, "]")); + } + function comprehension(type) { + if (type == "for") return cont(forspec, comprehension); + if (type == "if") return cont(expression, comprehension); + } + + // Interface + + return { + startState: function(basecolumn) { + var state = { + tokenize: tokenBase, + lastType: "sof", + cc: [], + lexical: new JSLexical((basecolumn || 0) - indentUnit, 0, "block", false), + localVars: parserConfig.localVars, + context: parserConfig.localVars && {vars: parserConfig.localVars}, + indented: 0 + }; + if (parserConfig.globalVars && typeof parserConfig.globalVars == "object") + state.globalVars = parserConfig.globalVars; + return state; + }, + + token: function(stream, state) { + if (stream.sol()) { + if (!state.lexical.hasOwnProperty("align")) + state.lexical.align = false; + state.indented = stream.indentation(); + findFatArrow(stream, state); + } + if (state.tokenize != tokenComment && stream.eatSpace()) return null; + var style = state.tokenize(stream, state); + if (type == "comment") return style; + state.lastType = type == "operator" && (content == "++" || content == "--") ? 
"incdec" : type; + return parseJS(state, style, type, content, stream); + }, + + indent: function(state, textAfter) { + if (state.tokenize == tokenComment) return CodeMirror.Pass; + if (state.tokenize != tokenBase) return 0; + var firstChar = textAfter && textAfter.charAt(0), lexical = state.lexical; + // Kludge to prevent 'maybelse' from blocking lexical scope pops + if (!/^\s*else\b/.test(textAfter)) for (var i = state.cc.length - 1; i >= 0; --i) { + var c = state.cc[i]; + if (c == poplex) lexical = lexical.prev; + else if (c != maybeelse) break; + } + if (lexical.type == "stat" && firstChar == "}") lexical = lexical.prev; + if (statementIndent && lexical.type == ")" && lexical.prev.type == "stat") + lexical = lexical.prev; + var type = lexical.type, closing = firstChar == type; + + if (type == "vardef") return lexical.indented + (state.lastType == "operator" || state.lastType == "," ? lexical.info + 1 : 0); + else if (type == "form" && firstChar == "{") return lexical.indented; + else if (type == "form") return lexical.indented + indentUnit; + else if (type == "stat") + return lexical.indented + (state.lastType == "operator" || state.lastType == "," ? statementIndent || indentUnit : 0); + else if (lexical.info == "switch" && !closing && parserConfig.doubleIndentSwitch != false) + return lexical.indented + (/^(?:case|default)\b/.test(textAfter) ? indentUnit : 2 * indentUnit); + else if (lexical.align) return lexical.column + (closing ? 0 : 1); + else return lexical.indented + (closing ? 0 : indentUnit); + }, + + electricChars: ":{}", + blockCommentStart: jsonMode ? null : "/*", + blockCommentEnd: jsonMode ? null : "*/", + lineComment: jsonMode ? null : "//", + fold: "brace", + + helperType: jsonMode ? 
"json" : "javascript", + jsonldMode: jsonldMode, + jsonMode: jsonMode + }; +}); + +CodeMirror.registerHelper("wordChars", "javascript", /[\\w$]/); + +CodeMirror.defineMIME("text/javascript", "javascript"); +CodeMirror.defineMIME("text/ecmascript", "javascript"); +CodeMirror.defineMIME("application/javascript", "javascript"); +CodeMirror.defineMIME("application/ecmascript", "javascript"); +CodeMirror.defineMIME("application/json", {name: "javascript", json: true}); +CodeMirror.defineMIME("application/x-json", {name: "javascript", json: true}); +CodeMirror.defineMIME("application/ld+json", {name: "javascript", jsonld: true}); +CodeMirror.defineMIME("text/typescript", { name: "javascript", typescript: true }); +CodeMirror.defineMIME("application/typescript", { name: "javascript", typescript: true }); + +}); Added: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/ntriples.js =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/ntriples.js (rev 0) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/ntriples.js 2014-05-09 11:55:20 UTC (rev 8245) @@ -0,0 +1,183 @@ +/********************************************************** +* This script provides syntax highlighting support for +* the Ntriples format. +* Ntriples format specification: +* http://www.w3.org/TR/rdf-testcases/#ntriples +***********************************************************/ + +/* + The following expression defines the defined ASF grammar transitions. 
+ + pre_subject -> + { + ( writing_subject_uri | writing_bnode_uri ) + -> pre_predicate + -> writing_predicate_uri + -> pre_object + -> writing_object_uri | writing_object_bnode | + ( + writing_object_literal + -> writing_literal_lang | writing_literal_type + ) + -> post_object + -> BEGIN + } otherwise { + -> ERROR + } +*/ + +(function(mod) { + if (typeof exports == "object" && typeof module == "object") // CommonJS + mod(require("../../lib/codemirror")); + else if (typeof define == "function" && define.amd) // AMD + define(["../../lib/codemirror"], mod); + else // Plain browser env + mod(CodeMirror); +})(function(CodeMirror) { +"use strict"; + +CodeMirror.defineMode("ntriples", function() { + + var Location = { + PRE_SUBJECT : 0, + WRITING_SUB_URI : 1, + WRITING_BNODE_URI : 2, + PRE_PRED : 3, + WRITING_PRED_URI : 4, + PRE_OBJ : 5, + WRITING_OBJ_URI : 6, + WRITING_OBJ_BNODE : 7, + WRITING_OBJ_LITERAL : 8, + WRITING_LIT_LANG : 9, + WRITING_LIT_TYPE : 10, + POST_OBJ : 11, + ERROR : 12 + }; + function transitState(currState, c) { + var currLocation = currState.location; + var ret; + + // Opening. + if (currLocation == Location.PRE_SUBJECT && c == '<') ret = Location.WRITING_SUB_URI; + else if(currLocation == Location.PRE_SUBJECT && c == '_') ret = Location.WRITING_BNODE_URI; + else if(currLocation == Location.PRE_PRED && c == '<') ret = Location.WRITING_PRED_URI; + else if(currLocation == Location.PRE_OBJ && c == '<') ret = Location.WRITING_OBJ_URI; + else if(currLocation == Location.PRE_OBJ && c == '_') ret = Location.WRITING_OBJ_BNODE; + else if(currLocation == Location.PRE_OBJ && c == '"') ret = Location.WRITING_OBJ_LITERAL; + + // Closing. 
+ else if(currLocation == Location.WRITING_SUB_URI && c == '>') ret = Location.PRE_PRED; + else if(currLocation == Location.WRITING_BNODE_URI && c == ' ') ret = Location.PRE_PRED; + else if(currLocation == Location.WRITING_PRED_URI && c == '>') ret = Location.PRE_OBJ; + else if(currLocation == Location.WRITING_OBJ_URI && c == '>') ret = Location.POST_OBJ; + else if(currLocation == Location.WRITING_OBJ_BNODE && c == ' ') ret = Location.POST_OBJ; + else if(currLocation == Location.WRITING_OBJ_LITERAL && c == '"') ret = Location.POST_OBJ; + else if(currLocation == Location.WRITING_LIT_LANG && c == ' ') ret = Location.POST_OBJ; + else if(currLocation == Location.WRITING_LIT_TYPE && c == '>') ret = Location.POST_OBJ; + + // Closing typed and language literal. + else if(currLocation == Location.WRITING_OBJ_LITERAL && c == '@') ret = Location.WRITING_LIT_LANG; + else if(currLocation == Location.WRITING_OBJ_LITERAL && c == '^') ret = Location.WRITING_LIT_TYPE; + + // Spaces. + else if( c == ' ' && + ( + currLocation == Location.PRE_SUBJECT || + currLocation == Location.PRE_PRED || + currLocation == Location.PRE_OBJ || + currLocation == Location.POST_OBJ + ) + ) ret = currLocation; + + // Reset. 
+ else if(currLocation == Location.POST_OBJ && c == '.') ret = Location.PRE_SUBJECT; + + // Error + else ret = Location.ERROR; + + currState.location=ret; + } + + return { + startState: function() { + return { + location : Location.PRE_SUBJECT, + uris : [], + anchors : [], + bnodes : [], + langs : [], + types : [] + }; + }, + token: function(stream, state) { + var ch = stream.next(); + if(ch == '<') { + transitState(state, ch); + var parsedURI = ''; + stream.eatWhile( function(c) { if( c != '#' && c != '>' ) { parsedURI += c; return true; } return false;} ); + state.uris.push(parsedURI); + if( stream.match('#', false) ) return 'variable'; + stream.next(); + transitState(state, '>'); + return 'variable'; + } + if(ch == '#') { + var parsedAnchor = ''; + stream.eatWhile(function(c) { if(c != '>' && c != ' ') { parsedAnchor+= c; return true; } return false;}); + state.anchors.push(parsedAnchor); + return 'variable-2'; + } + if(ch == '>') { + transitState(state, '>'); + return 'variable'; + } + if(ch == '_') { + transitState(state, ch); + var parsedBNode = ''; + stream.eatWhile(function(c) { if( c != ' ' ) { parsedBNode += c; return true; } return false;}); + state.bnodes.push(parsedBNode); + stream.next(); + transitState(state, ' '); + return 'builtin'; + } + if(ch == '"') { + transitState(state, ch); + stream.eatWhile( function(c) { return c != '"'; } ); + stream.next(); + if( stream.peek() != '@' && stream.peek() != '^' ) { + transitState(state, '"'); + } + return 'string'; + } + if( ch == '@' ) { + transitState(state, '@'); + var parsedLang = ''; + stream.eatWhile(function(c) { if( c != ' ' ) { parsedLang += c; return true; } return false;}); + state.langs.push(parsedLang); + stream.next(); + transitState(state, ' '); + return 'string-2'; + } + if( ch == '^' ) { + stream.next(); + transitState(state, '^'); + var parsedType = ''; + stream.eatWhile(function(c) { if( c != '>' ) { parsedType += c; return true; } return false;} ); + state.types.push(parsedType); + 
stream.next(); + transitState(state, '>'); + return 'variable'; + } + if( ch == ' ' ) { + transitState(state, ch); + } + if( ch == '.' ) { + transitState(state, ch); + } + } + }; +}); + +CodeMirror.defineMIME("text/n-triples", "ntriples"); + +}); Added: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/turtle.js =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/turtle.js (rev 0) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/turtle.js 2014-05-09 11:55:20 UTC (rev 8245) @@ -0,0 +1,157 @@ +(function(mod) { + if (typeof exports == "object" && typeof module == "object") // CommonJS + mod(require("../../lib/codemirror")); + else if (typeof define == "function" && define.amd) // AMD + define(["../../lib/codemirror"], mod); + else // Plain browser env + mod(CodeMirror); +})(function(CodeMirror) { +"use strict"; + +CodeMirror.defineMode("turtle", function(config) { + var indentUnit = config.indentUnit; + var curPunc; + + function wordRegexp(words) { + return new RegExp("^(?:" + words.join("|") + ")$", "i"); + } + var ops = wordRegexp([]); + var keywords = wordRegexp(["@prefix", "@base", "a"]); + var operatorChars = /[*+\-<>=&|]/; + + function tokenBase(stream, state) { + var ch = stream.next(); + curPunc = null; + if (ch == "<" && !stream.match(/^[\s\u00a0=]/, false)) { + stream.match(/^[^\s\u00a0>]*>?/); + return "atom"; + } + else if (ch == "\"" || ch == "'") { + state.tokenize = tokenLiteral(ch); + return state.tokenize(stream, state); + } + else if (/[{}\(\),\.;\[\]]/.test(ch)) { + curPunc = ch; + return null; + } + else if (ch == "#") { + stream.skipToEnd(); + return "comment"; + } + else if (operatorChars.test(ch)) { + stream.eatWhile(operatorChars); + return null; + } + else if (ch == ":") { + return "operator"; + } else { + stream.eatWhile(/[_\w\d]/); + if(stream.peek() == ":") { + return "variable-3"; + } else { + var word = 
stream.current(); + + if(keywords.test(word)) { + return "meta"; + } + + if(ch >= "A" && ch <= "Z") { + return "comment"; + } else { + return "keyword"; + } + } + var word = stream.current(); + if (ops.test(word)) + return null; + else if (keywords.test(word)) + return "meta"; + else + return "variable"; + } + } + + function tokenLiteral(quote) { + return function(stream, state) { + var escaped = false, ch; + while ((ch = stream.next()) != null) { + if (ch == quote && !escaped) { + state.tokenize = tokenBase; + break; + } + escaped = !escaped && ch == "\\"; + } + return "string"; + }; + } + + function pushContext(state, type, col) { + state.context = {prev: state.context, indent: state.indent, col: col, type: type}; + } + function popContext(state) { + state.indent = state.context.indent; + state.context = state.context.prev; + } + + return { + startState: function() { + return {tokenize: tokenBase, + context: null, + indent: 0, + col: 0}; + }, + + token: function(stream, state) { + if (stream.sol()) { + if (state.context && state.context.align == null) state.context.align = false; + state.indent = stream.indentation(); + } + if (stream.eatSpace()) return null; + var style = state.tokenize(stream, state); + + if (style != "comment" && state.context && state.context.align == null && state.context.type != "pattern") { + state.context.align = true; + } + + if (curPunc == "(") pushContext(state, ")", stream.column()); + else if (curPunc == "[") pushContext(state, "]", stream.column()); + else if (curPunc == "{") pushContext(state, "}", stream.column()); + else if (/[\]\}\)]/.test(curPunc)) { + while (state.context && state.context.type == "pattern") popContext(state); + if (state.context && curPunc == state.context.type) popContext(state); + } + else if (curPunc == "." 
&& state.context && state.context.type == "pattern") popContext(state); + else if (/atom|string|variable/.test(style) && state.context) { + if (/[\}\]]/.test(state.context.type)) + pushContext(state, "pattern", stream.column()); + else if (state.context.type == "pattern" && !state.context.align) { + state.context.align = true; + state.context.col = stream.column(); + } + } + + return style; + }, + + indent: function(state, textAfter) { + var firstChar = textAfter && textAfter.charAt(0); + var context = state.context; + if (/[\]\}]/.test(firstChar)) + while (context && context.type == "pattern") context = context.prev; + + var closing = context && firstChar == context.type; + if (!context) + return 0; + else if (context.type == "pattern") + return context.col; + else if (context.align) + return context.col + (closing ? 0 : 1); + else + return context.indent + (closing ? 0 : indentUnit); + } + }; +}); + +CodeMirror.defineMIME("text/turtle", "turtle"); + +}); Added: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/xml.js =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/xml.js (rev 0) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/xml.js 2014-05-09 11:55:20 UTC (rev 8245) @@ -0,0 +1,381 @@ +(function(mod) { + if (typeof exports == "object" && typeof module == "object") // CommonJS + mod(require("../../lib/codemirror")); + else if (typeof define == "function" && define.amd) // AMD + define(["../../lib/codemirror"], mod); + else // Plain browser env + mod(CodeMirror); +})(function(CodeMirror) { +"use strict"; + +CodeMirror.defineMode("xml", function(config, parserConfig) { + var indentUnit = config.indentUnit; + var multilineTagIndentFactor = parserConfig.multilineTagIndentFactor || 1; + var multilineTagIndentPastTag = parserConfig.multilineTagIndentPastTag; + if (multilineTagIndentPastTag == null) multilineTagIndentPastTag = 
true; + + var Kludges = parserConfig.htmlMode ? { + autoSelfClosers: {'area': true, 'base': true, 'br': true, 'col': true, 'command': true, + 'embed': true, 'frame': true, 'hr': true, 'img': true, 'input': true, + 'keygen': true, 'link': true, 'meta': true, 'param': true, 'source': true, + 'track': true, 'wbr': true}, + implicitlyClosed: {'dd': true, 'li': true, 'optgroup': true, 'option': true, 'p': true, + 'rp': true, 'rt': true, 'tbody': true, 'td': true, 'tfoot': true, + 'th': true, 'tr': true}, + contextGrabbers: { + 'dd': {'dd': true, 'dt': true}, + 'dt': {'dd': true, 'dt': true}, + 'li': {'li': true}, + 'option': {'option': true, 'optgroup': true}, + 'optgroup': {'optgroup': true}, + 'p': {'address': true, 'article': true, 'aside': true, 'blockquote': true, 'dir': true, + 'div': true, 'dl': true, 'fieldset': true, 'footer': true, 'form': true, + 'h1': true, 'h2': true, 'h3': true, 'h4': true, 'h5': true, 'h6': true, + 'header': true, 'hgroup': true, 'hr': true, 'menu': true, 'nav': true, 'ol': true, + 'p': true, 'pre': true, 'section': true, 'table': true, 'ul': true}, + 'rp': {'rp': true, 'rt': true}, + 'rt': {'rp': true, 'rt': true}, + 'tbody': {'tbody': true, 'tfoot': true}, + 'td': {'td': true, 'th': true}, + 'tfoot': {'tbody': true}, + 'th': {'td': true, 'th': true}, + 'thead': {'tbody': true, 'tfoot': true}, + 'tr': {'tr': true} + }, + doNotIndent: {"pre": true}, + allowUnquoted: true, + allowMissing: true, + caseFold: true + } : { + autoSelfClosers: {}, + implicitlyClosed: {}, + contextGrabbers: {}, + doNotIndent: {}, + allowUnquoted: false, + allowMissing: false, + caseFold: false + }; + var alignCDATA = parserConfig.alignCDATA; + + // Return variables for tokenizers + var type, setStyle; + + function inText(stream, state) { + function chain(parser) { + state.tokenize = parser; + return parser(stream, state); + } + + var ch = stream.next(); + if (ch == "<") { + if (stream.eat("!")) { + if (stream.eat("[")) { + if (stream.match("CDATA[")) return 
chain(inBlock("atom", "]]>")); + else return null; + } else if (stream.match("--")) { + return chain(inBlock("comment", "-->")); + } else if (stream.match("DOCTYPE", true, true)) { + stream.eatWhile(/[\w\._\-]/); + return chain(doctype(1)); + } else { + return null; + } + } else if (stream.eat("?")) { + stream.eatWhile(/[\w\._\-]/); + state.tokenize = inBlock("meta", "?>"); + return "meta"; + } else { + type = stream.eat("/") ? "closeTag" : "openTag"; + state.tokenize = inTag; + return "tag bracket"; + } + } else if (ch == "&") { + var ok; + if (stream.eat("#")) { + if (stream.eat("x")) { + ok = stream.eatWhile(/[a-fA-F\d]/) && stream.eat(";"); + } else { + ok = stream.eatWhile(/[\d]/) && stream.eat(";"); + } + } else { + ok = stream.eatWhile(/[\w\.\-:]/) && stream.eat(";"); + } + return ok ? "atom" : "error"; + } else { + stream.eatWhile(/[^&<]/); + return null; + } + } + + function inTag(stream, state) { + var ch = stream.next(); + if (ch == ">" || (ch == "/" && stream.eat(">"))) { + state.tokenize = inText; + type = ch == ">" ? "endTag" : "selfcloseTag"; + return "tag bracket"; + } else if (ch == "=") { + type = "equals"; + return null; + } else if (ch == "<") { + state.tokenize = inText; + state.state = baseState; + state.tagName = state.tagStart = null; + var next = state.tokenize(stream, state); + return next ? 
next + " error" : "error"; + } else if (/[\'\"]/.test(ch)) { + state.tokenize = inAttribute(ch); + state.stringStartCol = stream.column(); + return state.tokenize(stream, state); + } else { + stream.match(/^[^\s\u00a0=<>\"\']*[^\s\u00a0=<>\"\'\/]/); + return "word"; + } + } + + function inAttribute(quote) { + var closure = function(stream, state) { + while (!stream.eol()) { + if (stream.next() == quote) { + state.tokenize = inTag; + break; + } + } + return "string"; + }; + closure.isInAttribute = true; + return closure; + } + + function inBlock(style, terminator) { + return function(stream, state) { + while (!stream.eol()) { + if (stream.match(terminator)) { + state.tokenize = inText; + break; + } + stream.next(); + } + return style; + }; + } + function doctype(depth) { + return function(stream, state) { + var ch; + while ((ch = stream.next()) != null) { + if (ch == "<") { + state.tokenize = doctype(depth + 1); + return state.tokenize(stream, state); + } else if (ch == ">") { + if (depth == 1) { + state.tokenize = inText; + break; + } else { + state.tokenize = doctype(depth - 1); + return state.tokenize(stream, state); + } + } + } + return "meta"; + }; + } + + function Context(state, tagName, startOfLine) { + this.prev = state.context; + this.tagName = tagName; + this.indent = state.indented; + this.startOfLine = startOfLine; + if (Kludges.doNotIndent.hasOwnProperty(tagName) || (state.context && state.context.noIndent)) + this.noIndent = true; + } + function popContext(state) { + if (state.context) state.context = state.context.prev; + } + function maybePopContext(state, nextTagName) { + var parentTagName; + while (true) { + if (!state.context) { + return; + } + parentTagName = state.context.tagName; + if (!Kludges.contextGrabbers.hasOwnProperty(parentTagName) || + !Kludges.contextGrabbers[parentTagName].hasOwnProperty(nextTagName)) { + return; + } + popContext(state); + } + } + + function baseState(type, stream, state) { + if (type == "openTag") { + state.tagStart 
= stream.column(); + return tagNameState; + } else if (type == "closeTag") { + return closeTagNameState; + } else { + return baseState; + } + } + function tagNameState(type, stream, state) { + if (type == "word") { + state.tagName = stream.current(); + setStyle = "tag"; + return attrState; + } else { + setStyle = "error"; + return tagNameState; + } + } + function closeTagNameState(type, stream, state) { + if (type == "word") { + var tagName = stream.current(); + if (state.context && state.context.tagName != tagName && + Kludges.implicitlyClosed.hasOwnProperty(state.context.tagName)) + popContext(state); + if (state.context && state.context.tagName == tagName) { + setStyle = "tag"; + return closeState; + } else { + setStyle = "tag error"; + return closeStateErr; + } + } else { + setStyle = "error"; + return closeStateErr; + } + } + + function closeState(type, _stream, state) { + if (type != "endTag") { + setStyle = "error"; + return closeState; + } + popContext(state); + return baseState; + } + function closeStateErr(type, stream, state) { + setStyle = "error"; + return closeState(type, stream, state); + } + + function attrState(type, _stream, state) { + if (type == "word") { + setStyle = "attribute"; + return attrEqState; + } else if (type == "endTag" || type == "selfcloseTag") { + var tagName = state.tagName, tagStart = state.tagStart; + state.tagName = state.tagStart = null; + if (type == "selfcloseTag" || + Kludges.autoSelfClosers.hasOwnProperty(tagName)) { + maybePopContext(state, tagName); + } else { + maybePopContext(state, tagName); + state.context = new Context(state, tagName, tagStart == state.indented); + } + return baseState; + } + setStyle = "error"; + return attrState; + } + function attrEqState(type, stream, state) { + if (type == "equals") return attrValueState; + if (!Kludges.allowMissing) setStyle = "error"; + return attrState(type, stream, state); + } + function attrValueState(type, stream, state) { + if (type == "string") return 
attrContinuedState; + if (type == "word" && Kludges.allowUnquoted) {setStyle = "string"; return attrState;} + setStyle = "error"; + return attrState(type, stream, state); + } + function attrContinuedState(type, stream, state) { + if (type == "string") return attrContinuedState; + return attrState(type, stream, state); + } + + return { + startState: function() { + return {tokenize: inText, + state: baseState, + indented: 0, + tagName: null, tagStart: null, + context: null}; + }, + + token: function(stream, state) { + if (!state.tagName && stream.sol()) + state.indented = stream.indentation(); + + if (stream.eatSpace()) return null; + type = null; + var style = state.tokenize(stream, state); + if ((style || type) && style != "comment") { + setStyle = null; + state.state = state.state(type || style, stream, state); + if (setStyle) + style = setStyle == "error" ? style + " error" : setStyle; + } + return style; + }, + + indent: function(state, textAfter, fullLine) { + var context = state.context; + // Indent multi-line strings (e.g. css). + if (state.tokenize.isInAttribute) { + if (state.tagStart == state.indented) + return state.stringStartCol + 1; + else + return state.indented + indentUnit; + } + if (context && context.noIndent) return CodeMirror.Pass; + if (state.tokenize != inTag && state.tokenize != inText) + return fullLine ? fullLine.match(/^(\s*)/)[0].length : 0; + // Indent the starts of attribute names. 
+ if (state.tagName) { + if (multilineTagIndentPastTag) + return state.tagStart + state.tagName.length + 2; + else + return state.tagStart + indentUnit * multilineTagIndentFactor; + } + if (alignCDATA && /<!\[CDATA\[/.test(textAfter)) return 0; + var tagAfter = textAfter && /^<(\/)?([\w_:\.-]*)/.exec(textAfter); + if (tagAfter && tagAfter[1]) { // Closing tag spotted + while (context) { + if (context.tagName == tagAfter[2]) { + context = context.prev; + break; + } else if (Kludges.implicitlyClosed.hasOwnProperty(context.tagName)) { + context = context.prev; + } else { + break; + } + } + } else if (tagAfter) { // Opening tag spotted + while (context) { + var grabbers = Kludges.contextGrabbers[context.tagName]; + if (grabbers && grabbers.hasOwnProperty(tagAfter[2])) + context = context.prev; + else + break; + } + } + while (context && !context.startOfLine) + context = context.prev; + if (context) return context.indent + indentUnit; + else return 0; + }, + + electricInput: /<\/[\s\w:]+>$/, + blockCommentStart: "<!--", + blockCommentEnd: "-->", + + configuration: parserConfig.htmlMode ? "html" : "xml", + helperType: parserConfig.htmlMode ? 
"html" : "xml" + }; +}); + +CodeMirror.defineMIME("text/xml", "xml"); +CodeMirror.defineMIME("application/xml", "xml"); +if (!CodeMirror.mimeModes.hasOwnProperty("text/html")) + CodeMirror.defineMIME("text/html", {name: "xml", htmlMode: true}); + +}); Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js 2014-05-09 11:33:52 UTC (rev 8244) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js 2014-05-09 11:55:20 UTC (rev 8245) @@ -349,16 +349,31 @@ if(type == 'rdf') { $('#rdf-type').val(format); } - showUpdateOptions(type); + setUpdateSettings(type); } -$('#update-type').change(function() { showUpdateOptions(this.value) }); +$('#update-type').change(function() { setUpdateSettings(this.value); }); +$('#rdf-type').change(function() { setUpdateMode('rdf'); }); -function showUpdateOptions(type) { +function setUpdateSettings(type) { $('#rdf-type, label[for="rdf-type"]').attr('disabled', type != 'rdf'); $('#update-tab .advanced-features input').attr('disabled', type != 'sparql'); + setUpdateMode(type); } +function setUpdateMode(type) { + var mode = ''; + if(type == 'sparql') { + mode = 'sparql'; + } else if(type == 'rdf') { + type = $('#rdf-type').val(); + if(type in rdf_modes) { + mode = rdf_modes[type]; + } + } + UPDATE_EDITOR.setOption('mode', mode); +} + // .xml is used for both RDF and TriX, assume it's RDF // We could check the parent element to see which it is var rdf_types = {'nq': 'n-quads', @@ -383,6 +398,9 @@ 'trix': 'application/trix', 'turtle': 'application/x-turtle'}; +// key is value of RDF type selector, value is name of CodeMirror mode +var rdf_modes = {'n-triples': 'ntriples', 'rdf/xml': 'xml', 'json': 'json', 'turtle': 'turtle'}; + var sparql_update_commands = ['INSERT', 'DELETE', 'LOAD', 'CLEAR']; $('#update-file').change(handleFile); @@ -395,7 +413,7 @@ 
$('#update-update').click(submitUpdate); -UPDATE_EDITOR = CodeMirror.fromTextArea($('#update-box')[0], {lineNumbers: true}); +UPDATE_EDITOR = CodeMirror.fromTextArea($('#update-box')[0], {lineNumbers: true, mode: 'sparql'}); function submitUpdate(e) { // Updates are submitted as a regular form for SPARQL updates in monitor mode, and via AJAX for non-monitor SPARQL, RDF & file path updates. This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
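[Editorial note: the `setUpdateMode` change in the workbench.js diff above routes the value of the RDF type selector through the `rdf_modes` map to pick a CodeMirror mode. The selection logic can be sketched as a standalone function; `rdf_modes` is copied from the commit, while `modeForUpdateType` is a hypothetical refactoring of the body of `setUpdateMode` (which sets the editor option directly rather than returning a value):]

```javascript
// rdf_modes as committed: key is the RDF type selector value,
// value is the CodeMirror mode name.
var rdf_modes = {'n-triples': 'ntriples', 'rdf/xml': 'xml',
                 'json': 'json', 'turtle': 'turtle'};

// Hypothetical pure-function version of setUpdateMode's mode choice.
function modeForUpdateType(type, rdfType) {
  if (type === 'sparql') {
    return 'sparql';               // SPARQL updates get the SPARQL mode
  }
  if (type === 'rdf' && rdfType in rdf_modes) {
    return rdf_modes[rdfType];     // RDF data uses the per-format mode
  }
  return '';                       // anything else: no highlighting
}

console.log(modeForUpdateType('sparql'));         // 'sparql'
console.log(modeForUpdateType('rdf', 'turtle'));  // 'turtle'
console.log(modeForUpdateType('rdf', 'rdf/xml')); // 'xml'
console.log(modeForUpdateType('path'));           // ''
```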
From: <tob...@us...> - 2014-05-09 11:33:55
Revision: 8244 http://sourceforge.net/p/bigdata/code/8244 Author: tobycraig Date: 2014-05-09 11:33:52 +0000 (Fri, 09 May 2014) Log Message: ----------- Added SPARQL syntax highlighting for query panel Modified Paths: -------------- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js Added Paths: ----------- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/sparql.js Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html 2014-05-09 11:14:35 UTC (rev 8243) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html 2014-05-09 11:33:52 UTC (rev 8244) @@ -221,6 +221,7 @@ <script src="/bigdata/html/js/vendor/jquery.hotkeys.js"></script> <script src="/bigdata/html/js/vendor/codemirror.js"></script> <script src="/bigdata/html/js/vendor/cm-addons/placeholder.js"></script> + <script src="/bigdata/html/js/vendor/cm-modes/sparql.js"></script> <script src="/bigdata/html/js/workbench.js"></script> </body> </html> Added: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/sparql.js =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/sparql.js (rev 0) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-modes/sparql.js 2014-05-09 11:33:52 UTC (rev 8244) @@ -0,0 +1,157 @@ +(function(mod) { + if (typeof exports == "object" && typeof module == "object") // CommonJS + mod(require("../../lib/codemirror")); + else if (typeof define == "function" && define.amd) // AMD + define(["../../lib/codemirror"], mod); + else // Plain browser env + mod(CodeMirror); +})(function(CodeMirror) { +"use strict"; + +CodeMirror.defineMode("sparql", 
function(config) { + var indentUnit = config.indentUnit; + var curPunc; + + function wordRegexp(words) { + return new RegExp("^(?:" + words.join("|") + ")$", "i"); + } + var ops = wordRegexp(["str", "lang", "langmatches", "datatype", "bound", "sameterm", "isiri", "isuri", + "isblank", "isliteral", "a"]); + var keywords = wordRegexp(["base", "prefix", "select", "distinct", "reduced", "construct", "describe", + "ask", "from", "named", "where", "order", "limit", "offset", "filter", "optional", + "graph", "by", "asc", "desc", "as", "having", "undef", "values", "group", + "minus", "in", "not", "service", "silent", "using", "insert", "delete", "union", + "data", "copy", "to", "move", "add", "create", "drop", "clear", "load"]); + var operatorChars = /[*+\-<>=&|]/; + + function tokenBase(stream, state) { + var ch = stream.next(); + curPunc = null; + if (ch == "$" || ch == "?") { + stream.match(/^[\w\d]*/); + return "variable-2"; + } + else if (ch == "<" && !stream.match(/^[\s\u00a0=]/, false)) { + stream.match(/^[^\s\u00a0>]*>?/); + return "atom"; + } + else if (ch == "\"" || ch == "'") { + state.tokenize = tokenLiteral(ch); + return state.tokenize(stream, state); + } + else if (/[{}\(\),\.;\[\]]/.test(ch)) { + curPunc = ch; + return null; + } + else if (ch == "#") { + stream.skipToEnd(); + return "comment"; + } + else if (operatorChars.test(ch)) { + stream.eatWhile(operatorChars); + return null; + } + else if (ch == ":") { + stream.eatWhile(/[\w\d\._\-]/); + return "atom"; + } + else { + stream.eatWhile(/[_\w\d]/); + if (stream.eat(":")) { + stream.eatWhile(/[\w\d_\-]/); + return "atom"; + } + var word = stream.current(); + if (ops.test(word)) + return null; + else if (keywords.test(word)) + return "keyword"; + else + return "variable"; + } + } + + function tokenLiteral(quote) { + return function(stream, state) { + var escaped = false, ch; + while ((ch = stream.next()) != null) { + if (ch == quote && !escaped) { + state.tokenize = tokenBase; + break; + } + escaped = 
!escaped && ch == "\\"; + } + return "string"; + }; + } + + function pushContext(state, type, col) { + state.context = {prev: state.context, indent: state.indent, col: col, type: type}; + } + function popContext(state) { + state.indent = state.context.indent; + state.context = state.context.prev; + } + + return { + startState: function() { + return {tokenize: tokenBase, + context: null, + indent: 0, + col: 0}; + }, + + token: function(stream, state) { + if (stream.sol()) { + if (state.context && state.context.align == null) state.context.align = false; + state.indent = stream.indentation(); + } + if (stream.eatSpace()) return null; + var style = state.tokenize(stream, state); + + if (style != "comment" && state.context && state.context.align == null && state.context.type != "pattern") { + state.context.align = true; + } + + if (curPunc == "(") pushContext(state, ")", stream.column()); + else if (curPunc == "[") pushContext(state, "]", stream.column()); + else if (curPunc == "{") pushContext(state, "}", stream.column()); + else if (/[\]\}\)]/.test(curPunc)) { + while (state.context && state.context.type == "pattern") popContext(state); + if (state.context && curPunc == state.context.type) popContext(state); + } + else if (curPunc == "." 
&& state.context && state.context.type == "pattern") popContext(state); + else if (/atom|string|variable/.test(style) && state.context) { + if (/[\}\]]/.test(state.context.type)) + pushContext(state, "pattern", stream.column()); + else if (state.context.type == "pattern" && !state.context.align) { + state.context.align = true; + state.context.col = stream.column(); + } + } + + return style; + }, + + indent: function(state, textAfter) { + var firstChar = textAfter && textAfter.charAt(0); + var context = state.context; + if (/[\]\}]/.test(firstChar)) + while (context && context.type == "pattern") context = context.prev; + + var closing = context && firstChar == context.type; + if (!context) + return 0; + else if (context.type == "pattern") + return context.col; + else if (context.align) + return context.col + (closing ? 0 : 1); + else + return context.indent + (closing ? 0 : indentUnit); + } + }; +}); + +CodeMirror.defineMIME("application/x-sparql-query", "sparql"); + +}); Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js 2014-05-09 11:14:35 UTC (rev 8243) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js 2014-05-09 11:33:52 UTC (rev 8244) @@ -510,7 +510,7 @@ } }); -QUERY_EDITOR = CodeMirror.fromTextArea($('#query-box')[0], {lineNumbers: true}); +QUERY_EDITOR = CodeMirror.fromTextArea($('#query-box')[0], {lineNumbers: true, mode: 'sparql'}); function submitQuery(e) { e.preventDefault(); This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
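[Editorial note: the `wordRegexp` helper in the sparql.js mode above compiles the whole keyword list into one anchored, case-insensitive regular expression, so each scanned word is classified with a single `test` call. Its behavior in isolation, with a shortened keyword list for brevity:]

```javascript
// wordRegexp as committed in sparql.js: anchored (^...$) so only whole
// words match, with the "i" flag so keywords match in any case.
function wordRegexp(words) {
  return new RegExp("^(?:" + words.join("|") + ")$", "i");
}

var keywords = wordRegexp(["base", "prefix", "select", "where", "filter"]);

console.log(keywords.test("SELECT"));   // true  - case-insensitive
console.log(keywords.test("Where"));    // true
console.log(keywords.test("selected")); // false - anchors reject partial matches
```

One caveat: words containing regex metacharacters would need escaping, but the keyword lists committed here are plain letters (the "@" in turtle.js's "@prefix" is a literal character in a regex), so the simple join is safe.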
From: <tob...@us...> - 2014-05-09 11:14:38
Revision: 8243 http://sourceforge.net/p/bigdata/code/8243 Author: tobycraig Date: 2014-05-09 11:14:35 +0000 (Fri, 09 May 2014) Log Message: ----------- Fixed incorrect CSS selector Modified Paths: -------------- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/style.css Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/style.css =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/style.css 2014-05-09 00:11:10 UTC (rev 8242) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/style.css 2014-05-09 11:14:35 UTC (rev 8243) @@ -209,7 +209,7 @@ color: #ededed; } -#query-form, #load-box-container { +#query-form, #update-box-container { clear: both; } This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <jer...@us...> - 2014-05-09 00:11:14
Revision: 8242 http://sourceforge.net/p/bigdata/code/8242 Author: jeremy_carroll Date: 2014-05-09 00:11:10 +0000 (Fri, 09 May 2014) Log Message: ----------- organized imports Modified Paths: -------------- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestKeyBuilder.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestLanguageRange.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestPrefixSearch.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestSearch.java branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestSearchRestartSafe.java Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestKeyBuilder.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestKeyBuilder.java 2014-05-08 23:50:43 UTC (rev 8241) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestKeyBuilder.java 2014-05-09 00:11:10 UTC (rev 8242) @@ -33,13 +33,9 @@ import com.bigdata.btree.BytesUtil; import com.bigdata.btree.ITupleSerializer; import com.bigdata.btree.IndexMetadata; -import com.bigdata.btree.keys.DefaultKeyBuilderFactory; import com.bigdata.btree.keys.IKeyBuilder; import com.bigdata.btree.keys.KeyBuilder; import com.bigdata.btree.keys.StrengthEnum; -import com.bigdata.journal.IIndexManager; -import com.bigdata.journal.ITx; -import com.bigdata.journal.ProxyTestCase; import com.bigdata.search.FullTextIndex.Options; /** Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestLanguageRange.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestLanguageRange.java 2014-05-08 23:50:43 UTC (rev 8241) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestLanguageRange.java 2014-05-09 00:11:10 UTC (rev 8242) @@ -26,10 +26,10 @@ */ package com.bigdata.search; +import junit.framework.TestCase2; + 
import com.bigdata.search.ConfigurableAnalyzerFactory.LanguageRange; -import junit.framework.TestCase2; - public class TestLanguageRange extends TestCase2 { public TestLanguageRange() { Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestPrefixSearch.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestPrefixSearch.java 2014-05-08 23:50:43 UTC (rev 8241) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestPrefixSearch.java 2014-05-09 00:11:10 UTC (rev 8242) @@ -29,12 +29,8 @@ package com.bigdata.search; import java.io.StringReader; -import java.util.Properties; import java.util.concurrent.TimeUnit; -import com.bigdata.journal.IIndexManager; -import com.bigdata.journal.ITx; -import com.bigdata.journal.ProxyTestCase; import com.bigdata.rdf.lexicon.ITextIndexer.FullTextQuery; /** Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestSearch.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestSearch.java 2014-05-08 23:50:43 UTC (rev 8241) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestSearch.java 2014-05-09 00:11:10 UTC (rev 8242) @@ -33,9 +33,6 @@ import java.util.Properties; import java.util.concurrent.TimeUnit; -import com.bigdata.journal.IIndexManager; -import com.bigdata.journal.ITx; -import com.bigdata.journal.ProxyTestCase; import com.bigdata.rdf.lexicon.ITextIndexer.FullTextQuery; /** Modified: branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestSearchRestartSafe.java =================================================================== --- branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestSearchRestartSafe.java 2014-05-08 23:50:43 UTC (rev 8241) +++ branches/TEXT_ANALYZERS/bigdata/src/test/com/bigdata/search/TestSearchRestartSafe.java 2014-05-09 00:11:10 UTC (rev 8242) @@ -29,12 +29,9 
@@ package com.bigdata.search; import java.io.StringReader; -import java.util.Properties; import java.util.concurrent.TimeUnit; -import com.bigdata.journal.IIndexManager; import com.bigdata.journal.ITx; -import com.bigdata.journal.ProxyTestCase; import com.bigdata.rdf.lexicon.ITextIndexer.FullTextQuery; /** This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <jer...@us...> - 2014-05-08 23:50:46
Revision: 8241 http://sourceforge.net/p/bigdata/code/8241 Author: jeremy_carroll Date: 2014-05-08 23:50:43 +0000 (Thu, 08 May 2014) Log Message: ----------- New branch for work on using different Lucene Analyzers for bds:search Added Paths: ----------- branches/TEXT_ANALYZERS/ This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. |
From: <tob...@us...> - 2014-05-08 23:17:43
Revision: 8240 http://sourceforge.net/p/bigdata/code/8240 Author: tobycraig Date: 2014-05-08 23:17:39 +0000 (Thu, 08 May 2014) Log Message: ----------- Added basic CodeMirror editor Modified Paths: -------------- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/style.css branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/workbench.js Added Paths: ----------- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/vendor/ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/vendor/codemirror.css branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-addons/ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-addons/placeholder.js branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/codemirror.js Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/style.css =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/style.css 2014-05-08 23:17:07 UTC (rev 8239) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/style.css 2014-05-08 23:17:39 UTC (rev 8240) @@ -209,6 +209,10 @@ color: #ededed; } +#query-form, #load-box-container { + clear: both; +} + #large-file-message { display: none; margin: 5px 0; @@ -221,6 +225,15 @@ box-sizing: border-box; } +.CodeMirror { + margin: 5px 0; + border: 1px solid #e1e1e1; +} + +.CodeMirror-placeholder { + font-style: italic; +} + hr { background: #929292; border: none; @@ -271,36 +284,6 @@ float: right; } -#update-box, #query-box { - background-color: transparent; - padding: 2px; - border-width: 1px; - border-color: #e1e1e1; -} - -/* these should have the same typography so the error highlighting matches up with the query text */ -#update-box, #update-errors, #query-box, #query-errors { - font-family: sans-serif; - font-size: 90%; - line-height: normal; -} - -#update-errors, #query-errors { - position: absolute; - z-index: -1; - padding: 
8px 3px; - color: transparent; - white-space: pre; -} - -.error-line { - background-color: lightgreen; -} - -.error-character { - background-color: red; -} - #running-queries li { margin: 10px 0; } Added: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/vendor/codemirror.css =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/vendor/codemirror.css (rev 0) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/vendor/codemirror.css 2014-05-08 23:17:39 UTC (rev 8240) @@ -0,0 +1,272 @@ +/* BASICS */ + +.CodeMirror { + /* Set height, width, borders, and global font properties here */ + font-family: monospace; + height: 300px; +} +.CodeMirror-scroll { + /* Set scrolling behaviour here */ + overflow: auto; +} + +/* PADDING */ + +.CodeMirror-lines { + padding: 4px 0; /* Vertical padding around content */ +} +.CodeMirror pre { + padding: 0 4px; /* Horizontal padding of content */ +} + +.CodeMirror-scrollbar-filler, .CodeMirror-gutter-filler { + background-color: white; /* The little square between H and V scrollbars */ +} + +/* GUTTER */ + +.CodeMirror-gutters { + border-right: 1px solid #ddd; + background-color: #f7f7f7; + white-space: nowrap; +} +.CodeMirror-linenumbers {} +.CodeMirror-linenumber { + padding: 0 3px 0 5px; + min-width: 20px; + text-align: right; + color: #999; + -moz-box-sizing: content-box; + box-sizing: content-box; +} + +/* CURSOR */ + +.CodeMirror div.CodeMirror-cursor { + border-left: 1px solid black; +} +/* Shown when moving in bi-directional text */ +.CodeMirror div.CodeMirror-secondarycursor { + border-left: 1px solid silver; +} +.CodeMirror.cm-keymap-fat-cursor div.CodeMirror-cursor { + width: auto; + border: 0; + background: #7e7; +} +/* Can style cursor different in overwrite (non-insert) mode */ +div.CodeMirror-overwrite div.CodeMirror-cursor {} + +.cm-tab { display: inline-block; } + +.CodeMirror-ruler { + border-left: 1px solid #ccc; + position: absolute; 
+} + +/* DEFAULT THEME */ + +.cm-s-default .cm-keyword {color: #708;} +.cm-s-default .cm-atom {color: #219;} +.cm-s-default .cm-number {color: #164;} +.cm-s-default .cm-def {color: #00f;} +.cm-s-default .cm-variable, +.cm-s-default .cm-punctuation, +.cm-s-default .cm-property, +.cm-s-default .cm-operator {} +.cm-s-default .cm-variable-2 {color: #05a;} +.cm-s-default .cm-variable-3 {color: #085;} +.cm-s-default .cm-comment {color: #a50;} +.cm-s-default .cm-string {color: #a11;} +.cm-s-default .cm-string-2 {color: #f50;} +.cm-s-default .cm-meta {color: #555;} +.cm-s-default .cm-qualifier {color: #555;} +.cm-s-default .cm-builtin {color: #30a;} +.cm-s-default .cm-bracket {color: #997;} +.cm-s-default .cm-tag {color: #170;} +.cm-s-default .cm-attribute {color: #00c;} +.cm-s-default .cm-header {color: blue;} +.cm-s-default .cm-quote {color: #090;} +.cm-s-default .cm-hr {color: #999;} +.cm-s-default .cm-link {color: #00c;} + +.cm-negative {color: #d44;} +.cm-positive {color: #292;} +.cm-header, .cm-strong {font-weight: bold;} +.cm-em {font-style: italic;} +.cm-link {text-decoration: underline;} + +.cm-s-default .cm-error {color: #f00;} +.cm-invalidchar {color: #f00;} + +div.CodeMirror span.CodeMirror-matchingbracket {color: #0f0;} +div.CodeMirror span.CodeMirror-nonmatchingbracket {color: #f22;} +.CodeMirror-activeline-background {background: #e8f2ff;} + +/* STOP */ + +/* The rest of this file contains styles related to the mechanics of + the editor. You probably shouldn't touch them. 
*/ + +.CodeMirror { + line-height: 1; + position: relative; + overflow: hidden; + background: white; + color: black; +} + +.CodeMirror-scroll { + /* 30px is the magic margin used to hide the element's real scrollbars */ + /* See overflow: hidden in .CodeMirror */ + margin-bottom: -30px; margin-right: -30px; + padding-bottom: 30px; + height: 100%; + outline: none; /* Prevent dragging from highlighting the element */ + position: relative; + -moz-box-sizing: content-box; + box-sizing: content-box; +} +.CodeMirror-sizer { + position: relative; + border-right: 30px solid transparent; + -moz-box-sizing: content-box; + box-sizing: content-box; +} + +/* The fake, visible scrollbars. Used to force redraw during scrolling + before actuall scrolling happens, thus preventing shaking and + flickering artifacts. */ +.CodeMirror-vscrollbar, .CodeMirror-hscrollbar, .CodeMirror-scrollbar-filler, .CodeMirror-gutter-filler { + position: absolute; + z-index: 6; + display: none; +} +.CodeMirror-vscrollbar { + right: 0; top: 0; + overflow-x: hidden; + overflow-y: scroll; +} +.CodeMirror-hscrollbar { + bottom: 0; left: 0; + overflow-y: hidden; + overflow-x: scroll; +} +.CodeMirror-scrollbar-filler { + right: 0; bottom: 0; +} +.CodeMirror-gutter-filler { + left: 0; bottom: 0; +} + +.CodeMirror-gutters { + position: absolute; left: 0; top: 0; + padding-bottom: 30px; + z-index: 3; +} +.CodeMirror-gutter { + white-space: normal; + height: 100%; + -moz-box-sizing: content-box; + box-sizing: content-box; + padding-bottom: 30px; + margin-bottom: -32px; + display: inline-block; + /* Hack to make IE7 behave */ + *zoom:1; + *display:inline; +} +.CodeMirror-gutter-elt { + position: absolute; + cursor: default; + z-index: 4; +} + +.CodeMirror-lines { + cursor: text; +} +.CodeMirror pre { + /* Reset some styles that the rest of the page might have set */ + -moz-border-radius: 0; -webkit-border-radius: 0; border-radius: 0; + border-width: 0; + background: transparent; + font-family: inherit; + 
font-size: inherit; + margin: 0; + white-space: pre; + word-wrap: normal; + line-height: inherit; + color: inherit; + z-index: 2; + position: relative; + overflow: visible; +} +.CodeMirror-wrap pre { + word-wrap: break-word; + white-space: pre-wrap; + word-break: normal; +} + +.CodeMirror-linebackground { + position: absolute; + left: 0; right: 0; top: 0; bottom: 0; + z-index: 0; +} + +.CodeMirror-linewidget { + position: relative; + z-index: 2; + overflow: auto; +} + +.CodeMirror-widget {} + +.CodeMirror-wrap .CodeMirror-scroll { + overflow-x: hidden; +} + +.CodeMirror-measure { + position: absolute; + width: 100%; + height: 0; + overflow: hidden; + visibility: hidden; +} +.CodeMirror-measure pre { position: static; } + +.CodeMirror div.CodeMirror-cursor { + position: absolute; + border-right: none; + width: 0; +} + +div.CodeMirror-cursors { + visibility: hidden; + position: relative; + z-index: 1; +} +.CodeMirror-focused div.CodeMirror-cursors { + visibility: visible; +} + +.CodeMirror-selected { background: #d9d9d9; } +.CodeMirror-focused .CodeMirror-selected { background: #d7d4f0; } +.CodeMirror-crosshair { cursor: crosshair; } + +.cm-searching { + background: #ffa; + background: rgba(255, 255, 0, .4); +} + +/* IE7 hack to prevent it from returning funny offsetTops on the spans */ +.CodeMirror span { *vertical-align: text-bottom; } + +/* Used to force a border model for a node */ +.cm-force-border { padding-right: .1px; } + +@media print { + /* Hide the cursor when printing */ + .CodeMirror div.CodeMirror-cursors { + visibility: hidden; + } +} Property changes on: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/css/vendor/codemirror.css ___________________________________________________________________ Added: svn:executable ## -0,0 +1 ## +* \ No newline at end of property Modified: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html =================================================================== --- 
branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html 2014-05-08 23:17:07 UTC (rev 8239) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/index.html 2014-05-08 23:17:39 UTC (rev 8240) @@ -7,6 +7,7 @@ <!-- meta charset="utf-8" --> <meta http-equiv="Content-Type" content="text/html;charset=utf-8" > <title>Bigdata Workbench</title> + <link rel="stylesheet" href="/bigdata/html/css/vendor/codemirror.css"> <link rel="stylesheet" href="/bigdata/html/css/style.css"> <!-- junit test marker: index.html --> </head> @@ -36,8 +37,7 @@ <div class="namespace-shortcuts"> </div> - <div> - <div id="update-errors"></div> + <div id="update-box-container"> <textarea id="update-box" placeholder="(Type in or drag a file containing RDF data, a SPARQL update or a file path or URL)"></textarea> </div> <p id="large-file-message">Your file <span id="filename"></span> is too large to display here, but will be uploaded as normal. <a href="#" id="clear-file">Remove file</a></p> @@ -89,7 +89,6 @@ </div> <form id="query-form"> - <div id="query-errors"></div> <textarea id="query-box" name="query" placeholder="(Input a SPARQL query)"></textarea> <a href="#" class="advanced-features-toggle">Advanced features</a> @@ -220,6 +219,8 @@ <script src="//ajax.googleapis.com/ajax/libs/jquery/1.10.2/jquery.min.js"></script> <script>window.jQuery || document.write('<script src="/bigdata/html/js/vendor/jquery.min.js"><\/script>')</script> <script src="/bigdata/html/js/vendor/jquery.hotkeys.js"></script> + <script src="/bigdata/html/js/vendor/codemirror.js"></script> + <script src="/bigdata/html/js/vendor/cm-addons/placeholder.js"></script> <script src="/bigdata/html/js/workbench.js"></script> </body> </html> Added: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-addons/placeholder.js =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-addons/placeholder.js (rev 0) +++ 
branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-addons/placeholder.js 2014-05-08 23:17:39 UTC (rev 8240) @@ -0,0 +1,55 @@ +(function(mod) { + if (typeof exports == "object" && typeof module == "object") // CommonJS + mod(require("../../lib/codemirror")); + else if (typeof define == "function" && define.amd) // AMD + define(["../../lib/codemirror"], mod); + else // Plain browser env + mod(CodeMirror); +})(function(CodeMirror) { + CodeMirror.defineOption("placeholder", "", function(cm, val, old) { + var prev = old && old != CodeMirror.Init; + if (val && !prev) { + cm.on("blur", onBlur); + cm.on("change", onChange); + onChange(cm); + } else if (!val && prev) { + cm.off("blur", onBlur); + cm.off("change", onChange); + clearPlaceholder(cm); + var wrapper = cm.getWrapperElement(); + wrapper.className = wrapper.className.replace(" CodeMirror-empty", ""); + } + + if (val && !cm.hasFocus()) onBlur(cm); + }); + + function clearPlaceholder(cm) { + if (cm.state.placeholder) { + cm.state.placeholder.parentNode.removeChild(cm.state.placeholder); + cm.state.placeholder = null; + } + } + function setPlaceholder(cm) { + clearPlaceholder(cm); + var elt = cm.state.placeholder = document.createElement("pre"); + elt.style.cssText = "height: 0; overflow: visible"; + elt.className = "CodeMirror-placeholder"; + elt.appendChild(document.createTextNode(cm.getOption("placeholder"))); + cm.display.lineSpace.insertBefore(elt, cm.display.lineSpace.firstChild); + } + + function onBlur(cm) { + if (isEmpty(cm)) setPlaceholder(cm); + } + function onChange(cm) { + var wrapper = cm.getWrapperElement(), empty = isEmpty(cm); + wrapper.className = wrapper.className.replace(" CodeMirror-empty", "") + (empty ? 
" CodeMirror-empty" : ""); + + if (empty) setPlaceholder(cm); + else clearPlaceholder(cm); + } + + function isEmpty(cm) { + return (cm.lineCount() === 1) && (cm.getLine(0) === ""); + } +}); Property changes on: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/cm-addons/placeholder.js ___________________________________________________________________ Added: svn:executable ## -0,0 +1 ## +* \ No newline at end of property Added: branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/codemirror.js =================================================================== --- branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/codemirror.js (rev 0) +++ branches/BIGDATA_RELEASE_1_3_0/bigdata-war/src/html/js/vendor/codemirror.js 2014-05-08 23:17:39 UTC (rev 8240) @@ -0,0 +1,7526 @@ +// This is CodeMirror (http://codemirror.net), a code editor +// implemented in JavaScript on top of the browser's DOM. +// +// You can find some technical background for some of the code below +// at http://marijnhaverbeke.nl/blog/#cm-internals . + +(function(mod) { + if (typeof exports == "object" && typeof module == "object") // CommonJS + module.exports = mod(); + else if (typeof define == "function" && define.amd) // AMD + return define([], mod); + else // Plain browser env + this.CodeMirror = mod(); +})(function() { + "use strict"; + + // BROWSER SNIFFING + + // Kludges for bugs and behavior differences that can't be feature + // detected are enabled based on userAgent etc sniffing. 
+ + var gecko = /gecko\/\d/i.test(navigator.userAgent); + // ie_uptoN means Internet Explorer version N or lower + var ie_upto10 = /MSIE \d/.test(navigator.userAgent); + var ie_upto7 = ie_upto10 && (document.documentMode == null || document.documentMode < 8); + var ie_upto8 = ie_upto10 && (document.documentMode == null || document.documentMode < 9); + var ie_upto9 = ie_upto10 && (document.documentMode == null || document.documentMode < 10); + var ie_11up = /Trident\/([7-9]|\d{2,})\./.test(navigator.userAgent); + var ie = ie_upto10 || ie_11up; + var webkit = /WebKit\//.test(navigator.userAgent); + var qtwebkit = webkit && /Qt\/\d+\.\d+/.test(navigator.userAgent); + var chrome = /Chrome\//.test(navigator.userAgent); + var presto = /Opera\//.test(navigator.userAgent); + var safari = /Apple Computer/.test(navigator.vendor); + var khtml = /KHTML\//.test(navigator.userAgent); + var mac_geMountainLion = /Mac OS X 1\d\D([8-9]|\d\d)\D/.test(navigator.userAgent); + var phantom = /PhantomJS/.test(navigator.userAgent); + + var ios = /AppleWebKit/.test(navigator.userAgent) && /Mobile\/\w+/.test(navigator.userAgent); + // This is woefully incomplete. Suggestions for alternative methods welcome. + var mobile = ios || /Android|webOS|BlackBerry|Opera Mini|Opera Mobi|IEMobile/i.test(navigator.userAgent); + var mac = ios || /Mac/.test(navigator.platform); + var windows = /win/i.test(navigator.platform); + + var presto_version = presto && navigator.userAgent.match(/Version\/(\d*\.\d*)/); + if (presto_version) presto_version = Number(presto_version[1]); + if (presto_version && presto_version >= 15) { presto = false; webkit = true; } + // Some browsers use the wrong event properties to signal cmd/ctrl on OS X + var flipCtrlCmd = mac && (qtwebkit || presto && (presto_version == null || presto_version < 12.11)); + var captureRightClick = gecko || (ie && !ie_upto8); + + // Optimize some code when these features are not used. 
+ var sawReadOnlySpans = false, sawCollapsedSpans = false; + + // EDITOR CONSTRUCTOR + + // A CodeMirror instance represents an editor. This is the object + // that user code is usually dealing with. + + function CodeMirror(place, options) { + if (!(this instanceof CodeMirror)) return new CodeMirror(place, options); + + this.options = options = options || {}; + // Determine effective options based on given values and defaults. + copyObj(defaults, options, false); + setGuttersForLineNumbers(options); + + var doc = options.value; + if (typeof doc == "string") doc = new Doc(doc, options.mode); + this.doc = doc; + + var display = this.display = new Display(place, doc); + display.wrapper.CodeMirror = this; + updateGutters(this); + themeChanged(this); + if (options.lineWrapping) + this.display.wrapper.className += " CodeMirror-wrap"; + if (options.autofocus && !mobile) focusInput(this); + + this.state = { + keyMaps: [], // stores maps added by addKeyMap + overlays: [], // highlighting overlays, as added by addOverlay + modeGen: 0, // bumped when mode/overlay changes, used to invalidate highlighting info + overwrite: false, focused: false, + suppressEdits: false, // used to disable editing during key handlers when in readOnly mode + pasteIncoming: false, cutIncoming: false, // help recognize paste/cut edits in readInput + draggingText: false, + highlight: new Delayed() // stores highlight worker timeout + }; + + // Override magic textarea content restore that IE sometimes does + // on our hidden textarea on reload + if (ie_upto10) setTimeout(bind(resetInput, this, true), 20); + + registerEventHandlers(this); + + var cm = this; + runInOp(this, function() { + cm.curOp.forceUpdate = true; + attachDoc(cm, doc); + + if ((options.autofocus && !mobile) || activeElt() == display.input) + setTimeout(bind(onFocus, cm), 20); + else + onBlur(cm); + + for (var opt in optionHandlers) if (optionHandlers.hasOwnProperty(opt)) + optionHandlers[opt](cm, options[opt], Init); + for (var i = 
0; i < initHooks.length; ++i) initHooks[i](cm); + }); + } + + // DISPLAY CONSTRUCTOR + + // The display handles the DOM integration, both for input reading + // and content drawing. It holds references to DOM nodes and + // display-related state. + + function Display(place, doc) { + var d = this; + + // The semihidden textarea that is focused when the editor is + // focused, and receives input. + var input = d.input = elt("textarea", null, null, "position: absolute; padding: 0; width: 1px; height: 1em; outline: none"); + // The textarea is kept positioned near the cursor to prevent the + // fact that it'll be scrolled into view on input from scrolling + // our fake cursor out of view. On webkit, when wrap=off, paste is + // very slow. So make the area wide instead. + if (webkit) input.style.width = "1000px"; + else input.setAttribute("wrap", "off"); + // If border: 0; -- iOS fails to open keyboard (issue #1287) + if (ios) input.style.border = "1px solid black"; + input.setAttribute("autocorrect", "off"); input.setAttribute("autocapitalize", "off"); input.setAttribute("spellcheck", "false"); + + // Wraps and hides input textarea + d.inputDiv = elt("div", [input], null, "overflow: hidden; position: relative; width: 3px; height: 0px;"); + // The fake scrollbar elements. + d.scrollbarH = elt("div", [elt("div", null, null, "height: 100%; min-height: 1px")], "CodeMirror-hscrollbar"); + d.scrollbarV = elt("div", [elt("div", null, null, "min-width: 1px")], "CodeMirror-vscrollbar"); + // Covers bottom-right square when both scrollbars are present. + d.scrollbarFiller = elt("div", null, "CodeMirror-scrollbar-filler"); + // Covers bottom of gutter when coverGutterNextToScrollbar is on + // and h scrollbar is present. + d.gutterFiller = elt("div", null, "CodeMirror-gutter-filler"); + // Will contain the actual code, positioned to cover the viewport. + d.lineDiv = elt("div", null, "CodeMirror-code"); + // Elements are added to these to represent selection and cursors. 
+ d.selectionDiv = elt("div", null, null, "position: relative; z-index: 1"); + d.cursorDiv = elt("div", null, "CodeMirror-cursors"); + // A visibility: hidden element used to find the size of things. + d.measure = elt("div", null, "CodeMirror-measure"); + // When lines outside of the viewport are measured, they are drawn in this. + d.lineMeasure = elt("div", null, "CodeMirror-measure"); + // Wraps everything that needs to exist inside the vertically-padded coordinate system + d.lineSpace = elt("div", [d.measure, d.lineMeasure, d.selectionDiv, d.cursorDiv, d.lineDiv], + null, "position: relative; outline: none"); + // Moved around its parent to cover visible view. + d.mover = elt("div", [elt("div", [d.lineSpace], "CodeMirror-lines")], null, "position: relative"); + // Set to the height of the document, allowing scrolling. + d.sizer = elt("div", [d.mover], "CodeMirror-sizer"); + // Behavior of elts with overflow: auto and padding is + // inconsistent across browsers. This is used to ensure the + // scrollable area is big enough. + d.heightForcer = elt("div", null, null, "position: absolute; height: " + scrollerCutOff + "px; width: 1px;"); + // Will contain the gutters, if any. + d.gutters = elt("div", null, "CodeMirror-gutters"); + d.lineGutter = null; + // Actual scrollable element. + d.scroller = elt("div", [d.sizer, d.heightForcer, d.gutters], "CodeMirror-scroll"); + d.scroller.setAttribute("tabIndex", "-1"); + // The element in which the editor lives. 
+ d.wrapper = elt("div", [d.inputDiv, d.scrollbarH, d.scrollbarV, + d.scrollbarFiller, d.gutterFiller, d.scroller], "CodeMirror"); + + // Work around IE7 z-index bug (not perfect, hence IE7 not really being supported) + if (ie_upto7) { d.gutters.style.zIndex = -1; d.scroller.style.paddingRight = 0; } + // Needed to hide big blue blinking cursor on Mobile Safari + if (ios) input.style.width = "0px"; + if (!webkit) d.scroller.draggable = true; + // Needed to handle Tab key in KHTML + if (khtml) { d.inputDiv.style.height = "1px"; d.inputDiv.style.position = "absolute"; } + // Need to set a minimum width to see the scrollbar on IE7 (but must not set it on IE8). + if (ie_upto7) d.scrollbarH.style.minHeight = d.scrollbarV.style.minWidth = "18px"; + + if (place.appendChild) place.appendChild(d.wrapper); + else place(d.wrapper); + + // Current rendered range (may be bigger than the view window). + d.viewFrom = d.viewTo = doc.first; + // Information about the rendered lines. + d.view = []; + // Holds info about a single rendered line when it was rendered + // for measurement, while not in view. + d.externalMeasured = null; + // Empty space (in pixels) above the view + d.viewOffset = 0; + d.lastSizeC = 0; + d.updateLineNumbers = null; + + // Used to only resize the line number gutter when necessary (when + // the amount of lines crosses a boundary that makes its width change) + d.lineNumWidth = d.lineNumInnerWidth = d.lineNumChars = null; + // See readInput and resetInput + d.prevInput = ""; + // Set to true when a non-horizontal-scrolling line widget is + // added. As an optimization, line widget aligning is skipped when + // this is false. + d.alignWidgets = false; + // Flag that indicates whether we expect input to appear real soon + // now (after some event like 'keypress' or 'input') and are + // polling intensively. 
+ d.pollingFast = false; + // Self-resetting timeout for the poller + d.poll = new Delayed(); + + d.cachedCharWidth = d.cachedTextHeight = d.cachedPaddingH = null; + + // Tracks when resetInput has punted to just putting a short + // string into the textarea instead of the full selection. + d.inaccurateSelection = false; + + // Tracks the maximum line length so that the horizontal scrollbar + // can be kept static when scrolling. + d.maxLine = null; + d.maxLineLength = 0; + d.maxLineChanged = false; + + // Used for measuring wheel scrolling granularity + d.wheelDX = d.wheelDY = d.wheelStartX = d.wheelStartY = null; + + // True when shift is held down. + d.shift = false; + } + + // STATE UPDATES + + // Used to get the editor into a consistent state again when options change. + + function loadMode(cm) { + cm.doc.mode = CodeMirror.getMode(cm.options, cm.doc.modeOption); + resetModeState(cm); + } + + function resetModeState(cm) { + cm.doc.iter(function(line) { + if (line.stateAfter) line.stateAfter = null; + if (line.styles) line.styles = null; + }); + cm.doc.frontier = cm.doc.first; + startWorker(cm, 100); + cm.state.modeGen++; + if (cm.curOp) regChange(cm); + } + + function wrappingChanged(cm) { + if (cm.options.lineWrapping) { + addClass(cm.display.wrapper, "CodeMirror-wrap"); + cm.display.sizer.style.minWidth = ""; + } else { + rmClass(cm.display.wrapper, "CodeMirror-wrap"); + findMaxLine(cm); + } + estimateLineHeights(cm); + regChange(cm); + clearCaches(cm); + setTimeout(function(){updateScrollbars(cm);}, 100); + } + + // Returns a function that estimates the height of a line, to use as + // first approximation until the line becomes visible (and is thus + // properly measurable). 
+ function estimateHeight(cm) { + var th = textHeight(cm.display), wrapping = cm.options.lineWrapping; + var perLine = wrapping && Math.max(5, cm.display.scroller.clientWidth / charWidth(cm.display) - 3); + return function(line) { + if (lineIsHidden(cm.doc, line)) return 0; + + var widgetsHeight = 0; + if (line.widgets) for (var i = 0; i < line.widgets.length; i++) { + if (line.widgets[i].height) widgetsHeight += line.widgets[i].height; + } + + if (wrapping) + return widgetsHeight + (Math.ceil(line.text.length / perLine) || 1) * th; + else + return widgetsHeight + th; + }; + } + + function estimateLineHeights(cm) { + var doc = cm.doc, est = estimateHeight(cm); + doc.iter(function(line) { + var estHeight = est(line); + if (estHeight != line.height) updateLineHeight(line, estHeight); + }); + } + + function keyMapChanged(cm) { + var map = keyMap[cm.options.keyMap], style = map.style; + cm.display.wrapper.className = cm.display.wrapper.className.replace(/\s*cm-keymap-\S+/g, "") + + (style ? " cm-keymap-" + style : ""); + } + + function themeChanged(cm) { + cm.display.wrapper.className = cm.display.wrapper.className.replace(/\s*cm-s-\S+/g, "") + + cm.options.theme.replace(/(^|\s)\s*/g, " cm-s-"); + clearCaches(cm); + } + + function guttersChanged(cm) { + updateGutters(cm); + regChange(cm); + setTimeout(function(){alignHorizontally(cm);}, 20); + } + + // Rebuild the gutter elements, ensure the margin to the left of the + // code matches their width. + function updateGutters(cm) { + var gutters = cm.display.gutters, specs = cm.options.gutters; + removeChildren(gutters); + for (var i = 0; i < specs.length; ++i) { + var gutterClass = specs[i]; + var gElt = gutters.appendChild(elt("div", null, "CodeMirror-gutter " + gutterClass)); + if (gutterClass == "CodeMirror-linenumbers") { + cm.display.lineGutter = gElt; + gElt.style.width = (cm.display.lineNumWidth || 1) + "px"; + } + } + gutters.style.display = i ? 
"" : "none"; + updateGutterSpace(cm); + } + + function updateGutterSpace(cm) { + var width = cm.display.gutters.offsetWidth; + cm.display.sizer.style.marginLeft = width + "px"; + cm.display.scrollbarH.style.left = cm.options.fixedGutter ? width + "px" : 0; + } + + // Compute the character length of a line, taking into account + // collapsed ranges (see markText) that might hide parts, and join + // other lines onto it. + function lineLength(line) { + if (line.height == 0) return 0; + var len = line.text.length, merged, cur = line; + while (merged = collapsedSpanAtStart(cur)) { + var found = merged.find(0, true); + cur = found.from.line; + len += found.from.ch - found.to.ch; + } + cur = line; + while (merged = collapsedSpanAtEnd(cur)) { + var found = merged.find(0, true); + len -= cur.text.length - found.from.ch; + cur = found.to.line; + len += cur.text.length - found.to.ch; + } + return len; + } + + // Find the longest line in the document. + function findMaxLine(cm) { + var d = cm.display, doc = cm.doc; + d.maxLine = getLine(doc, doc.first); + d.maxLineLength = lineLength(d.maxLine); + d.maxLineChanged = true; + doc.iter(function(line) { + var len = lineLength(line); + if (len > d.maxLineLength) { + d.maxLineLength = len; + d.maxLine = line; + } + }); + } + + // Make sure the gutters options contains the element + // "CodeMirror-linenumbers" when the lineNumbers option is true. + function setGuttersForLineNumbers(options) { + var found = indexOf(options.gutters, "CodeMirror-linenumbers"); + if (found == -1 && options.lineNumbers) { + options.gutters = options.gutters.concat(["CodeMirror-linenumbers"]); + } else if (found > -1 && !options.lineNumbers) { + options.gutters = options.gutters.slice(0); + options.gutters.splice(found, 1); + } + } + + // SCROLLBARS + + // Prepare DOM reads needed to update the scrollbars. Done in one + // shot to minimize update/measure roundtrips. 
+ function measureForScrollbars(cm) { + var scroll = cm.display.scroller; + return { + clientHeight: scroll.clientHeight, + barHeight: cm.display.scrollbarV.clientHeight, + scrollWidth: scroll.scrollWidth, clientWidth: scroll.clientWidth, + barWidth: cm.display.scrollbarH.clientWidth, + docHeight: Math.round(cm.doc.height + paddingVert(cm.display)) + }; + } + + // Re-synchronize the fake scrollbars with the actual size of the + // content. + function updateScrollbars(cm, measure) { + if (!measure) measure = measureForScrollbars(cm); + var d = cm.display; + var scrollHeight = measure.docHeight + scrollerCutOff; + var needsH = measure.scrollWidth > measure.clientWidth; + var needsV = scrollHeight > measure.clientHeight; + if (needsV) { + d.scrollbarV.style.display = "block"; + d.scrollbarV.style.bottom = needsH ? scrollbarWidth(d.measure) + "px" : "0"; + // A bug in IE8 can cause this value to be negative, so guard it. + d.scrollbarV.firstChild.style.height = + Math.max(0, scrollHeight - measure.clientHeight + (measure.barHeight || d.scrollbarV.clientHeight)) + "px"; + } else { + d.scrollbarV.style.display = ""; + d.scrollbarV.firstChild.style.height = "0"; + } + if (needsH) { + d.scrollbarH.style.display = "block"; + d.scrollbarH.style.right = needsV ? 
scrollbarWidth(d.measure) + "px" : "0"; + d.scrollbarH.firstChild.style.width = + (measure.scrollWidth - measure.clientWidth + (measure.barWidth || d.scrollbarH.clientWidth)) + "px"; + } else { + d.scrollbarH.style.display = ""; + d.scrollbarH.firstChild.style.width = "0"; + } + if (needsH && needsV) { + d.scrollbarFiller.style.display = "block"; + d.scrollbarFiller.style.height = d.scrollbarFiller.style.width = scrollbarWidth(d.measure) + "px"; + } else d.scrollbarFiller.style.display = ""; + if (needsH && cm.options.coverGutterNextToScrollbar && cm.options.fixedGutter) { + d.gutterFiller.style.display = "block"; + d.gutterFiller.style.height = scrollbarWidth(d.measure) + "px"; + d.gutterFiller.style.width = d.gutters.offsetWidth + "px"; + } else d.gutterFiller.style.display = ""; + + if (!cm.state.checkedOverlayScrollbar && measure.clientHeight > 0) { + if (scrollbarWidth(d.measure) === 0) { + var w = mac && !mac_geMountainLion ? "12px" : "18px"; + d.scrollbarV.style.minWidth = d.scrollbarH.style.minHeight = w; + var barMouseDown = function(e) { + if (e_target(e) != d.scrollbarV && e_target(e) != d.scrollbarH) + operation(cm, onMouseDown)(e); + }; + on(d.scrollbarV, "mousedown", barMouseDown); + on(d.scrollbarH, "mousedown", barMouseDown); + } + cm.state.checkedOverlayScrollbar = true; + } + } + + // Compute the lines that are visible in a given viewport (defaults + // the the current scroll position). viewPort may contain top, + // height, and ensure (see op.scrollToPos) properties. + function visibleLines(display, doc, viewPort) { + var top = viewPort && viewPort.top != null ? viewPort.top : display.scroller.scrollTop; + top = Math.floor(top - paddingTop(display)); + var bottom = viewPort && viewPort.bottom != null ? viewPort.bottom : top + display.wrapper.clientHeight; + + var from = lineAtHeight(doc, top), to = lineAtHeight(doc, bottom); + // Ensure is a {from: {line, ch}, to: {line, ch}} object, and + // forces those lines into the viewport (if possible). 
+ if (viewPort && viewPort.ensure) { + var ensureFrom = viewPort.ensure.from.line, ensureTo = viewPort.ensure.to.line; + if (ensureFrom < from) + return {from: ensureFrom, + to: lineAtHeight(doc, heightAtLine(getLine(doc, ensureFrom)) + display.wrapper.clientHeight)}; + if (Math.min(ensureTo, doc.lastLine()) >= to) + return {from: lineAtHeight(doc, heightAtLine(getLine(doc, ensureTo)) - display.wrapper.clientHeight), + to: ensureTo}; + } + return {from: from, to: to}; + } + + // LINE NUMBERS + + // Re-align line numbers and gutter marks to compensate for + // horizontal scrolling. + function alignHorizontally(cm) { + var display = cm.display, view = display.view; + if (!display.alignWidgets && (!display.gutters.firstChild || !cm.options.fixedGutter)) return; + var comp = compensateForHScroll(display) - display.scroller.scrollLeft + cm.doc.scrollLeft; + var gutterW = display.gutters.offsetWidth, left = comp + "px"; + for (var i = 0; i < view.length; i++) if (!view[i].hidden) { + if (cm.options.fixedGutter && view[i].gutter) + view[i].gutter.style.left = left; + var align = view[i].alignable; + if (align) for (var j = 0; j < align.length; j++) + align[j].style.left = left; + } + if (cm.options.fixedGutter) + display.gutters.style.left = (comp + gutterW) + "px"; + } + + // Used to ensure that the line number gutter is still the right + // size for the current document size. Returns true when an update + // is needed. 
+  function maybeUpdateLineNumberWidth(cm) {
+    if (!cm.options.lineNumbers) return false;
+    var doc = cm.doc, last = lineNumberFor(cm.options, doc.first + doc.size - 1), display = cm.display;
+    if (last.length != display.lineNumChars) {
+      var test = display.measure.appendChild(elt("div", [elt("div", last)],
+                                                 "CodeMirror-linenumber CodeMirror-gutter-elt"));
+      var innerW = test.firstChild.offsetWidth, padding = test.offsetWidth - innerW;
+      display.lineGutter.style.width = "";
+      display.lineNumInnerWidth = Math.max(innerW, display.lineGutter.offsetWidth - padding);
+      display.lineNumWidth = display.lineNumInnerWidth + padding;
+      display.lineNumChars = display.lineNumInnerWidth ? last.length : -1;
+      display.lineGutter.style.width = display.lineNumWidth + "px";
+      updateGutterSpace(cm);
+      return true;
+    }
+    return false;
+  }
+
+  function lineNumberFor(options, i) {
+    return String(options.lineNumberFormatter(i + options.firstLineNumber));
+  }
+
+  // Computes display.scroller.scrollLeft + display.gutters.offsetWidth,
+  // but using getBoundingClientRect to get a sub-pixel-accurate
+  // result.
+  function compensateForHScroll(display) {
+    return display.scroller.getBoundingClientRect().left - display.sizer.getBoundingClientRect().left;
+  }
+
+  // DISPLAY DRAWING
+
+  // Updates the display, selection, and scrollbars, using the
+  // information in display.view to find out which nodes are no longer
+  // up-to-date. Tries to bail out early when no changes are needed,
+  // unless forced is true.
+  // Returns true if an actual update happened, false otherwise.
+  function updateDisplay(cm, viewPort, forced) {
+    var oldFrom = cm.display.viewFrom, oldTo = cm.display.viewTo, updated;
+    var visible = visibleLines(cm.display, cm.doc, viewPort);
+    for (var first = true;; first = false) {
+      var oldWidth = cm.display.scroller.clientWidth;
+      if (!updateDisplayInner(cm, visible, forced)) break;
+      updated = true;
+
+      // If the max line changed since it was last measured, measure it,
+      // and ensure the document's width matches it.
+      if (cm.display.maxLineChanged && !cm.options.lineWrapping)
+        adjustContentWidth(cm);
+
+      var barMeasure = measureForScrollbars(cm);
+      updateSelection(cm);
+      setDocumentHeight(cm, barMeasure);
+      updateScrollbars(cm, barMeasure);
+      if (webkit && cm.options.lineWrapping)
+        checkForWebkitWidthBug(cm, barMeasure); // (Issue #2420)
+      if (first && cm.options.lineWrapping && oldWidth != cm.display.scroller.clientWidth) {
+        forced = true;
+        continue;
+      }
+      forced = false;
+
+      // Clip forced viewport to actual scrollable area.
+      if (viewPort && viewPort.top != null)
+        viewPort = {top: Math.min(barMeasure.docHeight - scrollerCutOff - barMeasure.clientHeight, viewPort.top)};
+      // Updated line heights might result in the drawn area not
+      // actually covering the viewport. Keep looping until it does.
+      visible = visibleLines(cm.display, cm.doc, viewPort);
+      if (visible.from >= cm.display.viewFrom && visible.to <= cm.display.viewTo)
+        break;
+    }
+
+    cm.display.updateLineNumbers = null;
+    if (updated) {
+      signalLater(cm, "update", cm);
+      if (cm.display.viewFrom != oldFrom || cm.display.viewTo != oldTo)
+        signalLater(cm, "viewportChange", cm, cm.display.viewFrom, cm.display.viewTo);
+    }
+    return updated;
+  }
+
+  // Does the actual updating of the line display. Bails out
+  // (returning false) when there is nothing to be done and forced is
+  // false.
+  function updateDisplayInner(cm, visible, forced) {
+    var display = cm.display, doc = cm.doc;
+    if (!display.wrapper.offsetWidth) {
+      resetView(cm);
+      return;
+    }
+
+    // Bail out if the visible area is already rendered and nothing changed.
+    if (!forced && visible.from >= display.viewFrom && visible.to <= display.viewTo &&
+        countDirtyView(cm) == 0)
+      return;
+
+    if (maybeUpdateLineNumberWidth(cm))
+      resetView(cm);
+    var dims = getDimensions(cm);
+
+    // Compute a suitable new viewport (from & to)
+    var end = doc.first + doc.size;
+    var from = Math.max(visible.from - cm.options.viewportMargin, doc.first);
+    var to = Math.min(end, visible.to + cm.options.viewportMargin);
+    if (display.viewFrom < from && from - display.viewFrom < 20) from = Math.max(doc.first, display.viewFrom);
+    if (display.viewTo > to && display.viewTo - to < 20) to = Math.min(end, display.viewTo);
+    if (sawCollapsedSpans) {
+      from = visualLineNo(cm.doc, from);
+      to = visualLineEndNo(cm.doc, to);
+    }
+
+    var different = from != display.viewFrom || to != display.viewTo ||
+      display.lastSizeC != display.wrapper.clientHeight;
+    adjustView(cm, from, to);
+
+    display.viewOffset = heightAtLine(getLine(cm.doc, display.viewFrom));
+    // Position the mover div to align with the current scroll position
+    cm.display.mover.style.top = display.viewOffset + "px";
+
+    var toUpdate = countDirtyView(cm);
+    if (!different && toUpdate == 0 && !forced) return;
+
+    // For big changes, we hide the enclosing element during the
+    // update, since that speeds up the operations on most browsers.
+    var focused = activeElt();
+    if (toUpdate > 4) display.lineDiv.style.display = "none";
+    patchDisplay(cm, display.updateLineNumbers, dims);
+    if (toUpdate > 4) display.lineDiv.style.display = "";
+    // There might have been a widget with a focused element that got
+    // hidden or updated, if so re-focus it.
+    if (focused && activeElt() != focused && focused.offsetHeight) focused.focus();
+
+    // Prevent selection and cursors from interfering with the scroll
+    // width.
+    removeChildren(display.cursorDiv);
+    removeChildren(display.selectionDiv);
+
+    if (different) {
+      display.lastSizeC = display.wrapper.clientHeight;
+      startWorker(cm, 400);
+    }
+
+    updateHeightsInViewport(cm);
+
+    return true;
+  }
+
+  function adjustContentWidth(cm) {
+    var display = cm.display;
+    var width = measureChar(cm, display.maxLine, display.maxLine.text.length).left;
+    display.maxLineChanged = false;
+    var minWidth = Math.max(0, width + 3);
+    var maxScrollLeft = Math.max(0, display.sizer.offsetLeft + minWidth + scrollerCutOff - display.scroller.clientWidth);
+    display.sizer.style.minWidth = minWidth + "px";
+    if (maxScrollLeft < cm.doc.scrollLeft)
+      setScrollLeft(cm, Math.min(display.scroller.scrollLeft, maxScrollLeft), true);
+  }
+
+  function setDocumentHeight(cm, measure) {
+    cm.display.sizer.style.minHeight = cm.display.heightForcer.style.top = measure.docHeight + "px";
+    cm.display.gutters.style.height = Math.max(measure.docHeight, measure.clientHeight - scrollerCutOff) + "px";
+  }
+
+
+  function checkForWebkitWidthBug(cm, measure) {
+    // Work around Webkit bug where it sometimes reserves space for a
+    // non-existing phantom scrollbar in the scroller (Issue #2420)
+    if (cm.display.sizer.offsetWidth + cm.display.gutters.offsetWidth < cm.display.scroller.clientWidth - 1) {
+      cm.display.sizer.style.minHeight = cm.display.heightForcer.style.top = "0px";
+      cm.display.gutters.style.height = measure.docHeight + "px";
+    }
+  }
+
+  // Read the actual heights of the rendered lines, and update their
+  // stored heights to match.
+  function updateHeightsInViewport(cm) {
+    var display = cm.display;
+    var prevBottom = display.lineDiv.offsetTop;
+    for (var i = 0; i < display.view.length; i++) {
+      var cur = display.view[i], height;
+      if (cur.hidden) continue;
+      if (ie_upto7) {
+        var bot = cur.node.offsetTop + cur.node.offsetHeight;
+        height = bot - prevBottom;
+        prevBottom = bot;
+      } else {
+        var box = cur.node.getBoundingClientRect();
+        height = box.bottom - box.top;
+      }
+      var diff = cur.line.height - height;
+      if (height < 2) height = textHeight(display);
+      if (diff > .001 || diff < -.001) {
+        updateLineHeight(cur.line, height);
+        updateWidgetHeight(cur.line);
+        if (cur.rest) for (var j = 0; j < cur.rest.length; j++)
+          updateWidgetHeight(cur.rest[j]);
+      }
+    }
+  }
+
+  // Read and store the height of line widgets associated with the
+  // given line.
+  function updateWidgetHeight(line) {
+    if (line.widgets) for (var i = 0; i < line.widgets.length; ++i)
+      line.widgets[i].height = line.widgets[i].node.offsetHeight;
+  }
+
+  // Do a bulk-read of the DOM positions and sizes needed to draw the
+  // view, so that we don't interleave reading and writing to the DOM.
+  function getDimensions(cm) {
+    var d = cm.display, left = {}, width = {};
+    for (var n = d.gutters.firstChild, i = 0; n; n = n.nextSibling, ++i) {
+      left[cm.options.gutters[i]] = n.offsetLeft;
+      width[cm.options.gutters[i]] = n.offsetWidth;
+    }
+    return {fixedPos: compensateForHScroll(d),
+            gutterTotalWidth: d.gutters.offsetWidth,
+            gutterLeft: left,
+            gutterWidth: width,
+            wrapperWidth: d.wrapper.clientWidth};
+  }
+
+  // Sync the actual display DOM structure with display.view, removing
+  // nodes for lines that are no longer in view, and creating the ones
+  // that are not there yet, and updating the ones that are out of
+  // date.
+  function patchDisplay(cm, updateNumbersFrom, dims) {
+    var display = cm.display, lineNumbers = cm.options.lineNumbers;
+    var container = display.lineDiv, cur = container.firstChild;
+
+    function rm(node) {
+      var next = node.nextSibling;
+      // Works around a throw-scroll bug in OS X Webkit
+      if (webkit && mac && cm.display.currentWheelTarget == node)
+        node.style.display = "none";
+      else
+        node.parentNode.removeChild(node);
+      return next;
+    }
+
+    var view = display.view, lineN = display.viewFrom;
+    // Loop over the elements in the view, syncing cur (the DOM nodes
+    // in display.lineDiv) with the view as we go.
+    for (var i = 0; i < view.length; i++) {
+      var lineView = view[i];
+      if (lineView.hidden) {
+      } else if (!lineView.node) { // Not drawn yet
+        var node = buildLineElement(cm, lineView, lineN, dims);
+        container.insertBefore(node, cur);
+      } else { // Already drawn
+        while (cur != lineView.node) cur = rm(cur);
+        var updateNumber = lineNumbers && updateNumbersFrom != null &&
+          updateNumbersFrom <= lineN && lineView.lineNumber;
+        if (lineView.changes) {
+          if (indexOf(lineView.changes, "gutter") > -1) updateNumber = false;
+          updateLineForChanges(cm, lineView, lineN, dims);
+        }
+        if (updateNumber) {
+          removeChildren(lineView.lineNumber);
+          lineView.lineNumber.appendChild(document.createTextNode(lineNumberFor(cm.options, lineN)));
+        }
+        cur = lineView.node.nextSibling;
+      }
+      lineN += lineView.size;
+    }
+    while (cur) cur = rm(cur);
+  }
+
+  // When an aspect of a line changes, a string is added to
+  // lineView.changes. This updates the relevant part of the line's
+  // DOM structure.
+  function updateLineForChanges(cm, lineView, lineN, dims) {
+    for (var j = 0; j < lineView.changes.length; j++) {
+      var type = lineView.changes[j];
+      if (type == "text") updateLineText(cm, lineView);
+      else if (type == "gutter") updateLineGutter(cm, lineView, lineN, dims);
+      else if (type == "class") updateLineClasses(lineView);
+      else if (type == "widget") updateLineWidgets(lineView, dims);
+    }
+    lineView.changes = null;
+  }
+
+  // Lines with gutter elements, widgets or a background class need to
+  // be wrapped, and have the extra elements added to the wrapper div
+  function ensureLineWrapped(lineView) {
+    if (lineView.node == lineView.text) {
+      lineView.node = elt("div", null, null, "position: relative");
+      if (lineView.text.parentNode)
+        lineView.text.parentNode.replaceChild(lineView.node, lineView.text);
+      lineView.node.appendChild(lineView.text);
+      if (ie_upto7) lineView.node.style.zIndex = 2;
+    }
+    return lineView.node;
+  }
+
+  function updateLineBackground(lineView) {
+    var cls = lineView.bgClass ? lineView.bgClass + " " + (lineView.line.bgClass || "") : lineView.line.bgClass;
+    if (cls) cls += " CodeMirror-linebackground";
+    if (lineView.background) {
+      if (cls) lineView.background.className = cls;
+      else { lineView.background.parentNode.removeChild(lineView.background); lineView.background = null; }
+    } else if (cls) {
+      var wrap = ensureLineWrapped(lineView);
+      lineView.background = wrap.insertBefore(elt("div", null, cls), wrap.firstChild);
+    }
+  }
+
+  // Wrapper around buildLineContent which will reuse the structure
+  // in display.externalMeasured when possible.
+  function getLineContent(cm, lineView) {
+    var ext = cm.display.externalMeasured;
+    if (ext && ext.line == lineView.line) {
+      cm.display.externalMeasured = null;
+      lineView.measure = ext.measure;
+      return ext.built;
+    }
+    return buildLineContent(cm, lineView);
+  }
+
+  // Redraw the line's text. Interacts with the background and text
+  // classes because the mode may output tokens that influence these
+  // classes.
+  function updateLineText(cm, lineView) {
+    var cls = lineView.text.className;
+    var built = getLineContent(cm, lineView);
+    if (lineView.text == lineView.node) lineView.node = built.pre;
+    lineView.text.parentNode.replaceChild(built.pre, lineView.text);
+    lineView.text = built.pre;
+    if (built.bgClass != lineView.bgClass || built.textClass != lineView.textClass) {
+      lineView.bgClass = built.bgClass;
+      lineView.textClass = built.textClass;
+      updateLineClasses(lineView);
+    } else if (cls) {
+      lineView.text.className = cls;
+    }
+  }
+
+  function updateLineClasses(lineView) {
+    updateLineBackground(lineView);
+    if (lineView.line.wrapClass)
+      ensureLineWrapped(lineView).className = lineView.line.wrapClass;
+    else if (lineView.node != lineView.text)
+      lineView.node.className = "";
+    var textClass = lineView.textClass ? lineView.textClass + " " + (lineView.line.textClass || "") : lineView.line.textClass;
+    lineView.text.className = textClass || "";
+  }
+
+  function updateLineGutter(cm, lineView, lineN, dims) {
+    if (lineView.gutter) {
+      lineView.node.removeChild(lineView.gutter);
+      lineView.gutter = null;
+    }
+    var markers = lineView.line.gutterMarkers;
+    if (cm.options.lineNumbers || markers) {
+      var wrap = ensureLineWrapped(lineView);
+      var gutterWrap = lineView.gutter =
+        wrap.insertBefore(elt("div", null, "CodeMirror-gutter-wrapper", "position: absolute; left: " +
+                              (cm.options.fixedGutter ? dims.fixedPos : -dims.gutterTotalWidth) + "px"),
+                          lineView.text);
+      if (cm.options.lineNumbers && (!markers || !markers["CodeMirror-linenumbers"]))
+        lineView.lineNumber = gutterWrap.appendChild(
+          elt("div", lineNumberFor(cm.options, lineN),
+              "CodeMirror-linenumber CodeMirror-gutter-elt",
+              "left: " + dims.gutterLeft["CodeMirror-linenumbers"] + "px; width: "
+              + cm.display.lineNumInnerWidth + "px"));
+      if (markers) for (var k = 0; k < cm.options.gutters.length; ++k) {
+        var id = cm.options.gutters[k], found = markers.hasOwnProperty(id) && markers[id];
+        if (found)
+          gutterWrap.appendChild(elt("div", [found], "CodeMirror-gutter-elt", "left: " +
+                                     dims.gutterLeft[id] + "px; width: " + dims.gutterWidth[id] + "px"));
+      }
+    }
+  }
+
+  function updateLineWidgets(lineView, dims) {
+    if (lineView.alignable) lineView.alignable = null;
+    for (var node = lineView.node.firstChild, next; node; node = next) {
+      var next = node.nextSibling;
+      if (node.className == "CodeMirror-linewidget")
+        lineView.node.removeChild(node);
+    }
+    insertLineWidgets(lineView, dims);
+  }
+
+  // Build a line's DOM representation from scratch
+  function buildLineElement(cm, lineView, lineN, dims) {
+    var built = getLineContent(cm, lineView);
+    lineView.text = lineView.node = built.pre;
+    if (built.bgClass) lineView.bgClass = built.bgClass;
+    if (built.textClass) lineView.textClass = built.textClass;
+
+    updateLineClasses(lineView);
+    updateLineGutter(cm, lineView, lineN, dims);
+    insertLineWidgets(lineView, dims);
+    return lineView.node;
+  }
+
+  // A lineView may contain multiple logical lines (when merged by
+  // collapsed spans). The widgets for all of them need to be drawn.
+  function insertLineWidgets(lineView, dims) {
+    insertLineWidgetsFor(lineView.line, lineView, dims, true);
+    if (lineView.rest) for (var i = 0; i < lineView.rest.length; i++)
+      insertLineWidgetsFor(lineView.rest[i], lineView, dims, false);
+  }
+
+  function insertLineWidgetsFor(line, lineView, dims, allowAbove) {
+    if (!line.widgets) return;
+    var wrap = ensureLineWrapped(lineView);
+    for (var i = 0, ws = line.widgets; i < ws.length; ++i) {
+      var widget = ws[i], node = elt("div", [widget.node], "CodeMirror-linewidget");
+      if (!widget.handleMouseEvents) node.ignoreEvents = true;
+      positionLineWidget(widget, node, lineView, dims);
+      if (allowAbove && widget.above)
+        wrap.insertBefore(node, lineView.gutter || lineView.text);
+      else
+        wrap.appendChild(node);
+      signalLater(widget, "redraw");
+    }
+  }
+
+  function positionLineWidget(widget, node, lineView, dims) {
+    if (widget.noHScroll) {
+      (lineView.alignable || (lineView.alignable = [])).push(node);
+      var width = dims.wrapperWidth;
+      node.style.left = dims.fixedPos + "px";
+      if (!widget.coverGutter) {
+        width -= dims.gutterTotalWidth;
+        node.style.paddingLeft = dims.gutterTotalWidth + "px";
+      }
+      node.style.width = width + "px";
+    }
+    if (widget.coverGutter) {
+      node.style.zIndex = 5;
+      node.style.position = "relative";
+      if (!widget.noHScroll) node.style.marginLeft = -dims.gutterTotalWidth + "px";
+    }
+  }
+
+  // POSITION OBJECT
+
+  // A Pos instance represents a position within the text.
+  var Pos = CodeMirror.Pos = function(line, ch) {
+    if (!(this instanceof Pos)) return new Pos(line, ch);
+    this.line = line; this.ch = ch;
+  };
+
+  // Compare two positions, return 0 if they are the same, a negative
+  // number when a is less, and a positive number otherwise.
+  var cmp = CodeMirror.cmpPos = function(a, b) { return a.line - b.line || a.ch - b.ch; };
+
+  function copyPos(x) {return Pos(x.line, x.ch);}
+  function maxPos(a, b) { return cmp(a, b) < 0 ? b : a; }
+  function minPos(a, b) { return cmp(a, b) < 0 ? a : b; }
+
+  // SELECTION / CURSOR
+
+  // Selection objects are immutable. A new one is created every time
+  // the selection changes. A selection is one or more non-overlapping
+  // (and non-touching) ranges, sorted, and an integer that indicates
+  // which one is the primary selection (the one that's scrolled into
+  // view, that getCursor returns, etc).
+  function Selection(ranges, primIndex) {
+    this.ranges = ranges;
+    this.primIndex = primIndex;
+  }
+
+  Selection.prototype = {
+    primary: function() { return this.ranges[this.primIndex]; },
+    equals: function(other) {
+      if (other == this) return true;
+      if (other.primIndex != this.primIndex || other.ranges.length != this.ranges.length) return false;
+      for (var i = 0; i < this.ranges.length; i++) {
+        var here = this.ranges[i], there = other.ranges[i];
+        if (cmp(here.anchor, there.anchor) != 0 || cmp(here.head, there.head) != 0) return false;
+      }
+      return true;
+    },
+    deepCopy: function() {
+      for (var out = [], i = 0; i < this.ranges.length; i++)
+        out[i] = new Range(copyPos(this.ranges[i].anchor), copyPos(this.ranges[i].head));
+      return new Selection(out, this.primIndex);
+    },
+    somethingSelected: function() {
+      for (var i = 0; i < this.ranges.length; i++)
+        if (!this.ranges[i].empty()) return true;
+      return false;
+    },
+    contains: function(pos, end) {
+      if (!end) end = pos;
+      for (var i = 0; i < this.ranges.length; i++) {
+        var range = this.ranges[i];
+        if (cmp(end, range.from()) >= 0 && cmp(pos, range.to()) <= 0)
+          return i;
+      }
+      return -1;
+    }
+  };
+
+  function Range(anchor, head) {
+    this.anchor = anchor; this.head = head;
+  }
+
+  Range.prototype = {
+    from: function() { return minPos(this.anchor, this.head); },
+    to: function() { return maxPos(this.anchor, this.head); },
+    empty: function() {
+      return this.head.line == this.anchor.line && this.head.ch == this.anchor.ch;
+    }
+  };
+
+  // Take an unsorted, potentially overlapping set of ranges, and
+  // build a selection out of it. 'Consumes' ranges array (modifying
+  // it).
+  function normalizeSelection(ranges, primIndex) {
+    var prim = ranges[primIndex];
+    ranges.sort(function(a, b) { return cmp(a.from(), b.from()); });
+    primIndex = indexOf(ranges, prim);
+    for (var i = 1; i < ranges.length; i++) {
+      var cur = ranges[i], prev = ranges[i - 1];
+      if (cmp(prev.to(), cur.from()) >= 0) {
+        var from = minPos(prev.from(), cur.from()), to = maxPos(prev.to(), cur.to());
+        var inv = prev.empty() ? cur.from() == cur.head : prev.from() == prev.head;
+        if (i <= primIndex) --primIndex;
+        ranges.splice(--i, 2, new Range(inv ? to : from, inv ? from : to));
+      }
+    }
+    return new Selection(ranges, primIndex);
+  }
+
+  function simpleSelection(anchor, head) {
+    return new Selection([new Range(anchor, head || anchor)], 0);
+  }
+
+  // Most of the external API clips given positions to make sure they
+  // actually exist within the document.
+  function clipLine(doc, n) {return Math.max(doc.first, Math.min(n, doc.first + doc.size - 1));}
+  function clipPos(doc, pos) {
+    if (pos.line < doc.first) return Pos(doc.first, 0);
+    var last = doc.first + doc.size - 1;
+    if (pos.line > last) return Pos(last, getLine(doc, last).text.length);
+    return clipToLen(pos, getLine(doc, pos.line).text.length);
+  }
+  function clipToLen(pos, linelen) {
+    var ch = pos.ch;
+    if (ch == null || ch > linelen) return Pos(pos.line, linelen);
+    else if (ch < 0) return Pos(pos.line, 0);
+    else return pos;
+  }
+  function isLine(doc, l) {return l >= doc.first && l < doc.first + doc.size;}
+  function clipPosArray(doc, array) {
+    for (var out = [], i = 0; i < array.length; i++) out[i] = clipPos(doc, array[i]);
+    return out;
+  }
+
+  // SELECTION UPDATES
+
+  // The 'scroll' parameter given to many of these indicated whether
+  // the new cursor position should be scrolled into view after
+  // modifying the selection.
+
+  // If shift is held or the extend flag is set, extends a range to
+  // include a given position (and optionally a second position).
+  // Otherwise, simply returns the range between the given positions.
+  // Used for cursor motion and such.
+  function extendRange(doc, range, head, other) {
+    if (doc.cm && doc.cm.display.shift || doc.extend) {
+      var anchor = range.anchor;
+      if (other) {
+        var posBefore = cmp(head, anchor) < 0;
+        if (posBefore != (cmp(other, anchor) < 0)) {
+          anchor = head;
+          head = other;
+        } else if (posBefore != (cmp(head, other) < 0)) {
+          head = other;
+        }
+      }
+      return new Range(anchor, head);
+    } else {
+      return new Range(other || head, head);
+    }
+  }
+
+  // Extend the primary selection range, discard the rest.
+  function extendSelection(doc, head, other, options) {
+    setSelection(doc, new Selection([extendRange(doc, doc.sel.primary(), head, other)], 0), options);
+  }
+
+  // Extend all selections (pos is an array of selections with length
+  // equal the number of selections)
+  function extendSelections(doc, heads, options) {
+    for (var out = [], i = 0; i < doc.sel.ranges.length; i++)
... [truncated message content]
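The position-ordering and range-merging rules in the diff above can be illustrated in isolation. The following is a simplified standalone sketch (not CodeMirror's actual module): `Pos`, `cmp`, `minPos`, and `maxPos` mirror the comparison logic shown in the commit, and `mergeIfOverlapping` is a hypothetical helper that applies the same overlap test and combined-extent merge that `normalizeSelection` performs on adjacent ranges.

```javascript
// Simplified stand-ins for the Pos/range logic in the commit above.
function Pos(line, ch) { return {line: line, ch: ch}; }

// Order by line first, then by character offset within the line.
function cmp(a, b) { return a.line - b.line || a.ch - b.ch; }
function minPos(a, b) { return cmp(a, b) < 0 ? a : b; }
function maxPos(a, b) { return cmp(a, b) < 0 ? b : a; }

// Two sorted ranges overlap (or touch) when the earlier one ends at or
// after the later one starts; the merge keeps the combined extent,
// as normalizeSelection does when it splices neighbouring ranges.
function mergeIfOverlapping(prev, cur) {
  if (cmp(prev.to, cur.from) >= 0)
    return {from: minPos(prev.from, cur.from), to: maxPos(prev.to, cur.to)};
  return null; // disjoint: nothing to merge
}

var merged = mergeIfOverlapping(
  {from: Pos(1, 0), to: Pos(2, 5)},
  {from: Pos(2, 3), to: Pos(4, 0)});
console.log(merged.from.line, merged.from.ch, merged.to.line, merged.to.ch); // 1 0 4 0
```

Touching ranges (where `prev.to` equals `cur.from`) merge as well, which is why the comment on `Selection` can promise "non-overlapping (and non-touching) ranges" after normalization.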
From: <mrp...@us...> - 2014-05-08 23:17:09
Revision: 8239
          http://sourceforge.net/p/bigdata/code/8239
Author:   mrpersonick
Date:     2014-05-08 23:17:07 +0000 (Thu, 08 May 2014)
Log Message:
-----------
trying again with build.xml

Modified Paths:
--------------
    branches/BLUEPRINTS/build.xml

Modified: branches/BLUEPRINTS/build.xml
===================================================================
--- branches/BLUEPRINTS/build.xml	2014-05-08 20:50:11 UTC (rev 8238)
+++ branches/BLUEPRINTS/build.xml	2014-05-08 23:17:07 UTC (rev 8239)
@@ -62,12 +62,12 @@
         <fileset dir="${bigdata.dir}/bigdata-sails/lib">
             <include name="**/*.jar" />
         </fileset>
+        <fileset dir="${bigdata.dir}/bigdata-blueprints/lib">
+            <include name="blueprints-core-${blueprints.version}.jar" />
+        </fileset>
         <fileset dir="${bigdata.dir}/bigdata-gom/lib">
             <include name="**/*.jar" />
        </fileset>
-        <fileset dir="${bigdata.dir}/bigdata-blueprints/lib">
-            <include name="**/*.jar" />
-        </fileset>
         <!--
         <fileset dir="${bigdata.dir}/ctc-striterator/lib">
             <include name="**/*.jar" />
@@ -231,8 +231,8 @@
             <src path="${bigdata.dir}/bigdata/src/java" />
             <src path="${bigdata.dir}/bigdata-jini/src/java" />
             <src path="${bigdata.dir}/bigdata-rdf/src/java" />
+            <src path="${bigdata.dir}/bigdata-sails/src/java" />
             <src path="${bigdata.dir}/bigdata-blueprints/src/java" />
-            <src path="${bigdata.dir}/bigdata-sails/src/java" />
             <src path="${bigdata.dir}/bigdata-gom/src/java" />
             <src path="${bigdata.dir}/bigdata-ganglia/src/java" />
             <src path="${bigdata.dir}/bigdata-gas/src/java" />
@@ -318,10 +318,10 @@
             <fileset dir="${bigdata.dir}/bigdata-rdf/src/samples" />
             <fileset dir="${bigdata.dir}/bigdata-sails/src/java" />
             <fileset dir="${bigdata.dir}/bigdata-sails/src/samples" />
+            <fileset dir="${bigdata.dir}/bigdata-blueprints/src/java" />
             <fileset dir="${bigdata.dir}/bigdata-gom/src/java" />
             <fileset dir="${bigdata.dir}/bigdata-gom/src/samples" />
             <fileset dir="${bigdata.dir}/ctc-striterators/src/java" />
-            <fileset dir="${bigdata.dir}/bigdata-blueprints/src/java" />
         </jar>
     </target>

@@ -376,6 +376,7 @@
             <fileset dir="bigdata-jini/src/java" />
             <fileset dir="bigdata-rdf/src/java" />
             <fileset dir="bigdata-sails/src/java" />
+            <fileset dir="bigdata-blueprints/src/java" />
             <fileset dir="bigdata-gom/src/java" />
         </jar>
         <bnd output="${build.dir}/bundles/com.bigata-${osgi.version}.jar" classpath="${build.dir}/classes" eclipse="false" failok="false" exceptions="true" files="${basedir}/osgi/bigdata.bnd" />
@@ -416,7 +417,7 @@
             <packageset dir="${bigdata.dir}/bigdata-sails/src/java" />
             <packageset dir="${bigdata.dir}/bigdata-sails/src/samples" />
             <packageset dir="${bigdata.dir}/bigdata-blueprints/src/java" />
-            <packageset dir="${bigdata.dir}/bigdata-gom/src/java" />
+            <packageset dir="${bigdata.dir}/bigdata-gom/src/java" />
             <packageset dir="${bigdata.dir}/bigdata-gom/src/samples" />
             <packageset dir="${bigdata.dir}/bigdata-gas/src/java" />
             <packageset dir="${bigdata.dir}/ctc-striterators/src/java" />
@@ -456,12 +457,12 @@
         <fileset dir="${bigdata.dir}/bigdata-sails/lib">
             <include name="**/*.jar" />
         </fileset>
+        <fileset dir="${bigdata.dir}/bigdata-blueprints/lib">
+            <include name="blueprints-core-${blueprints.version}.jar" />
+        </fileset>
         <fileset dir="${bigdata.dir}/bigdata-gom/lib">
             <include name="**/*.jar" />
         </fileset>
-        <fileset dir="${bigdata.dir}/bigdata-blueprints/lib">
-            <include name="**/*.jar" />
-        </fileset>
     </copy>
     <!-- Do NOT flatten the jini jars. We need the to preserve the -->
    <!-- lib, lib-dl, and lib-ext distinctions. -->
@@ -571,7 +572,7 @@
             <fileset dir="${bigdata.dir}/bigdata" includes="LEGAL/*"/>
             <fileset dir="${bigdata.dir}/bigdata-rdf" includes="LEGAL/*"/>
             <fileset dir="${bigdata.dir}/bigdata-sails" includes="LEGAL/*"/>
-            <fileset dir="${bigdata.dir}/bigdata-blueprints" includes="LEGAL/*"/>
+            <fileset dir="${bigdata.dir}/bigdata-blueprints" includes="LEGAL/*"/>
             <fileset dir="${bigdata.dir}/bigdata-gom" includes="LEGAL/*"/>
             <fileset dir="${bigdata.dir}/bigdata-jini" includes="LEGAL/*"/>
             <!-- bigdata jar plus some dependencies as filtered by autojar.
@@ -681,6 +682,7 @@
                 <include name="bigdata-jini/LEGAL/*" />
                 <include name="bigdata-rdf/LEGAL/*" />
                 <include name="bigdata-sails/LEGAL/*" />
+                <include name="bigdata-blueprints/LEGAL/*" />
                 <include name="bigdata-gom/LEGAL/*" />
             </fileset>
         </copy>
@@ -947,8 +949,8 @@
         <property name="bigdata-jini.lib" location="${bigdata.dir}/bigdata-jini/lib/jini/lib" />
         <property name="bigdata-rdf.lib" location="${bigdata.dir}/bigdata-rdf/lib" />
         <property name="bigdata-sails.lib" location="${bigdata.dir}/bigdata-sails/lib" />
+        <property name="bigdata-blueprints.lib" location="${bigdata.dir}/bigdata-blueprints/lib" />
        <property name="bigdata-gom.lib" location="${bigdata.dir}/bigdata-gom/lib" />
-        <property name="bigdata-blueprints.lib" location="${bigdata.dir}/bigdata-blueprints/lib" />
         <property name="bigdata-jetty.lib" location="${bigdata.dir}/bigdata/lib/jetty" />
         <property name="bigdata-http.lib" location="${bigdata.dir}/bigdata-sails/lib/httpcomponents" />
         <property name="bigdata-zookeeper.lib" location="${bigdata.dir}/bigdata-jini/lib/apache" />
@@ -991,10 +993,11 @@
         <!-- GOM library -->
         <!-- Note: Nothing yet for GOM -->

-        <!-- Blueprints library -->
-        <copy file="${bigdata-blueprints.lib}/blueprints-core-${blueprints.version}.jar"
-              tofile="${dist.lib}/blueprints-core.jar" />
+        <!-- Blueprints library -->
+        <copy file="${bigdata-blueprints.lib}/blueprints-core-${blueprints.version}.jar"
+              tofile="${dist.lib}/blueprints-core.jar" />
+
         <!-- jetty library -->
         <copy file="${bigdata-jetty.lib}/jetty-continuation-${jetty.version}.jar"
               tofile="${dist.lib}/jetty-continuation.jar" />
@@ -1403,12 +1406,12 @@
         <copy toDir="${build.dir}/bigdata-sails/src">
             <fileset dir="${bigdata.dir}/bigdata-sails/src" />
         </copy>
+        <copy toDir="${build.dir}/bigdata-blueprints/src">
+            <fileset dir="${bigdata.dir}/bigdata-blueprints/src" />
+        </copy>
         <copy toDir="${build.dir}/bigdata-gom/src">
             <fileset dir="${bigdata.dir}/bigdata-gom/src" />
         </copy>
-        <copy toDir="${build.dir}/bigdata-blueprints/src">
-            <fileset dir="${bigdata.dir}/bigdata-blueprints/src" />
-        </copy>
         <copy toDir="${build.dir}/bigdata-war/src">
             <fileset dir="${bigdata.dir}/bigdata-war/src" />
         </copy>
@@ -1446,12 +1449,11 @@
         <copy toDir="${build.dir}/bigdata-sails/lib">
             <fileset dir="${bigdata.dir}/bigdata-sails/lib" />
         </copy>
-        <mkdir dir="${build.dir}/bigdata-blueprints/lib" />
-        <copy toDir="${build.dir}/bigdata-blueprints/lib">
-            <fileset dir="${bigdata.dir}/bigdata-blueprints/lib" />
-        </copy>
+        <mkdir dir="${build.dir}/bigdata-blueprints/lib" />
+        <copy toDir="${build.dir}/bigdata-blueprints/lib">
+            <fileset dir="${bigdata.dir}/bigdata-blueprints/lib" />
+        </copy>
-
         <mkdir dir="${build.dir}/src" />
         <mkdir dir="${build.dir}/src/resources" />
         <mkdir dir="${build.dir}/src/resources/config" />
@@ -1504,8 +1506,8 @@
             <include name="bigdata-jini/src/**" />
             <include name="bigdata-rdf/src/**" />
             <include name="bigdata-sails/src/**" />
+            <include name="bigdata-blueprints/src/**" />
             <include name="bigdata-gom/src/**" />
-            <include name="bigdata-blueprints/src/**" />
             <include name="bigdata-war/src/**" />
             <include name="ctc-striterators/src/**" />
             <include name="lgpl-utils/src/**" />
@@ -1516,7 +1518,6 @@
             <include name="bigdata-rdf/lib/**" />
             <include name="bigdata-sails/lib/**" />
             <include name="bigdata-gom/lib/**" />
-            <include name="bigdata-blueprints/lib/**" />
             <include name="src/**" />
             <exclude name="classes/**" />
             <exclude name="${version}.jar" />
@@ -1577,8 +1578,8 @@
                 <include name="bigdata-jini/LEGAL/*" />
                 <include name="bigdata-rdf/LEGAL/*" />
                 <include name="bigdata-sails/LEGAL/*" />
+                <include name="bigdata-blueprints/LEGAL/*" />
                 <include name="bigdata-gom/LEGAL/*" />
-                <include name="bigdata-blueprints/LEGAL/*" />
             </fileset>
         </copy>

@@ -1777,8 +1778,6 @@
         <property name="sesame-sparql-test.jar" location="${bigdata-sails.lib}/sesame-sparql-testsuite-${sesame.version}.jar" />
         <property name="sesame-store-test.jar" location="${bigdata-sails.lib}/sesame-store-testsuite-${sesame.version}.jar" />
         <property name="sesame-rio-test.jar" location="${bigdata-sails.lib}/sesame-rio-testsuite-${sesame.version}.jar" />
-        <property name="blueprints-test.jar" location="${bigdata-blueprints.lib}/blueprints-test-${blueprints.version}.jar" />
-        <property name="jettison.jar" location="${bigdata-blueprints.lib}/jettison-${jettison.version}.jar" />
         <property name="classes.test.dir" location="${classes.dir}/test" />
         <mkdir dir="${classes.test.dir}" />
@@ -1789,7 +1788,7 @@
         <!-- TODO ${path.separator}${dist.lib}/bigdata-gas.jar -->
         <property name="javac.test.classpath"
-              value="${classes.dir}${path.separator}${junit.jar}${path.separator}${junit-ext.jar}${path.separator}${sesame-sparql-test.jar}${path.separator}${sesame-store-test.jar}${path.separator}${sesame-rio-test.jar}${path.separator}${dist.lib}/classserver.jar${path.separator}${dist.lib}/highscalelib.jar${path.separator}${dist.lib}/dsiutils.jar${path.separator}${dist.lib}/lgplutils.jar${path.separator}${dist.lib}/fastutil.jar${path.separator}${dist.lib}/bigdata-ganglia.jar${path.separator}${dist.lib}/icu4j.jar${path.separator}${dist.lib}/icu4j-charset.jar${path.separator}${dist.lib}/log4j.jar${path.separator}${dist.lib}/lucene-analyzer.jar${path.separator}${dist.lib}/lucene-core.jar${path.separator}${path.separator}${dist.lib}/openrdf-sesame.jar${path.separator}${dist.lib}/slf4j.jar${path.separator}${dist.lib}/jsk-lib.jar${path.separator}${dist.lib}/jsk-platform.jar${path.separator}${dist.lib}/nxparser.jar${path.separator}${dist.lib}/zookeeper.jar${path.separator}${dist.lib}/jetty-continuation.jar${path.separator}${dist.lib}/jetty-http.jar${path.separator}${dist.lib}/jetty-io.jar${path.separator}${dist.lib}/jetty-jmx.jar${path.separator}${dist.lib}/jetty-jndi.jar${path.separator}${dist.lib}/jetty-server.jar${path.separator}${dist.lib}/jetty-util.jar${path.separator}${dist.lib}/jetty-webapp.jar${path.separator}${dist.lib}/jetty-servlet.jar${path.separator}${dist.lib}/jetty-security.jar${path.separator}${dist.lib}/jetty-xml.jar${path.separator}${dist.lib}/jetty-rewrite.jar${path.separator}${dist.lib}/jetty-client.jar${path.separator}${dist.lib}/jetty-proxy.jar${path.separator}${dist.lib}/servlet-api.jar${path.separator}${dist.lib}/commons-codec.jar${path.separator}${dist.lib}/commons-fileupload.jar${path.separator}${dist.lib}/commons-io.jar${path.separator}${dist.lib}/commons-logging.jar${path.separator}${dist.lib}/httpclient.jar${path.separator}${dist.lib}/httpclient-cache.jar${path.separator}${dist.lib}/httpcore.jar${path.separator}${dist.lib}/httpmime.jar${path.separator}${dist.lib}/blueprints-core.jar${path.separator}${blueprints-test.jar}${path.separator}${jettison.jar}" />
+              value="${classes.dir}${path.separator}${junit.jar}${path.separator}${junit-ext.jar}${path.separator}${sesame-sparql-test.jar}${path.separator}${sesame-store-test.jar}${path.separator}${sesame-rio-test.jar}${path.separator}${dist.lib}/classserver.jar${path.separator}${dist.lib}/highscalelib.jar${path.separator}${dist.lib}/dsiutils.jar${path.separator}${dist.lib}/lgplutils.jar${path.separator}${dist.lib}/fastutil.jar${path.separator}${dist.lib}/bigdata-ganglia.jar${path.separator}${dist.lib}/icu4j.jar${path.separator}${dist.lib}/icu4j-charset.jar${path.separator}${dist.lib}/log4j.jar${path.separator}${dist.lib}/lucene-analyzer.jar${path.separator}${dist.lib}/lucene-core.jar${path.separator}${path.separator}${dist.lib}/openrdf-sesame.jar${path.separator}${dist.lib}/slf4j.jar${path.separator}${dist.lib}/jsk-lib.jar${path.separator}${dist.lib}/jsk-platform.jar${path.separator}${dist.lib}/nxparser.jar${path.separator}${dist.lib}/zookeeper.jar${path.separator}${dist.lib}/jetty-continuation.jar${path.separator}${dist.lib}/jetty-http.jar${path.separator}${dist.lib}/jetty-io.jar${path.separator}${dist.lib}/jetty-jmx.jar${path.separator}${dist.lib}/jetty-jndi.jar${path.separator}${dist.lib}/jetty-server.jar${path.separator}${dist.lib}/jetty-util.jar${path.separator}${dist.lib}/jetty-webapp.jar${path.separator}${dist.lib}/jetty-servlet.jar${path.separator}${dist.lib}/jetty-security.jar${path.separator}${dist.lib}/jetty-xml.jar${path.separator}${dist.lib}/jetty-rewrite.jar${path.separator}${dist.lib}/jetty-client.jar${path.separator}${dist.lib}/jetty-proxy.jar${path.separator}${dist.lib}/servlet-api.jar${path.separator}${dist.lib}/commons-codec.jar${path.separator}${dist.lib}/commons-fileupload.jar${path.separator}${dist.lib}/commons-io.jar${path.separator}${dist.lib}/commons-logging.jar${path.separator}${dist.lib}/httpclient.jar${path.separator}${dist.lib}/httpclient-cache.jar${path.separator}${dist.lib}/httpcore.jar${path.separator}${dist.lib}/httpmime.jar" />

         <echo>javac </echo>
@@ -1835,7 +1834,6 @@
             <src path="${bigdata.dir}/bigdata-rdf/src/test" />
             <src path="${bigdata.dir}/bigdata-sails/src/test" />
             <src path="${bigdata.dir}/bigdata-gom/src/test" />
-            <src path="${bigdata.dir}/bigdata-blueprints/src/test" />
            <src path="${bigdata.dir}/bigdata-gas/src/test" />
             <src path="${bigdata.dir}/bigdata-ganglia/src/test" />
             <src path="${bigdata.dir}/ctc-striterators/src/test" />
@@ -1893,9 +1891,6 @@
             <fileset dir="${bigdata.dir}/bigdata-jini/src/test">
                 <exclude name="**/*.java" />
             </fileset>
-            <fileset dir="${bigdata.dir}/bigdata-blueprints/src/test">
-                <exclude name="**/*.java" />
-            </fileset>
         </jar>

This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
From: <dme...@us...> - 2014-05-08 20:50:15
Revision: 8238  http://sourceforge.net/p/bigdata/code/8238
Author:   dmekonnen
Date:     2014-05-08 20:50:11 +0000 (Thu, 08 May 2014)

Log Message:
-----------
correction to CWD for ant builds

Modified Paths:
--------------
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/tomcat.rb

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb	2014-05-08 20:33:22 UTC (rev 8237)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb	2014-05-08 20:50:11 UTC (rev 8238)
@@ -33,7 +33,7 @@
 	execute "build the nss tar ball" do
 		user 'ubuntu'
 		group 'ubuntu'
-		cwd node['bigdata'][:source]
+		cwd "/home/ubuntu/#{node['bigdata'][:source]}"
 		command "ant package-brew-nss"
 	end
 else

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/tomcat.rb
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/tomcat.rb	2014-05-08 20:33:22 UTC (rev 8237)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/tomcat.rb	2014-05-08 20:50:11 UTC (rev 8238)
@@ -39,7 +39,7 @@
 	execute "build the war file" do
 		user 'ubuntu'
 		group 'ubuntu'
-		cwd node['bigdata'][:source]
+		cwd "/home/ubuntu/#{node['bigdata'][:source]}"
 		command "ant war"
 	end

@@ -47,7 +47,7 @@
 	#
 	# Install the WAR file:
 	#
 	remote_file "#{node['tomcat'][:webapp_dir]}/bigdata.war" do
-		source "#{node['bigdata'][:source]}/ant-build/bigdata.war"
+		source "file:///home/ubuntu/#{node['bigdata'][:source]}/ant-build/bigdata.war"
 		owner node['tomcat'][:user]
 		group node['tomcat'][:group]
 	end
From: <dme...@us...> - 2014-05-08 20:33:26
Revision: 8237  http://sourceforge.net/p/bigdata/code/8237
Author:   dmekonnen
Date:     2014-05-08 20:33:22 +0000 (Thu, 08 May 2014)

Log Message:
-----------
snapshot of deployment recipes that build from svn

Modified Paths:
--------------
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Berksfile
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/attributes/default.rb
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/metadata.rb
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/tomcat.rb
    branches/DEPLOYMENT_BRANCH_1_3_1/build.xml

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Berksfile
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Berksfile	2014-05-08 19:10:31 UTC (rev 8236)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Berksfile	2014-05-08 20:33:22 UTC (rev 8237)
@@ -2,6 +2,8 @@
 cookbook "apt"
 cookbook "java", '~> 1.22.0'
+cookbook "ant"
 cookbook "tomcat"
+cookbook "subversion"

 metadata

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/attributes/default.rb
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/attributes/default.rb	2014-05-08 19:10:31 UTC (rev 8236)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/attributes/default.rb	2014-05-08 20:33:22 UTC (rev 8237)
@@ -1,6 +1,4 @@
-# default['bigdata'][:url] = "http://sourceforge.net/projects/bigdata/files/bigdata/1.3.0/bigdata.war/download"
-default['bigdata'][:url] = "http://softlayer-dal.dl.sourceforge.net/project/bigdata/bigdata/1.3.0/bigdata.war"
 default['bigdata'][:home] = "/var/lib/bigdata"

 # Who runs bigdata?
@@ -9,6 +7,7 @@
 default['bigdata'][:properties] = default['bigdata'][:home] + "RWStore.properties"

+default['bigdata'][:source] = "bigdata-code"

 case node['bigdata'][:install_type]
 when "nss"
@@ -22,14 +21,16 @@
 	# Where the bigdata-ha.jnl file will live:
 	default['bigdata'][:data_dir] = node['bigdata'][:home] + "/var/data"

-when "nss_svn"
-	default['bigdata'][:url] = "http://bigdata.com/deploy/bigdata-1.3.0.tgz"
-when "tomcat_svn"
-	default['bigdata'][:svn_branch] = "https://svn.code.sf.net/p/bigdata/code/branches/BIGDATA_RELEASE_1_3_0"
-else
+
+	if node['bigdata'][:build_from_svn]
+		default['bigdata'][:svn_branch] = "https://svn.code.sf.net/p/bigdata/code/branches/BIGDATA_RELEASE_1_3_0"
+	end
+when "tomcat"
 	default['tomcat'][:base_version] = 7
 	default['tomcat'][:java_options] = "-Djava.awt.headless=true -server -Xmx4G -XX:+UseG1GC"

+	default['bigdata'][:url] = "http://softlayer-dal.dl.sourceforge.net/project/bigdata/bigdata/1.3.0/bigdata.war"
+
 	default['bigdata'][:web_home] = default['tomcat'][:webapp_dir] + "/bigdata"
 	default['bigdata'][:web_xml] = default['bigdata'][:web_home] + "/WEB-INF/web.xml"
 	default['bigdata'][:log4j_properties] = default['bigdata'][:web_home] + "/WEB-INF/classes/log4j.properties"
@@ -39,6 +40,10 @@

 	# Where the log files will live:
 	default['bigdata'][:log_dir] = node['bigdata'][:home] + "/log"
+
+	if node['bigdata'][:build_from_svn]
+		default['bigdata'][:svn_branch] = "https://svn.code.sf.net/p/bigdata/code/branches/BIGDATA_RELEASE_1_3_0"
+	end
 end

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/metadata.rb
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/metadata.rb	2014-05-08 19:10:31 UTC (rev 8236)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/metadata.rb	2014-05-08 20:33:22 UTC (rev 8237)
@@ -7,4 +7,6 @@
 version '0.1.1'

 depends 'apt'
 depends 'java', '>= 1.22.0'
+depends 'ant'
 depends 'tomcat'
+depends 'subversion'

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb	2014-05-08 19:10:31 UTC (rev 8236)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb	2014-05-08 20:33:22 UTC (rev 8237)
@@ -18,18 +18,39 @@
 		action :create
 	end

-	#
-	# Retrieve the package prepared for Brew:
-	#
-	remote_file "/tmp/bigdata.tgz" do
-		owner node['bigdata'][:user]
-		group node['bigdata'][:group]
-		source node['bigdata'][:url]
-	end

-	execute "Extract and relocate the bigdata archive" do
-		cwd "/var/lib"
-		command "tar xvf /tmp/bigdata.tgz"
+	if node['bigdata'][:build_from_svn]
+		include_recipe "ant"
+		include_recipe "subversion"
+
+		execute "checkout bigdata from svn repo" do
+			user 'ubuntu'
+			group 'ubuntu'
+			cwd "/home/ubuntu"
+			command "svn checkout #{node['bigdata'][:svn_branch]} #{node['bigdata'][:source]}"
+		end
+
+		execute "build the nss tar ball" do
+			user 'ubuntu'
+			group 'ubuntu'
+			cwd node['bigdata'][:source]
+			command "ant package-brew-nss"
+		end
+	else
+		#
+		# Retrieve the package prepared for Brew:
+		#
+		remote_file "/tmp/bigdata.tgz" do
+			owner node['bigdata'][:user]
+			group node['bigdata'][:group]
+			source node['bigdata'][:url]
+		end
+
+		execute "Extract and relocate the bigdata archive" do
+			cwd "/var/lib"
+			command "tar xvf /tmp/bigdata.tgz"
+		end
+	end

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/tomcat.rb
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/tomcat.rb	2014-05-08 19:10:31 UTC (rev 8236)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/tomcat.rb	2014-05-08 20:33:22 UTC (rev 8237)
@@ -25,16 +25,44 @@
 	end

-	#
-	# Install the WAR file:
-	#
-	remote_file "#{node['tomcat'][:webapp_dir]}/bigdata.war" do
-		source node['bigdata'][:url]
-		owner node['tomcat'][:user]
-		group node['tomcat'][:group]
+	if node['bigdata'][:build_from_svn]
+		include_recipe "ant"
+		include_recipe "subversion"
+
+		execute "checkout bigdata from svn repo" do
+			user 'ubuntu'
+			group 'ubuntu'
+			cwd "/home/ubuntu"
+			command "svn checkout #{node['bigdata'][:svn_branch]} #{node['bigdata'][:source]}"
+		end
+
+		execute "build the war file" do
+			user 'ubuntu'
+			group 'ubuntu'
+			cwd node['bigdata'][:source]
+			command "ant war"
+		end
+
+		#
+		# Install the WAR file:
+		#
+		remote_file "#{node['tomcat'][:webapp_dir]}/bigdata.war" do
+			source "#{node['bigdata'][:source]}/ant-build/bigdata.war"
+			owner node['tomcat'][:user]
+			group node['tomcat'][:group]
+		end
+
+	else
+		#
+		# Install the WAR file:
+		#
+		remote_file "#{node['tomcat'][:webapp_dir]}/bigdata.war" do
+			source node['bigdata'][:url]
+			owner node['tomcat'][:user]
+			group node['tomcat'][:group]
+		end
 	end
-
 	#
 	# Create the JNL home directory
 	#

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/build.xml
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/build.xml	2014-05-08 19:10:31 UTC (rev 8236)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/build.xml	2014-05-08 20:33:22 UTC (rev 8237)
@@ -1219,6 +1219,21 @@
 			src="http://wiki.bigdata.com/wiki/index.php/NanoSparqlServer?printable=yes" />

+		<!-- Stage files specific to NSS deployments provided by Brew and Chef. -->
+		<chmod file="${dist.bin}/bigdata" perm="755" />
+		<copy file="${src.resources}/deployment/nss/bin/bigdataNSS"
+		      todir="${dist.bin}" />
+		<chmod file="${dist.bin}/bigdata" perm="755" />
+		<copy file="${src.resources}/deployment/nss/bin/startNSS"
+		      todir="${dist.bin}" />
+		<chmod file="${dist.bin}/startNSS" perm="755" />
+		<copy file="${src.resources}/deployment/nss/etc/jetty.xml"
+		      todir="${dist.var.jetty}/etc" />
+		<copy file="${src.resources}/deployment/nss/WEB-INF/RWStore.properties"
+		      todir="${dist.var.jetty}/WEB-INF" />
+		<copy file="${src.resources}/deployment/nss/WEB-INF/classes/log4j.properties"
+		      todir="${dist.var.jetty}/WEB-INF/classes" />
+
 	</target>

 	<!-- -->
@@ -1298,6 +1313,41 @@
 	</target>

+	<target name="package-brew-nss" depends="clean, stage"
+	        description="Create compressed tar file for Jetty based deployment via Brew and Chef installers.">
+
+		<tar destfile="${bigdata.dir}/REL-NSS.${version}.tgz"
+		     compression="gzip">
+
+			<tarfileset dir="${bigdata.dir}/dist">
+				<include name="bigdata/doc/**" />
+				<exclude name="bigdata/doc/HAJournalServer.html" />
+				<include name="bigdata/lib/**" />
+				<exclude name="bigdata/lib/bigdata-ganglia.jar" />
+				<exclude name="bigdata/lib/browser.jar" />
+				<exclude name="bigdata/lib/reggie.jar" />
+				<exclude name="bigdata/lib/zookeeper.jar" />
+				<exclude name="bigdata/lib/jsk-*.jar" />
+				<exclude name="bigdata/lib-dl" />
+				<exclude name="bigdata/lib-ext" />
+				<include name="bigdata/var/jetty/**" />
+				<include name="bigdata/var/config/logging/logging.properties" />
+				<exclude name="bigdata/var/jetty/jetty.xml" />
+				<exclude name="bigdata/var/jetty/html/new.html" />
+				<exclude name="bigdata/var/jetty/html/old.html" />
+			</tarfileset>

+			<!-- Add scripts separately, making them executable -->
+
+			<tarfileset dir="${bigdata.dir}/dist" filemode="755">
+				<include name="bigdata/bin/bigdataNSS" />
+				<include name="bigdata/bin/startNSS" />
+			</tarfileset>
+		</tar>
+
+	</target>
+
+	<!-- FIXME DEBUG and add 'depends="javadoc, stage" (should stage javadoc?)' -->
 	<!-- Note: can require 'rpm' and 'rpm-build. -->
 	<!-- TODO: We do not need both this and "deploy-artifact". -->
From: <dme...@us...> - 2014-05-08 19:10:34
Revision: 8236  http://sourceforge.net/p/bigdata/code/8236
Author:   dmekonnen
Date:     2014-05-08 19:10:31 +0000 (Thu, 08 May 2014)

Log Message:
-----------
format beautification

Modified Paths:
--------------
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb	2014-05-08 19:06:32 UTC (rev 8235)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/recipes/nss.rb	2014-05-08 19:10:31 UTC (rev 8236)
@@ -4,18 +4,18 @@
 #
 if node['bigdata'][:install_type] == "nss"

-	group "bigdata" do
+	group node['bigdata'][:group] do
 		action :create
 		append true
 	end

-	user "#{node['bigdata'][:user]}" do
-		gid "#{node['bigdata'][:group]}"
+	user node['bigdata'][:user] do
+		gid node['bigdata'][:group]
 		supports :manage_home => true
-		shell "/bin/false"
-		home "#{node['bigdata'][:home]}"
-		system true
-		action :create
+		shell  "/bin/false"
+		home   node['bigdata'][:home]
+		system true
+		action :create
 	end

 	#
@@ -28,16 +28,16 @@
 	end

 	execute "Extract and relocate the bigdata archive" do
-		cwd "/var/lib"
-		command "tar xvf /tmp/bigdata.tgz"
+		cwd     "/var/lib"
+		command "tar xvf /tmp/bigdata.tgz"
 	end

 	execute "change the ownership of the bigdata home directory to bigdata, which strangely is not" do
-		user "root"
-		group "root"
-		cwd "#{node['bigdata'][:home]}"
-		command "chown -R #{node['bigdata'][:user]}:#{node['bigdata'][:group]} ."
+		user    "root"
+		group   "root"
+		cwd     node['bigdata'][:home]
+		command "chown -R #{node['bigdata'][:user]}:#{node['bigdata'][:group]} ."
 	end

 	link "/etc/init.d/bigdataNSS" do
@@ -80,6 +80,6 @@
 		#
 		# supports :status => true, :start => true, :stop => true, :restart => true
 		supports :start => true, :stop => true, :restart => true
-		action [ :start, :enable ]
+		action [ :enable, :start ]
 	end
 end
From: <dme...@us...> - 2014-05-08 19:06:34
Revision: 8235  http://sourceforge.net/p/bigdata/code/8235
Author:   dmekonnen
Date:     2014-05-08 19:06:32 +0000 (Thu, 08 May 2014)

Log Message:
-----------
cleanup

Modified Paths:
--------------
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Vagrantfile
    branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Vagrantfile.aws

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Vagrantfile
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Vagrantfile	2014-05-08 18:33:04 UTC (rev 8234)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Vagrantfile	2014-05-08 19:06:32 UTC (rev 8235)
@@ -32,25 +32,22 @@
 	config.vm.provider :aws do |aws, override|
 		override.vm.box = "dummy"

-		aws.access_key_id = "AKIAJ26S27XQRS5LFXCQ"
-		aws.secret_access_key = "BPBric3lzzE9lHV3Hwz+vG9TQ/e1fOugytYz1LFV"
-		aws.keypair_name = "systap"
+		aws.access_key_id = ENV['AWS_ACCESS_KEY_ID']
+		aws.secret_access_key = ENV['AWS_SECRET_ACCESS_KEY']
+		aws.keypair_name = ENV['AWS_KEYPAIR_NAME']

-		aws.ami = "ami-a73264ce"
+		aws.ami = ENV['AWS_AMI']

-		#
-		#
-		#
-		aws.region = "us-east-1"
-		aws.instance_type = "t1.micro"
-		aws.security_groups = [ "launch-wizard-4" ]
+		aws.region = ENV['AWS_REGION']
+		aws.instance_type = ENV['AWS_INSTANCE_TYPE']
+		aws.security_groups = [ ENV['AWS_SECURITY_GROUPS'], ENV['AWS_SECURITY_GROUP_PRIVATE'] ]

 		aws.tags = {
-			'Name' => 'Systap Bigdata'
+			'Name' => ENV['BIGDATA_HOST_NAME']
 		}

-		override.ssh.username = "ubuntu"
-		override.ssh.private_key_path = "/Users/dmekonnen/.ssh/systap.pem"
+		override.ssh.username = ENV['AWS_AMI_USERNAME']
+		override.ssh.private_key_path = ENV['AWS_SSH_PRIVATE_KEY']
 	end

@@ -70,8 +67,8 @@
 		}
 	}

-	# config.vm.provision :shell, inline: "sudo apt-get update ; sudo curl -L https://www.opscode.com/chef/install.sh | sudo bash"
-	config.vm.provision :shell, inline: "sudo apt-get update"
+	config.vm.provision :shell, inline: "sudo apt-get update ; sudo curl -L https://www.opscode.com/chef/install.sh | sudo bash"
+	# config.vm.provision :shell, inline: "sudo apt-get update"

 	chef.run_list = [
 		"recipe[bigdata::nss]"

Modified: branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Vagrantfile.aws
===================================================================
--- branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Vagrantfile.aws	2014-05-08 18:33:04 UTC (rev 8234)
+++ branches/DEPLOYMENT_BRANCH_1_3_1/bigdata/src/resources/deployment/vagrant/systap-bigdata/Vagrantfile.aws	2014-05-08 19:06:32 UTC (rev 8235)
@@ -10,7 +10,7 @@
 	# please see the online documentation at vagrantup.com.
 	config.vm.box = "dummy"
-	config.vm.hostname = "systap-bigdata"
+	config.vm.hostname = "bigdata"

 	config.berkshelf.enabled = true

@@ -23,25 +23,22 @@
 	# config.berkshelf.except = []

 	config.vm.provider :aws do |aws, override|
-		aws.access_key_id = "AKIAJ26S27XQRS5LFXCQ"
-		aws.secret_access_key = "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"
-		aws.keypair_name = "systap"
+		aws.access_key_id = ENV['AWS_ACCESS_KEY_ID']
+		aws.secret_access_key = ENV['AWS_SECRET_ACCESS_KEY']
+		aws.keypair_name = ENV['AWS_KEYPAIR_NAME']

-		aws.ami = "ami-a73264ce"
+		aws.ami = ENV['AWS_AMI']

-		#
-		#
-		#
-		aws.region = "us-east-1"
-		aws.instance_type = "t1.micro"
-		aws.security_groups = [ "launch-wizard-4" ]
+		aws.region = ENV['AWS_REGION']
+		aws.instance_type = ENV['AWS_INSTANCE_TYPE']
+		aws.security_groups = [ ENV['AWS_SECURITY_GROUPS'], ENV['AWS_SECURITY_GROUP_PRIVATE'] ]

 		aws.tags = {
-			'Name' => 'Systap Bigdata'
+			'Name' => ENV['BIGDATA_HOST_NAME']
 		}

-		override.ssh.username = "ubuntu"
-		override.ssh.private_key_path = "/Users/dmekonnen/.ssh/systap.pem"
+		override.ssh.username = ENV['AWS_AMI_USERNAME']
+		override.ssh.private_key_path = ENV['AWS_SSH_PRIVATE_KEY']
 	end

 	config.vm.provision :chef_solo do |chef|
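The commit above moves hard-coded AWS credentials and instance settings out of the Vagrantfiles and into environment variables, which keeps secrets out of version control. One weakness of bare `ENV['...']` lookups is that a missing variable silently becomes `nil` and fails later, deep inside the provider. A sketch of a fail-fast wrapper, assuming the variable names from the commit; the `fetch_env!` helper is hypothetical and not part of Vagrant:

```ruby
# Sketch: fail-fast environment lookup for provider settings, so a
# forgotten export surfaces immediately with a clear message instead
# of a nil propagating into the AWS provider. fetch_env! is a
# hypothetical helper.
def fetch_env!(name)
  ENV[name] or raise "required environment variable #{name} is not set"
end

# Illustrative default so this demo runs standalone; a real setup
# would export AWS_REGION (and the other AWS_* variables) beforehand.
ENV['AWS_REGION'] ||= 'us-east-1'

aws_region = fetch_env!('AWS_REGION')
puts aws_region
```

In a Vagrantfile the same pattern would read `aws.region = fetch_env!('AWS_REGION')` and so on, making every required variable explicit at `vagrant up` time.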