From: Maximilian S. <sch...@ci...> - 2006-10-26 12:32:07
Hi,
I'm trying to get Wayback with nutchWax working, but I'm running into this
error from nutchwax: field "link" does not appear to be indexed
Now I don't know whether nutchWax or Wayback is to blame here, or if "link"
_should_ be in my index but isn't somehow?!
When I just remove &sort=link from the query URL the query works fine. I
found it being added here:
org.archive.wayback.resourceindex.NutchResourceIndex.java (285):
ms.append("&sort=link");
But even without the exception being thrown I don't get any results, as the
exact date is added to the query each time. And by exact I mean down to the
_second_!
Even when I select "All" from the years select box,
"date%3A20061231235959" is added (?), when I select "2003",
date%3A20031231235959 is added. Nutch will then only search for documents
with this specific timestamp.
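To illustrate what that term means, here is a minimal Java sketch. It is not Wayback or NutchWAX code: the class and method names are made up for illustration, and reading the value as a yyyyMMddHHmmss timestamp is an assumption based on its 14-digit form. It decodes the URL parameter and parses the value (needs Java 10 or later for this URLDecoder overload):

```java
import java.net.URLDecoder;
import java.nio.charset.StandardCharsets;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

// Hypothetical helper, for illustration only.
public class DateTermDemo {
    // Assumed layout of the 14-digit value: year, month, day, hour, minute, second.
    private static final DateTimeFormatter TS =
            DateTimeFormatter.ofPattern("yyyyMMddHHmmss");

    // Undo the URL encoding, so "%3A" becomes ":".
    static String decode(String encoded) {
        return URLDecoder.decode(encoded, StandardCharsets.UTF_8);
    }

    // Take everything after the first ':' and parse it as a timestamp.
    static LocalDateTime parseTimestamp(String term) {
        String value = term.substring(term.indexOf(':') + 1);
        return LocalDateTime.parse(value, TS);
    }

    public static void main(String[] args) {
        String term = decode("date%3A20061231235959");
        System.out.println(term);
        System.out.println(parseTimestamp(term));
    }
}
```

Run as-is it prints date:20061231235959 and then 2006-12-31T23:59:59, i.e. a single point in time at the last second of the selected year rather than a range.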
Am I missing some important configuration thing here or did the Nutch part
of wayback just not get enough love the last months (-: ?
Cheers,
Max
Full stacktrace:
java.lang.RuntimeException: field "link" does not appear to be indexed
org.apache.lucene.search.FieldCacheImpl.getAuto(FieldCacheImpl.java:356)
org.apache.lucene.search.FieldSortedHitQueue.comparatorAuto(FieldSortedHitQueue.java:341)
org.apache.lucene.search.FieldSortedHitQueue.getCachedComparator(FieldSortedHitQueue.java:184)
org.apache.lucene.search.FieldSortedHitQueue.<init>(FieldSortedHitQueue.java:58)
org.apache.lucene.search.TopFieldDocCollector.<init>(TopFieldDocCollector.java:44)
org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:108)
org.apache.lucene.search.Searcher.search(Searcher.java:76)
org.apache.nutch.searcher.LuceneQueryOptimizer.optimize(LuceneQueryOptimizer.java:268)
org.apache.nutch.searcher.IndexSearcher.search(IndexSearcher.java:95)
org.apache.nutch.searcher.NutchBean.search(NutchBean.java:180)
org.apache.nutch.searcher.NutchBean.search(NutchBean.java:242)
org.apache.nutch.searcher.OpenSearchServlet.doGet(OpenSearchServlet.java:136)
org.archive.access.nutch.NutchwaxOpenSearchServlet.doGet(NutchwaxOpenSearchServlet.java:69)
javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
javax.servlet.http.HttpServlet.service(HttpServlet.java:802)