Part of the problem is that I'm using an older version of that jar.
It's not included in my download... should I just check it out of SVN
and build a new jar?
Adam
On Tue, Sep 9, 2008 at 1:58 PM, Stanislaw Osinski
<stanislaw@...> wrote:
>>> I'm attempting to run the Solr demo web app using Solr as an input.
>>> This required adding a custom Solr input component and also uploading
>>> the Solr input class jar that's not included in the distro. Everything
>>> seems good except I'm getting errors like this when doing a search:
>>>
>>> Internal processing error. class java.lang.RuntimeException
>>> Error in the input XML, duplicated document identifier:
>>
>> Hi Adam,
>>
>> This error is raised when two or more documents in the input have the same
>> ids. Duplicated content (snippets, titles) is not a problem, but documents
>> must have unique ids so that we can then output unambiguous assignments of
>> documents to clusters.
>>
>> From your description it looks like the ids in your Solr index should be
>> unique, so the error looks weird. For quicker debugging -- could you e-mail
>> me some example XML output from Solr (in Solr format) that causes the
>> problems?
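
One quick way to test the uniqueness requirement described above is to count id values in a saved Solr XML response. The following is only a minimal sketch: it assumes the standard Solr XML response layout (`<doc>` elements whose children carry a `name` attribute), and the function name `duplicate_ids` and the sample data are made up for illustration. Documents that lack the chosen field are counted under the empty string; that is one plausible way a wrong field name could produce "duplicated" ids, though the thread does not spell out the mechanism.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def duplicate_ids(solr_xml, id_field="id"):
    """Return {id value: count} for ids that occur more than once.

    Hypothetical helper, not part of Carrot2 or Solr.
    """
    root = ET.fromstring(solr_xml)
    counts = Counter()
    for doc in root.iter("doc"):
        # Solr's XML writer emits each field as a typed element
        # (<str>, <int>, ...) with the field name in the "name" attribute.
        for field in doc:
            if field.get("name") == id_field:
                counts[(field.text or "").strip()] += 1
                break
        else:
            # No such field in this document: count it as an empty id.
            counts[""] += 1
    return {value: n for value, n in counts.items() if n > 1}


# Invented sample response, for illustration only.
SAMPLE = """<response><result name="response" numFound="3" start="0">
<doc><str name="doc_id">a1</str><str name="title">First</str></doc>
<doc><str name="doc_id">a2</str><str name="title">Second</str></doc>
<doc><str name="doc_id">a1</str><str name="title">Third</str></doc>
</result></response>"""
```

Calling `duplicate_ids(SAMPLE, "doc_id")` reports the genuinely repeated id, while calling it with a field name that does not exist in the documents reports every document under the empty id.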
>
> Ok -- I've found the problem. Solr input has a parameter that allows you to
> specify the Solr field names that should map to Carrot2 document's id,
> title, snippet and url. The problem is that when using thin input in carrot2
> webapp, there is no easy way to set these parameters (something we've
> addressed in the upcoming 3.0 release).
>
> For now the quickest solution would be to modify the solr-to-c2.xsl XSLT
> (it's in carrot2-input-solr.jar) and provide your specific Solr field names:
>
> <xsl:param name="solr.id-field">id</xsl:param>
> <xsl:param name="solr.title-field">title</xsl:param>
> <xsl:param name="solr.snippet-field">description</xsl:param>
> <xsl:param name="solr.url-field">url</xsl:param>
>
> For the XML you e-mailed me the mappings should probably be:
>
> <xsl:param name="solr.id-field">doc_id</xsl:param>
> <xsl:param name="solr.title-field">title</xsl:param>
> <xsl:param name="solr.snippet-field">visible</xsl:param>
> <xsl:param name="solr.url-field">domain</xsl:param> <!-- Not sure here:
> you can put anything as the URL; this only affects the URL shown in the
> webapp, not the clustering -->
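
Since a jar is a zip archive, the XSLT edit described above could be scripted instead of done by hand. This is a rough sketch under several assumptions: the location of solr-to-c2.xsl inside carrot2-input-solr.jar is not given in the thread, so the entry is matched by file-name suffix; the stylesheet is assumed to be UTF-8; and the helper names `set_xsl_params` and `patch_jar` are invented for this example. An older jar may use different parameter names, in which case the script raises an error rather than silently changing nothing.

```python
import re
import zipfile
from xml.sax.saxutils import escape


def set_xsl_params(xsl_text, mapping):
    """Replace the default value of each <xsl:param name="KEY"> in mapping."""
    for name, value in mapping.items():
        pattern = (r'(<xsl:param\s+name="%s"\s*>)[^<]*(</xsl:param>)'
                   % re.escape(name))
        # A function replacement avoids backslash surprises in the value.
        xsl_text, hits = re.subn(
            pattern,
            lambda m, v=value: m.group(1) + escape(v) + m.group(2),
            xsl_text,
        )
        if hits == 0:
            raise KeyError("no xsl:param named %r found" % name)
    return xsl_text


def patch_jar(src_jar, dst_jar, mapping, entry_suffix="solr-to-c2.xsl"):
    """Copy src_jar to dst_jar, rewriting the params in the matching entry."""
    with zipfile.ZipFile(src_jar) as src, \
            zipfile.ZipFile(dst_jar, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            data = src.read(info.filename)
            if info.filename.endswith(entry_suffix):
                text = data.decode("utf-8")  # assumption: UTF-8 stylesheet
                data = set_xsl_params(text, mapping).encode("utf-8")
            dst.writestr(info, data)


# Example call (file names are placeholders):
# patch_jar("carrot2-input-solr.jar", "carrot2-input-solr-patched.jar",
#           {"solr.id-field": "doc_id", "solr.snippet-field": "visible"})
```

Writing to a new file rather than overwriting the original keeps the unmodified jar around in case the mappings need another round of adjustment.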
>
> Please let me know if this helps.
>
> Cheers,
>
> S.
>
> _______________________________________________
> Carrot2-developers mailing list
> Carrot2-developers@...
> https://lists.sourceforge.net/lists/listinfo/carrot2-developers