Thanks for your previous answers; they were quite helpful.
I have some questions related to web page parsing.
<GoogleOn> ... <GoogleOn> -- Only this will be parsed
--- meta name="zipcode" content="45212,45208,45218"
--- meta name="keywords" content="opensearch, search server"
--- meta name="author" content="kim"
If possible, could you please guide me on how to do this?
Can this be made automatic? Like converting all meta tags into return fields?
Sorry, strangely something went wrong and some of my questions were displayed incorrectly.
Here are my questions again:
<GoogleOn> ... <GoogleOn> -- Only the content inside this will be
parsed by GoogleMini
--- <meta name="zipcode" content="45212,45208,45218"/>
--- <meta name="keywords" content="opensearch, search server"/>
--- <meta name="author" content="kim"/>
If possible, could you please guide me on how to do this?
Currently, the 1.1 branch does not provide this.
I think that both of your questions are very interesting features. I will add
<OssOn/> and <OssOff/> tags support in the 1.2 branch.
We plan to release the first 1.2 beta version next week. Perhaps we will have
time to implement it.
Extracting the meta information requires implementing a dynamic way to create
fields in the HTML parser. It's a good idea too...
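To make the idea of turning meta tags into fields concrete, here is a minimal Python sketch. It is not how OpenSearchServer does it; the class and function names are hypothetical, and it assumes that comma-separated content should become multiple values of the same field.

```python
from html.parser import HTMLParser


class MetaFieldCollector(HTMLParser):
    """Collects <meta name="..." content="..."> pairs as candidate fields."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <meta ... /> also end up here.
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name")
        content = attributes.get("content")
        if name and content is not None:
            # Assumption: a comma separates multiple values of one field.
            values = [v.strip() for v in content.split(",") if v.strip()]
            self.fields.setdefault(name, []).extend(values)


def extract_meta_fields(html):
    """Return a dict mapping each meta name to its list of values."""
    collector = MetaFieldCollector()
    collector.feed(html)
    collector.close()
    return collector.fields
```

Fed the three meta tags from the question, this would produce one "zipcode" field with three values, one "keywords" field with two values, and one "author" field with a single value.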
This is really great news indeed. I am keenly looking forward to these
features. It would also be interesting to have the ability to preserve some
HTML tags that are within a specific tag. This might be very useful if we
wish to show some of the images from the specific page being searched :)
I am keenly looking forward to the 1.2 beta.
I have tried most of its features and I really love this product. Thank
you for your great work.
I just added two new feature requests. You will find interesting proposals
related to your request.
Thank you for your support!
Thank you for your support.
The first 1.2 developer release is available. We have implemented the <oss ignore="yes"> feature.
Let us know if it works for you.
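For readers wondering what such an ignore region means for the extracted text, here is a rough Python illustration. It is only a sketch of the concept and not the OSS parser; it assumes the region is closed with a matching </oss> tag, and the class and function names are made up for this example.

```python
from html.parser import HTMLParser


class IgnoreAwareTextExtractor(HTMLParser):
    """Collects page text, skipping anything inside <oss ignore="yes">."""

    def __init__(self):
        super().__init__()
        self.ignore_depth = 0  # > 0 while inside an ignored region
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag != "oss":
            return
        # Enter an ignored region, or track nesting inside one.
        if self.ignore_depth > 0 or dict(attrs).get("ignore") == "yes":
            self.ignore_depth += 1

    def handle_endtag(self, tag):
        # Assumption: the region is closed by a matching </oss>.
        if tag == "oss" and self.ignore_depth > 0:
            self.ignore_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self.ignore_depth == 0:
            self.chunks.append(text)


def indexable_text(html):
    """Return the text that lies outside any ignored region."""
    extractor = IgnoreAwareTextExtractor()
    extractor.feed(html)
    extractor.close()
    return " ".join(extractor.chunks)
```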
I tested it today and this feature works perfectly. I also noticed
two important new options: the new Privileges tab for security, and the
option to erase an index. These are really great, and they are already working.
I am also looking forward to testing the new meta tag to fields implementation :)
Thank you for your efforts.
By the way,
It seems the PHP API does not yet have the updated code to support the new
privileges, or am I looking in the wrong location? I am not yet sure.
I also tried a simple query like the following URL:
This returned the following exception:
com.jaeksoft.searchlib.SearchLibException: Bad credential
So how do I make a query, or how should the API key be added? Could you let me
know?
Thanking you in advance.
Credential support is not implemented in the PHP API for the moment.
I'll have time to implement all the changes next week.
Until Pascal has updated the PHP client, you can manually add the parameters
login and key:
The API Key is not the password. It is an auto-generated key that can be found
in the web interface, in the privileges panel.
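As a rough sketch of adding the two parameters by hand, the following Python helper appends login and key to an existing set of query parameters. The helper name, the endpoint URL, and the q parameter in the usage note are placeholders for illustration, not documented OSS values; only the login and key parameter names come from the post above.

```python
from urllib.parse import urlencode


def build_search_url(base_url, params, login, api_key):
    """Append the login and key parameters to a query URL.

    base_url and params are whatever your existing query already uses;
    api_key is the auto-generated key from the privileges panel,
    not the account password.
    """
    query = dict(params)
    query["login"] = login
    query["key"] = api_key
    return base_url + "?" + urlencode(query)
```

For example, build_search_url("http://localhost:8080/select", {"q": "open search"}, "kim", "abc123") would return a URL ending in ?q=open+search&login=kim&key=abc123 (all values here are made up).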
Thanks for the information. Yes, I am able to search this way.
We now provide an alternative to the <oss ignore="yes"> feature. The goal
is to preserve the validity of XHTML pages.
You can now use:
<p>This text should not be indexed.</p>
The meta tag to fields implementation is available for testing (1.2 revision
You can see HTML sample here.
Credentials have been added to OSS_Search.class.php.
OSS_API.class.php will be committed once I've added support for the Schema API.
This is truly great news. Thank you for this information.
I hope to test these features soon and I will surely update you on the same.
Thank you once again
Today I ran a test to check the automated field generation from the meta
tags. However, I could not figure out where I can see these fields being
I used the exact same content as given by you in this url: http://www.open-
I even checked the xml output that we get from the following query:
I even checked in the backend "Returned Fields" Tab inside the "Query" tab
I still could not find the field named "category".
Could you please let me know how and where to find these fields and how to get
the field values?
The fields are not created dynamically. Did you create the field in the schema?
I went to the schema tab and tried to add the "category" field as follows:
I got the error saying "Unknown exception: java.lang.NullPointerException."
What could be the problem? Could you please let me know the exact procedure?
Strange, just now I was able to add the category field. However to my surprise
I see the "NullPointerException" for whatever operation I do in the backend!
What could be going wrong? Any idea?
Sorry for the inconvenience. The bug has been fixed.
These features are working really well.
By the way, I have a small suggestion, if you don't mind.
It would be nice if we could omit the login and the key being passed
along with the URL. Otherwise, when we start querying for different fields, the
URL length might exceed the limit.
A first workaround is to use the POST method. OSS handles both HTTP GET and
POST.
I will add a "no password" option for users. You will still need to pass
login=username, but no longer the API key.
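The POST workaround could look roughly like the Python sketch below, which moves the parameters out of the URL and into a form-encoded request body so URL length is no longer a concern. The helper name, the endpoint, and the parameter names in the usage note are illustrative assumptions, not documented OSS values.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_post_request(endpoint, params):
    """Build a POST request carrying the query parameters in the body.

    A list value is sent as a repeated parameter (doseq=True).
    """
    body = urlencode(params, doseq=True).encode("utf-8")
    return Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )
```

The request can then be sent with urllib.request.urlopen(request). A call such as build_post_request("http://localhost:8080/select", {"q": "test", "login": "kim", "key": "abc123"}) keeps the URL short no matter how many fields are requested.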
Thank you for the information.
By the way, I noticed another issue. Today I updated to the latest revision
In the backend I went to Query > Faceted Fields and selected the new field "title".
Then, when I hit the Search button, I get a message box titled "ZK" with the
message "3"!
Well I found the issue.
For a field to be a facet, I had to ensure that TermVector is set to "No".
Let me correct that. For a field to be a facet, it only needs to be indexable
("Indexation of content" checked in the schema tab panel).