You can subscribe to this list here.
2003 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(16) |
Jul
(56) |
Aug
(2) |
Sep
(62) |
Oct
(71) |
Nov
(45) |
Dec
(6) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2004 |
Jan
(12) |
Feb
(22) |
Mar
|
Apr
(62) |
May
(15) |
Jun
(57) |
Jul
(4) |
Aug
(24) |
Sep
(7) |
Oct
(34) |
Nov
(81) |
Dec
(41) |
2005 |
Jan
(70) |
Feb
(51) |
Mar
(46) |
Apr
(16) |
May
(22) |
Jun
(34) |
Jul
(23) |
Aug
(13) |
Sep
(43) |
Oct
(42) |
Nov
(54) |
Dec
(68) |
2006 |
Jan
(81) |
Feb
(43) |
Mar
(64) |
Apr
(141) |
May
(37) |
Jun
(101) |
Jul
(112) |
Aug
(32) |
Sep
(85) |
Oct
(63) |
Nov
(84) |
Dec
(81) |
2007 |
Jan
(25) |
Feb
(64) |
Mar
(46) |
Apr
(28) |
May
(14) |
Jun
(42) |
Jul
(19) |
Aug
(34) |
Sep
(29) |
Oct
(25) |
Nov
(12) |
Dec
(9) |
2008 |
Jan
(15) |
Feb
(34) |
Mar
(37) |
Apr
(23) |
May
(18) |
Jun
(47) |
Jul
(28) |
Aug
(61) |
Sep
(29) |
Oct
(48) |
Nov
(24) |
Dec
(79) |
2009 |
Jan
(48) |
Feb
(50) |
Mar
(28) |
Apr
(10) |
May
(51) |
Jun
(22) |
Jul
(125) |
Aug
(29) |
Sep
(38) |
Oct
(29) |
Nov
(58) |
Dec
(32) |
2010 |
Jan
(15) |
Feb
(10) |
Mar
(12) |
Apr
(64) |
May
(4) |
Jun
(81) |
Jul
(41) |
Aug
(82) |
Sep
(84) |
Oct
(35) |
Nov
(43) |
Dec
(26) |
2011 |
Jan
(59) |
Feb
(25) |
Mar
(23) |
Apr
(14) |
May
(22) |
Jun
(8) |
Jul
(5) |
Aug
(20) |
Sep
(10) |
Oct
(12) |
Nov
(29) |
Dec
(7) |
2012 |
Jan
(1) |
Feb
(22) |
Mar
(9) |
Apr
(5) |
May
(2) |
Jun
|
Jul
(6) |
Aug
(2) |
Sep
|
Oct
(5) |
Nov
(9) |
Dec
(10) |
2013 |
Jan
(9) |
Feb
(3) |
Mar
(2) |
Apr
(4) |
May
(2) |
Jun
(1) |
Jul
(2) |
Aug
(5) |
Sep
|
Oct
(3) |
Nov
(3) |
Dec
(2) |
2014 |
Jan
(1) |
Feb
(2) |
Mar
|
Apr
(10) |
May
(3) |
Jun
|
Jul
|
Aug
|
Sep
(1) |
Oct
|
Nov
|
Dec
(3) |
2015 |
Jan
(8) |
Feb
(3) |
Mar
(7) |
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
(1) |
Nov
(3) |
Dec
|
2016 |
Jan
(1) |
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
(2) |
2018 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
(1) |
Sep
|
Oct
|
Nov
|
Dec
|
2019 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
(8) |
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2020 |
Jan
|
Feb
|
Mar
|
Apr
(2) |
May
|
Jun
(1) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
(1) |
Dec
|
2021 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
(1) |
Sep
|
Oct
|
Nov
|
Dec
|
2023 |
Jan
|
Feb
|
Mar
(1) |
Apr
|
May
|
Jun
|
Jul
(4) |
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2025 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(1) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
From: Veit J. <nun...@go...> - 2011-10-14 19:56:44
|
2011/10/10 Veit Jahns <nun...@go...>: >> Microsoft suggests this: >> #ifdef _MSC_VER >> #if _MSC_VER == 1200 >> std::ostream& operator<<(std::ostream& os, __int64 i ) // borrowed from >> http://support.microsoft.com/kb/168440 >> { >> char buf[20]; >> sprintf(buf,"%I64d", i ); >> os << buf; >> return os; >> } >> #endif >> #endif >> With either of the 2 fixes above, both CLUCENE_CORE and CLUCENE_SHARED >> compile okay. > > I will then integrate the later one. Added this workaround to th vc6-branch. Kind regards, Veit Jahns |
From: Veit J. <nun...@go...> - 2011-10-10 12:04:06
|
Hi Celto! 2011/10/10 cel tix44 <cel...@gm...>: > Veit > Thanks a lot, very much appreciated. You are welcome! > I've tested the vc6-fixes-working branch and out of the box, VC6 throws just > one error -- when compiling SegmentInfos.cpp: > .... error C2593: 'operator <<' is ambiguous > which happens on this line: > .... (*infoStream) << "[SIS]: directory listing genA=" << genA << "\n"; > In my previous post, I used (int32_t) as a stopgap: > (*infoStream) << "[SIS]: directory listing genA=" << (int32_t)genA << "\n"; My mistake. Missed that. > Microsoft suggests this: > #ifdef _MSC_VER > #if _MSC_VER == 1200 > std::ostream& operator<<(std::ostream& os, __int64 i ) // borrowed from > http://support.microsoft.com/kb/168440 > { > char buf[20]; > sprintf(buf,"%I64d", i ); > os << buf; > return os; > } > #endif > #endif > With either of the 2 fixes above, both CLUCENE_CORE and CLUCENE_SHARED > compile okay. I will then integrate the later one. > Thanks again for your time & effort. > Could you advise whether VC6 fixes will live in the branch you created -- or > will they be merged into master? At least they will stay in this branch. In my opinion, they can be merged into master. But I don't know what the others think about this. Any thoughts? Kind regards, Veit |
From: cel t. <cel...@gm...> - 2011-10-10 02:18:30
|
Veit Thanks a lot, very much appreciated. I've tested the vc6-fixes-working branch and out of the box, VC6 throws just one error -- when compiling SegmentInfos.cpp: .... error C2593: 'operator <<' is ambiguous which happens on this line: .... (*infoStream) << "[SIS]: directory listing genA=" << genA << "\n"; In my previous post, I used (int32_t) as a stopgap: (*infoStream) << "[SIS]: directory listing genA=" << (int32_t)genA << "\n"; Microsoft suggests this: #ifdef _MSC_VER #if _MSC_VER == 1200 std::ostream& operator<<(std::ostream& os, __int64 i ) // borrowed from http://support.microsoft.com/kb/168440 { char buf[20]; sprintf(buf,"%I64d", i ); os << buf; return os; } #endif #endif With either of the 2 fixes above, both CLUCENE_CORE and CLUCENE_SHARED compile okay. Thanks again for your time & effort. Could you advise whether VC6 fixes will live in the branch you created -- or will they be merged into master? Regards Celto On Mon, Oct 10, 2011 at 1:14 AM, Veit Jahns <nun...@go...>wrote: > 2011/10/9 Veit Jahns <nun...@go...>: > > - Put #if-macros around the fixes, where I thought there are only VC6 > specific. > > - I didn't integrated your fix in IndexInput.h. I think this was a > > error caused by the error in ByteSliceReader::clone() > > Forgot to mention: I changed also your fixes with the static_casts. > Here you made downcasts. But the pointer there can also point to other > subclasses of Query. So I made it a static upcast on the other side of > the comparison. > > Veit > > > ------------------------------------------------------------------------------ > All of the data generated in your IT infrastructure is seriously valuable. > Why? It contains a definitive record of application performance, security > threats, fraudulent activity, and more. Splunk takes this data and makes > sense of it. IT sense. And common sense. > http://p.sf.net/sfu/splunk-d2dcopy2 > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > |
From: Veit J. <nun...@go...> - 2011-10-09 14:15:23
|
2011/9/12 cel tix44 <cel...@gm...>: > Team > Tried to compile v.2.3.3.4 by using VS2008 -- got 2 errors in > TestIndexSearcher.cpp: > error C3861: 'usleep': identifier not found > Fixed it as follows: > 1) replace usleep(9999) with _LUCENE_SLEEP(9999) > 2) add this include to test.h: > #include "CLucene/_SharedHeader.h" This is also integrated in the vc6-fixes-working branch [1]. Thanks for pointing this out. Veit [1] http://clucene.git.sourceforge.net/git/gitweb.cgi?p=clucene/clucene;a=commit;h=75905bca936bfbd6d34d38f0c2fbb360c7352a52 |
From: Veit J. <nun...@go...> - 2011-10-09 14:14:14
|
2011/10/9 Veit Jahns <nun...@go...>: > - Put #if-macros around the fixes, where I thought there are only VC6 specific. > - I didn't integrated your fix in IndexInput.h. I think this was a > error caused by the error in ByteSliceReader::clone() Forgot to mention: I changed also your fixes with the static_casts. Here you made downcasts. But the pointer there can also point to other subclasses of Query. So I made it a static upcast on the other side of the comparison. Veit |
From: Veit J. <nun...@go...> - 2011-10-09 14:11:17
|
Hi Celto, it took a while, but I integrated your fixes into clucene. I pushed them to the branch vc6-fixes-working [1]. I made the following changes to your fixes: - Put #if-macros around the fixes, where I thought there are only VC6 specific. - I didn't integrated your fix in IndexInput.h. I think this was a error caused by the error in ByteSliceReader::clone() I compiled them with on Windows VS2010 and with gcc on Ubuntu 10. No errors occurred. It would be nice, if you will check this branch too. In particular with VC6. Kind regards, Veit [1] http://clucene.git.sourceforge.net/git/gitweb.cgi?p=clucene/clucene;a=shortlog;h=refs/heads/vc6-fixes-working |
From: cel t. <cel...@gm...> - 2011-10-04 22:10:27
|
Klaus I've retested the compilation process twice, with VS 2003 .Net and VS 2008 -- in two independent tests -- both compiled okay for me as detailed below. If you are on Windows and Visual Studio is an option for you, you could try & follow the steps below and see if that works for you, too. Regards Celto 1) download & install Cmake -- link below is for v.2.8.6 Win32 Installer: http://www.cmake.org/files/v2.8/cmake-2.8.6-win32-x86.exe 2) download Zlib source code -- link below is for v.1.2.5 http://zlib.net/zlib125.zip 3) unzip Zlib -- in my test, I unzipped to c:\zlib-1.2.5 4) download Clucene source code -- link below is for v.2.3.3.4 http://sourceforge.net/projects/clucene/files/clucene-core-unstable/2.3/clucene-core-2.3.3.4.tar.gz/download 5) decompress Clucene -- in my test, I decompressed to C:\clucene-core-2.3.3.4 6) start Cmake and specify the following: -- "Where is the source code" --> C:/clucene-core-2.3.3.4 -- "Whereto build the binaries" --> C:/clucene-core-2.3.3.4 7) click the "Configure" button and choose your compiler. In my second test, I specified "Visual Studio 9 2008". 8) click "Finish" and wait a few minutes while Cmake generates code for you 9) after the first run, Cmake will show this error: "Error in configuration process, project files may be invalid" 10) To fix this, do the following: -- tick the "Advanced" checkbox (it's in the upper / right-hand side section of the Cmake screen) -- scroll down the list of options and locate the ZLIB_INCLUDE_DIR option -- set it to C:\zlib-1.2.5 -- click the Configure button again -- the error should be gone now 11) Click the "Stop" button. In Cmake's output window, you'll see "Generating done". 12) Now, check the contents of the C:\clucene-core-2.3.3.4 folder -- you'll see the clucene.sln solution there 13) Open that solution -- in my 2nd test, it was generated for VS 2008, so I opened it with VS 2008 14) In the "Solution Explorer" window, right-click the ALL_BUILD project and choose "Rebuild" 15) The compilation throws a few warning -- but completes without errors. On Sat, Oct 1, 2011 at 12:38 AM, Veit Jahns <nun...@go...>wrote: > Hi Klaus, > > what are your troubles with compiling clucene? Using the Cmake tool > compiling clucene is rather simple. Just execute Cmake and open and > build the created VS solution file. > > Kind regards, > > Veit Jahns > > 2011/9/28 Klaus Gruen <kla...@ro...>: > > Hi all, > > I am not a compiling guru and struggle with building windows > binaries...can > > ANYONE kindly post a link or point me to compiled EXE binaries for > windows, > > or put it up temporarily for me ? I cannot even move forward until this > in > > place, and I followed the directions here with dismal failure results: > > http://clucene.sourceforge.net/download.shtml#2_3_2 > > thank you!!!!! > > klaus > > > > > ------------------------------------------------------------------------------ > > All the data continuously generated in your IT infrastructure contains a > > definitive record of customers, application performance, security > > threats, fraudulent activity and more. Splunk takes this data and makes > > sense of it. Business sense. IT sense. Common sense. > > http://p.sf.net/sfu/splunk-d2dcopy1 > > _______________________________________________ > > CLucene-developers mailing list > > CLu...@li... > > https://lists.sourceforge.net/lists/listinfo/clucene-developers > > > > > > > ------------------------------------------------------------------------------ > All of the data generated in your IT infrastructure is seriously valuable. > Why? It contains a definitive record of application performance, security > threats, fraudulent activity, and more. Splunk takes this data and makes > sense of it. IT sense. And common sense. > http://p.sf.net/sfu/splunk-d2dcopy2 > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > |
From: Veit J. <nun...@go...> - 2011-09-30 14:38:50
|
Hi Klaus, what are your troubles with compiling clucene? Using the Cmake tool compiling clucene is rather simple. Just execute Cmake and open and build the created VS solution file. Kind regards, Veit Jahns 2011/9/28 Klaus Gruen <kla...@ro...>: > Hi all, > I am not a compiling guru and struggle with building windows binaries...can > ANYONE kindly post a link or point me to compiled EXE binaries for windows, > or put it up temporarily for me ? I cannot even move forward until this in > place, and I followed the directions here with dismal failure results: > http://clucene.sourceforge.net/download.shtml#2_3_2 > thank you!!!!! > klaus > > ------------------------------------------------------------------------------ > All the data continuously generated in your IT infrastructure contains a > definitive record of customers, application performance, security > threats, fraudulent activity and more. Splunk takes this data and makes > sense of it. Business sense. IT sense. Common sense. > http://p.sf.net/sfu/splunk-d2dcopy1 > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > > |
From: Klaus G. <kla...@ro...> - 2011-09-28 04:00:31
|
Hi all, I am not a compiling guru and struggle with building windows binaries...can ANYONE kindly post a link or point me to compiled EXE binaries for windows, or put it up temporarily for me ? I cannot even move forward until this in place, and I followed the directions here with dismal failure results: http://clucene.sourceforge.net/download.shtml#2_3_2 thank you!!!!! klaus |
From: cel t. <cel...@gm...> - 2011-09-20 11:08:46
|
Veit 1) I've patched both CORE and SHARED to make them compile under VC6. 2) As changes affected many files -- I will send you the whole SRC folder rather than individual files 3) My changes are marked with the word CELTO. The fixes need to be reviewed, please -- as some of them were made just to avoid compilation errors, without proper evaluation of their effect. Please let me know if I can do anything to make revision easier for you. Regards Celto On Tue, Sep 20, 2011 at 2:57 AM, Veit Jahns <nun...@go...>wrote: > Hi Celto! > > 2011/9/19 cel tix44 <cel...@gm...>: > > Veit > > Thanks very much for your response. > > Attached is a list of compilation errors "as is" -- produced after > getting > > the source code from GIT, config'ed with Cmake 2.8 (default settings, no > > boost, no zlib) and built with vc6 -- essentially, for you to estimate if > > it's worth the time & effort at all. > > I think so. Actually, some of them seem to me to be errors in the > code. E.g., the "void function returning a value" errors. I wonder, > why these cause no errors with newer versions of Visual C++. > > > If yes, I can start by fixing all simple cases (like C2248: cannot access > > private member) -- and then, if that's okay with you, resubmit what's > left > > for your perusal? > > It's okay. > > Kind regards, > > Veit > > > ------------------------------------------------------------------------------ > BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA > Learn about the latest advances in developing for the > BlackBerry® mobile platform with sessions, labs & more. > See new tools and technologies. Register for BlackBerry® DevCon today! > http://p.sf.net/sfu/rim-devcon-copy1 > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > |
From: Veit J. <nun...@go...> - 2011-09-19 16:57:23
|
Hi Celto! 2011/9/19 cel tix44 <cel...@gm...>: > Veit > Thanks very much for your response. > Attached is a list of compilation errors "as is" -- produced after getting > the source code from GIT, config'ed with Cmake 2.8 (default settings, no > boost, no zlib) and built with vc6 -- essentially, for you to estimate if > it's worth the time & effort at all. I think so. Actually, some of them seem to me to be errors in the code. E.g., the "void function returning a value" errors. I wonder, why these cause no errors with newer versions of Visual C++. > If yes, I can start by fixing all simple cases (like C2248: cannot access > private member) -- and then, if that's okay with you, resubmit what's left > for your perusal? It's okay. Kind regards, Veit |
From: Itamar Syn-H. <it...@co...> - 2011-09-17 18:11:38
|
inline On Fri, Sep 16, 2011 at 5:51 PM, Greg Hellings <gre...@gm...>wrote: > I'm working to update two of my projects which utilize CLucene 0.21b > and trying to update them to CLucene 2.3.3.4. I haven't come across > any posts or wiki pages that document a suggested upgrade path - are > there any? If so, it might make my transition proceed much more > quickly. No, since the 2.3.* work was never completed, and we are now working on a newer codebase , which we hope to finalize in the upcoming months. > At present I have the following specific questions: > > My project utilizes lucene_utf8towcs and lucene_wcstoutf8. These are > still present in CLucene but are not included in the publicly > installed headers. Is this purposeful or not? If it is, what is the > proper way to do those conversions when passing data into and out of > CLucene? > Those were split to a clucene_shared project, whereas what you are using is clucene_core. Just reference the main header of the shared project and you should be set. > > There are a number of other missing classes or the like which were > being used in 0.9.21b which are not present in 2.3.3.4. These lead to > error messages like > > error: ‘Text’ is not a member of ‘lucene::document::Field’ > error: ‘UnStored’ is not a member of ‘lucene::document::Field’ > error: no matching function for call to > ‘lucene::index::IndexWriter::addIndexes(lucene::store::Directory* > [2])’ > Some classes where made invisible intentionally, since they should only be used internally. In the specific error message you have shown here, the API has changed to match that of Lucene of the parallel version. See the sample application and tests for the up to date syntax (sorry, don't have it handy atm). HTH > > and similar. Again, is there a document somewhere that defines > standard upgrade paths and suggestions? Thanks. > > --Greg > > > ------------------------------------------------------------------------------ > BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA > http://p.sf.net/sfu/rim-devcon-copy2 > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > > |
From: Veit J. <nun...@go...> - 2011-09-17 10:17:47
|
Hi Celto! 2011/9/14 cel tix44 <cel...@gm...>: > Team > When compiled with VC6, v.2.3.3.4 throws some 300 errors. > May I please ask the DEV team whether Clucene is still going to be > compatible with VC6? I think, it was not intended in the first place. This release includes some parts of the Boost library 1.42. And this version does not support VC6 [1]. But otherwise, only small parts of the Boost library are used, so maybe this is not a big issue. And if there is a need that this version can be compiled with VC6, we should at least try to make it compile. What kind of errors do you get? > If yes, I'll fix what I can, but I will need help from the DEV team where > it's over my head, please. If I can assist you with this, I will do so. Kind regards, Veit [1] http://www.boost.org/users/history/version_1_42_0.html |
From: Greg H. <gre...@gm...> - 2011-09-16 14:51:57
|
I'm working to update two of my projects which utilize CLucene 0.21b and trying to update them to CLucene 2.3.3.4. I haven't come across any posts or wiki pages that document a suggested upgrade path - are there any? If so, it might make my transition proceed much more quickly. At present I have the following specific questions: My project utilizes lucene_utf8towcs and lucene_wcstoutf8. These are still present in CLucene but are not included in the publicly installed headers. Is this purposeful or not? If it is, what is the proper way to do those conversions when passing data into and out of CLucene? There are a number of other missing classes or the like which were being used in 0.9.21b which are not present in 2.3.3.4. These lead to error messages like error: ‘Text’ is not a member of ‘lucene::document::Field’ error: ‘UnStored’ is not a member of ‘lucene::document::Field’ error: no matching function for call to ‘lucene::index::IndexWriter::addIndexes(lucene::store::Directory* [2])’ and similar. Again, is there a document somewhere that defines standard upgrade paths and suggestions? Thanks. --Greg |
From: cel t. <cel...@gm...> - 2011-09-14 13:26:02
|
Team When compiled with VC6, v.2.3.3.4 throws some 300 errors. May I please ask the DEV team whether Clucene is still going to be compatible with VC6? If yes, I'll fix what I can, but I will need help from the DEV team where it's over my head, please. Regards Celto |
From: cel t. <cel...@gm...> - 2011-09-12 14:01:36
|
Team Tried to compile v.2.3.3.4 by using VS2008 -- got 2 errors in TestIndexSearcher.cpp: error C3861: 'usleep': identifier not found Fixed it as follows: 1) replace usleep(9999) with _LUCENE_SLEEP(9999) 2) add this include to test.h: #include "CLucene/_SharedHeader.h" Regards Celto |
From: Veit J. <nun...@go...> - 2011-08-25 18:50:01
|
Hi Rahul! 2011/8/18 Asharudeen <ash...@gm...>: > However, I believe, in Lucene the indexed data would only be 1% to 10% of > the original file data. Plz correct me, if I am wrong. I read once that is 30% to 40% of the original size. But this also depends on how the data is tokenized and indexed. By an appropriate analyzer you can also get the opposite. E.g., if you use an analyzer adding additional data the is inserted into the indexed. > So, I want to check if i would be able to use 'CLucene' project in the client > side, and generate only the analysed data that needs to be stored in an > index. Then, i would transfer this data to the server (through socket or > curl upload), and index the analysed content on the server side. So, with > this approach, i want to avoid transfering entire files and transfer only the > indexable portion of the content as input to the server. Then on the server > side, i want to perform the necessary processing to create the index with > this input data. Is there any way/api to achieve these steps on both the > client and server side using CLucene. Or any way to achieve this by > digging into the CLucene codes/project ? Yes, Lucene is capable of merging indexes, and so is CLucene. But what I don't know, if the different versions will be a problem. If I am not wrong, Solr is based on Lucene 3.3 and CLucene on the code base of Lucene 2.3.2. But in the past I was able to open an index created by CLucene with Luke. If I optimized the index I got an index in to uptodate format of Lucene. So may be, if a direct merge isn't possible, a optimization will convert the index to the current format. Then the indexes can be merged. Kind regards, Veit |
From: Veit J. <nun...@go...> - 2011-08-25 18:40:23
|
Hi Timo! 2011/8/17 Timo Sirainen <ts...@ik...>: > Is there some API call to tell me how many segment files there are > currently, so I could try to guess if I should optimize the index (after > a small incremental update)? I don't know it excatly, but the SegmentInfos (part of IndexWriter) may be useful here. Kind regards, Veit |
From: Asharudeen <ash...@gm...> - 2011-08-18 11:23:00
|
Hi, Currently I am indexing documents using Solr, by directly adding files as 'req.addFile(fi);' or by sending the content of the file like 'req.addContentStream(stream);' (with solrj library). Actually, I need to index the documents from my local network to Solr server running on a remote network. The reason is, I would like to perform searches on server end and dont want to store the indexed data at client side. Typically, the approach is to transfer the entire file content to remote server (since, Solr server is running on different remote network). However, I believe, in Lucene the indexed data would only be 1% to 10% of the original file data. Plz correct me, if I am wrong. So, I want to check if i would be able to use 'CLucene' project in the client side, and generate only the analysed data that needs to be stored in an index. Then, i would transfer this data to the server (through socket or curl upload), and index the analysed content on the server side. So, with this approach, i want to avoid transfering entire files and transfer only the indexable portion of the content as input to the server. Then on the server side, i want to perform the necessary processing to create the index with this input data. Is there any way/api to achieve these steps on both the client and server side using CLucene. Or any way to achieve this by digging into the CLucene codes/project ? Regards, Rahul. |
From: Timo S. <ts...@ik...> - 2011-08-17 19:14:08
|
Is there some API call to tell me how many segment files there are currently, so I could try to guess if I should optimize the index (after a small incremental update)? |
From: theorist <eg...@ma...> - 2011-08-08 14:04:43
|
that's right! the filename field is tokenized. i have to use an untokenized one to delete documents. thank you! 08 августа 2011, 06:08 от Ben van Klinken <bva...@gm...>: |
From: Clemens <cz...@au...> - 2011-08-07 22:56:06
|
On Mon, 08 Aug 2011 10:47:17 +1200, Ben van Klinken <bva...@gm...> wrote: > Depends on the amount of data size you are talking about and how big > your batches are. > The second option doesn't require you to index twice. You create a > temporary index and you merge it using index.addIndexes. This is > actually a similar process which happens in memory when indexing > anyway. cool will try this. Regards, Clemens |
From: Ben v. K. <bva...@gm...> - 2011-08-07 22:53:05
|
How are you deleting the documents? On Sun, Aug 7, 2011 at 5:58 PM, theorist <eg...@ma...> wrote: > hello, > > i can still read documents in my index which have been deleted with IndexModifier::deleteDocuments even after flushing, optimizing, closing the index and then reopening it with IndexReader. what's wrong? > > thanks. > ------------------------------------------------------------------------------ > BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA > The must-attend event for mobile developers. Connect with experts. > Get tools for creating Super Apps. See the latest technologies. > Sessions, hands-on labs, demos & much more. Register early & save! > http://p.sf.net/sfu/rim-blackberry-1 > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > |
From: Ben v. K. <bva...@gm...> - 2011-08-07 22:47:43
|
Depends on the amount of data size you are talking about and how big your batches are. The second option doesn't require you to index twice. You create a temporary index and you merge it using index.addIndexes. This is actually a similar process which happens in memory when indexing anyway. Ben On Mon, Aug 8, 2011 at 6:54 AM, Clemens <cz...@au...> wrote: > > ok thanks! if I got it right there are two options. First, including the > file id into the query to only match a special file. For example, I store > the file path and the file content into the CLucene db. The file path > would be my file id I add into the query term. > Second, when updating a file in the database I first add the file to a > temp db and search this single entry database for the queries from my list > of live queries. > > Any suggestion/ideas which solution performs better? Indexing the file > twice sounds slower to me but maybe I'm wrong? > > thank you, > Clemens > > > On Sun, 07 Aug 2011 11:26:13 +1200, Ben van Klinken > <bva...@gm...> wrote: > > > Another approach would be to create your indexes to a temporary location > > and > > search that location for your live query before merging into the main > > index. > > > > Ben > > > > On Sun, Aug 7, 2011 at 9:05 AM, Itamar Syn-Hershko > > <it...@co...>wrote: > > > >> There isn't such thing built into clucene nor Java Lucene. You are > >> going to > >> have to keep a list of document IDs that once matched a query, and to > >> perform searches in the background every now and then with that > >> document ID > >> in it (use your IDs, not Lucene's internal docids). > >> > >> > >> On Sat, Aug 6, 2011 at 9:12 AM, Clemens <cz...@au...> > >> wrote: > >> > >>> Hi, > >>> > >>> not sure if this question better goes to the java lucene mailing list > >>> but > >>> you probably can help me too. > >>> > >>> I'm using clucene to implement a desktop search engine for the Haiku > >>> OS. I > >>> like to notify the user when a new document matches an existing query > >>> or a > >>> document not match anymore. Is there something like this build in? > >>> Otherwise I just need to check if an modified document matches a query. > >>> > >>> How can I check if a document satisfy a query? Or do I have to modify > >>> the > >>> query to only search a special document? What is the best to do it? > >>> > >>> thank you, > >>> Clemens > >>> > >>> > >>> ------------------------------------------------------------------------------ > >>> BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA > >>> The must-attend event for mobile developers. Connect with experts. > >>> Get tools for creating Super Apps. See the latest technologies. > >>> Sessions, hands-on labs, demos & much more. Register early & save! > >>> http://p.sf.net/sfu/rim-blackberry-1 > >>> _______________________________________________ > >>> CLucene-developers mailing list > >>> CLu...@li... > >>> https://lists.sourceforge.net/lists/listinfo/clucene-developers > >>> > >>> > >> > >> > >> ------------------------------------------------------------------------------ > >> BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA > >> The must-attend event for mobile developers. Connect with experts. > >> Get tools for creating Super Apps. See the latest technologies. > >> Sessions, hands-on labs, demos & much more. Register early & save! > >> http://p.sf.net/sfu/rim-blackberry-1 > >> _______________________________________________ > >> CLucene-developers mailing list > >> CLu...@li... > >> https://lists.sourceforge.net/lists/listinfo/clucene-developers > > ------------------------------------------------------------------------------ > BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA > The must-attend event for mobile developers. Connect with experts. > Get tools for creating Super Apps. See the latest technologies. > Sessions, hands-on labs, demos & much more. Register early & save! > http://p.sf.net/sfu/rim-blackberry-1 > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers |
From: Clemens <cz...@au...> - 2011-08-07 20:54:50
|
ok thanks! if I got it right there are two options. First, including the file id into the query to only match a special file. For example, I store the file path and the file content into the CLucene db. The file path would be my file id I add into the query term. Second, when updating a file in the database I first add the file to a temp db and search this single entry database for the queries from my list of live queries. Any suggestion/ideas which solution performs better? Indexing the file twice sounds slower to me but maybe I'm wrong? thank you, Clemens On Sun, 07 Aug 2011 11:26:13 +1200, Ben van Klinken <bva...@gm...> wrote: > Another approach would be to create your indexes to a temporary location > and > search that location for your live query before merging into the main > index. > > Ben > > On Sun, Aug 7, 2011 at 9:05 AM, Itamar Syn-Hershko > <it...@co...>wrote: > >> There isn't such thing built into clucene nor Java Lucene. You are >> going to >> have to keep a list of document IDs that once matched a query, and to >> perform searches in the background every now and then with that >> document ID >> in it (use your IDs, not Lucene's internal docids). >> >> >> On Sat, Aug 6, 2011 at 9:12 AM, Clemens <cz...@au...> >> wrote: >> >>> Hi, >>> >>> not sure if this question better goes to the java lucene mailing list >>> but >>> you probably can help me too. >>> >>> I'm using clucene to implement a desktop search engine for the Haiku >>> OS. I >>> like to notify the user when a new document matches an existing query >>> or a >>> document not match anymore. Is there something like this build in? >>> Otherwise I just need to check if an modified document matches a query. >>> >>> How can I check if a document satisfy a query? Or do I have to modify >>> the >>> query to only search a special document? What is the best to do it? >>> >>> thank you, >>> Clemens >>> >>> >>> ------------------------------------------------------------------------------ >>> BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA >>> The must-attend event for mobile developers. Connect with experts. >>> Get tools for creating Super Apps. See the latest technologies. >>> Sessions, hands-on labs, demos & much more. Register early & save! >>> http://p.sf.net/sfu/rim-blackberry-1 >>> _______________________________________________ >>> CLucene-developers mailing list >>> CLu...@li... >>> https://lists.sourceforge.net/lists/listinfo/clucene-developers >>> >>> >> >> >> ------------------------------------------------------------------------------ >> BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA >> The must-attend event for mobile developers. Connect with experts. >> Get tools for creating Super Apps. See the latest technologies. >> Sessions, hands-on labs, demos & much more. Register early & save! >> http://p.sf.net/sfu/rim-blackberry-1 >> _______________________________________________ >> CLucene-developers mailing list >> CLu...@li... >> https://lists.sourceforge.net/lists/listinfo/clucene-developers |