You can subscribe to this list here.
2003 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(16) |
Jul
(56) |
Aug
(2) |
Sep
(62) |
Oct
(71) |
Nov
(45) |
Dec
(6) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2004 |
Jan
(12) |
Feb
(22) |
Mar
|
Apr
(62) |
May
(15) |
Jun
(57) |
Jul
(4) |
Aug
(24) |
Sep
(7) |
Oct
(34) |
Nov
(81) |
Dec
(41) |
2005 |
Jan
(70) |
Feb
(51) |
Mar
(46) |
Apr
(16) |
May
(22) |
Jun
(34) |
Jul
(23) |
Aug
(13) |
Sep
(43) |
Oct
(42) |
Nov
(54) |
Dec
(68) |
2006 |
Jan
(81) |
Feb
(43) |
Mar
(64) |
Apr
(141) |
May
(37) |
Jun
(101) |
Jul
(112) |
Aug
(32) |
Sep
(85) |
Oct
(63) |
Nov
(84) |
Dec
(81) |
2007 |
Jan
(25) |
Feb
(64) |
Mar
(46) |
Apr
(28) |
May
(14) |
Jun
(42) |
Jul
(19) |
Aug
(34) |
Sep
(29) |
Oct
(25) |
Nov
(12) |
Dec
(9) |
2008 |
Jan
(15) |
Feb
(34) |
Mar
(37) |
Apr
(23) |
May
(18) |
Jun
(47) |
Jul
(28) |
Aug
(61) |
Sep
(29) |
Oct
(48) |
Nov
(24) |
Dec
(79) |
2009 |
Jan
(48) |
Feb
(50) |
Mar
(28) |
Apr
(10) |
May
(51) |
Jun
(22) |
Jul
(125) |
Aug
(29) |
Sep
(38) |
Oct
(29) |
Nov
(58) |
Dec
(32) |
2010 |
Jan
(15) |
Feb
(10) |
Mar
(12) |
Apr
(64) |
May
(4) |
Jun
(81) |
Jul
(41) |
Aug
(82) |
Sep
(84) |
Oct
(35) |
Nov
(43) |
Dec
(26) |
2011 |
Jan
(59) |
Feb
(25) |
Mar
(23) |
Apr
(14) |
May
(22) |
Jun
(8) |
Jul
(5) |
Aug
(20) |
Sep
(10) |
Oct
(12) |
Nov
(29) |
Dec
(7) |
2012 |
Jan
(1) |
Feb
(22) |
Mar
(9) |
Apr
(5) |
May
(2) |
Jun
|
Jul
(6) |
Aug
(2) |
Sep
|
Oct
(5) |
Nov
(9) |
Dec
(10) |
2013 |
Jan
(9) |
Feb
(3) |
Mar
(2) |
Apr
(4) |
May
(2) |
Jun
(1) |
Jul
(2) |
Aug
(5) |
Sep
|
Oct
(3) |
Nov
(3) |
Dec
(2) |
2014 |
Jan
(1) |
Feb
(2) |
Mar
|
Apr
(10) |
May
(3) |
Jun
|
Jul
|
Aug
|
Sep
(1) |
Oct
|
Nov
|
Dec
(3) |
2015 |
Jan
(8) |
Feb
(3) |
Mar
(7) |
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
(1) |
Nov
(3) |
Dec
|
2016 |
Jan
(1) |
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
(2) |
2018 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
(1) |
Sep
|
Oct
|
Nov
|
Dec
|
2019 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
(8) |
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2020 |
Jan
|
Feb
|
Mar
|
Apr
(2) |
May
|
Jun
(1) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
(1) |
Dec
|
2021 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
(1) |
Sep
|
Oct
|
Nov
|
Dec
|
2023 |
Jan
|
Feb
|
Mar
(1) |
Apr
|
May
|
Jun
|
Jul
(4) |
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2025 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(1) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
From: Saiful K. <sai...@gm...> - 2012-12-30 20:06:19
|
Dear All, I am new to CLucene and I want to build and install clucene-core-2.3.3.4 in my Ubuntu-12.04LTS machine. I do need this IR system for academic research purpose and over the next several years I am looking forward to contribute to it. However, at the very first step while trying to configure it I am getting the following error. Please let me know if you need more information. Humbly request your quick response. -Thanks Compiling as RelWithDebInfo CMake Error at src/shared/cmake/MacroMustDefine.cmake:45 (MESSAGE): wcslen could not be found Call Stack (most recent call first): src/shared/CMakeLists.txt:97 (CHECK_REQUIRED_FUNCTIONS) |
From: Kostka B. <ko...@to...> - 2012-12-14 10:52:16
|
OK, merged. I also merged my previous post – bugfix in ConstantScoreQuery From: Itamar Syn-Hershko [mailto:it...@co...] Sent: Thursday, December 13, 2012 8:43 AM To: clu...@li... Subject: Re: [CLucene-dev] BitSet::nextSetBit very inefficient for sparse bit sets Feel free to merge it into master On Wed, Dec 12, 2012 at 4:27 PM, Kostka Bořivoj <ko...@to...<mailto:ko...@to...>> wrote: BitSet::nexSetBit is implemented very inefficient way for sparse bit sets. It searches for next bit set by per-bit iteration and bit shifting See OPTIMIZED_BITSET branch for better solution. It is approximately 8 times faster. It could be still improved (probably 4 times faster) by using uint32_t instead of current uint8_8 array, but it needs deeper changes. Borek ------------------------------------------------------------------------------ LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial Remotely access PCs and mobile devices and provide instant support Improve your efficiency, and focus on delivering more value-add services Discover what IT Professionals Know. Rescue delivers http://p.sf.net/sfu/logmein_12329d2d _______________________________________________ CLucene-developers mailing list CLu...@li...<mailto:CLu...@li...> https://lists.sourceforge.net/lists/listinfo/clucene-developers |
From: Itamar Syn-H. <it...@co...> - 2012-12-13 07:42:59
|
Feel free to merge it into master On Wed, Dec 12, 2012 at 4:27 PM, Kostka Bořivoj <ko...@to...> wrote: > BitSet::nexSetBit is implemented very inefficient way for sparse bit sets. > It searches for next bit set by per-bit iteration and bit shifting > See OPTIMIZED_BITSET branch for better solution. It is approximately 8 > times faster. It could be still improved (probably 4 times faster) by > using uint32_t instead of current uint8_8 array, but it needs deeper > changes. > Borek > > > ------------------------------------------------------------------------------ > LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial > Remotely access PCs and mobile devices and provide instant support > Improve your efficiency, and focus on delivering more value-add services > Discover what IT Professionals Know. Rescue delivers > http://p.sf.net/sfu/logmein_12329d2d > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > > |
From: Kostka B. <ko...@to...> - 2012-12-12 14:27:26
|
BitSet::nexSetBit is implemented very inefficient way for sparse bit sets. It searches for next bit set by per-bit iteration and bit shifting See OPTIMIZED_BITSET branch for better solution. It is approximately 8 times faster. It could be still improved (probably 4 times faster) by using uint32_t instead of current uint8_8 array, but it needs deeper changes. Borek |
From: Ji C. <ji...@te...> - 2012-12-06 21:33:40
|
I think you did not add the "include" and "libs" correctly, dont just copy the files, you check here to specifier two libs and two folders include http://stackoverflow.com/questions/4789927/including-an-external-library-in-visual-studio-2010-project i hope you will get it ----- Mail original ----- De: "Rajeev kumar Singh" <raj...@gm...> À: clu...@li... Envoyé: Vendredi 7 Décembre 2012 01:35:37 Objet: [CLucene-dev] Getting link error Hello CLucene Developers, I am requesting you to help me in resolving this issue with CLucene. This is first time i am using CLucene. I have created a solution out of CLucene and copied the "include" and "lib" folder to my application (ASCII supported only) and i am accessing CLucene.h and the libraries to access the CLucene. When i am building (make or rebuild) my application folder i am getting this (pasted at end) error. I also created an independent solution like demo project and was able to access well but not sure why i am getting link error here. >From the error log it looks like that there is some issue with make file or the lib but after re-checking all i am not able find any issue. Please help. i might not have provided entire context but some at least so please let me know if you need some more info. I am using VS2008 Error: xxxxxx.obj : error LNK2001: unresolved external symbol "public: virtual bool __thiscall lucene::store::RAMDirectory::list(class stlpd_std::vector<class stlpd_std::basic_string<char,class stlpd_std::char_traits<char>,class stlpd_std::allocator<char> >,class stlpd_std::allocator<class stlpd_std::basic_string<char,class stlpd_std::char_traits<char>,class stlpd_std::allocator<char> > > > *)const " (?list@RAMDirectory@store@lucene@@UBE_NPAV?$vector@V?$basic_string@DV?$char_traits@D@stlpd_std@@V?$allocator@D@2@@stlpd_std@@V?$allocator@V?$basic_string@DV?$char_traits@D@stlpd_std@@V?$allocator@D@2@@stlpd_std@@@2@@stlpd_std@@@Z) xxxxxx.obj : error LNK2001: unresolved external symbol "public: virtual class stlpd_std::basic_string<char,class stlpd_std::char_traits<char>,class stlpd_std::allocator<char> > __thiscall lucene::store::RAMDirectory::toString(void)const " (?toString@RAMDirectory@store@lucene@@UBE?AV?$basic_string@DV?$char_traits@D@stlpd_std@@V?$allocator@D@2@@stlpd_std@@XZ) xxxxxx.obj : error LNK2001: unresolved external symbol "public: virtual class stlpd_std::basic_string<char,class stlpd_std::char_traits<char>,class stlpd_std::allocator<char> > __thiscall lucene::store::Directory::getLockID(void)" (?getLockID@Directory@store@lucene@@UAE?AV?$basic_string@DV?$char_traits@D@stlpd_std@@V?$allocator@D@2@@stlpd_std@@XZ) xxxxxx.DLL : fatal error LNK1120: 3 unresolved externals -- Thanks, Rajeev. ------------------------------------------------------------------------------ LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial Remotely access PCs and mobile devices and provide instant support Improve your efficiency, and focus on delivering more value-add services Discover what IT Professionals Know. Rescue delivers http://p.sf.net/sfu/logmein_12329d2d _______________________________________________ CLucene-developers mailing list CLu...@li... https://lists.sourceforge.net/lists/listinfo/clucene-developers |
From: Rajeev k. S. <raj...@gm...> - 2012-12-06 17:36:29
|
Hello CLucene Developers, I am requesting you to help me in resolving this issue with CLucene. This is first time i am using CLucene. I have created a solution out of CLucene and copied the "include" and "lib" folder to my application (ASCII supported only) and i am accessing CLucene.h and the libraries to access the CLucene. When i am building (make or rebuild) my application folder i am getting this (pasted at end) error. I also created an independent solution like demo project and was able to access well but not sure why i am getting link error here. >From the error log it looks like that there is some issue with make file or the lib but after re-checking all i am not able find any issue. Please help. i might not have provided entire context but some at least so please let me know if you need some more info. I am using VS2008 Error: xxxxxx.obj : error LNK2001: unresolved external symbol "public: virtual bool __thiscall lucene::store::RAMDirectory::list(class stlpd_std::vector<class stlpd_std::basic_string<char,class stlpd_std::char_traits<char>,class stlpd_std::allocator<char> >,class stlpd_std::allocator<class stlpd_std::basic_string<char,class stlpd_std::char_traits<char>,class stlpd_std::allocator<char> > > > *)const " (?list@RAMDirectory@store@lucene@@UBE_NPAV?$vector@V?$basic_string@DV ?$char_traits@D@stlpd_std@@V?$allocator@D@2@@stlpd_std@@V?$allocator@V ?$basic_string@DV?$char_traits@D@stlpd_std@@V?$allocator@D@2@@stlpd_std@@@2@ @stlpd_std@@@Z) xxxxxx.obj : error LNK2001: unresolved external symbol "public: virtual class stlpd_std::basic_string<char,class stlpd_std::char_traits<char>,class stlpd_std::allocator<char> > __thiscall lucene::store::RAMDirectory::toString(void)const " (?toString@RAMDirectory @store@lucene@@UBE?AV?$basic_string@DV?$char_traits@D@stlpd_std@ @V?$allocator@D@2@@stlpd_std@@XZ) xxxxxx.obj : error LNK2001: unresolved external symbol "public: virtual class stlpd_std::basic_string<char,class stlpd_std::char_traits<char>,class stlpd_std::allocator<char> > __thiscall lucene::store::Directory::getLockID(void)" (?getLockID@Directory @store@lucene@@UAE?AV?$basic_string@DV?$char_traits@D@stlpd_std@ @V?$allocator@D@2@@stlpd_std@@XZ) xxxxxx.DLL : fatal error LNK1120: 3 unresolved externals -- Thanks, Rajeev. |
From: Ji C. <ji...@te...> - 2012-12-06 12:55:38
|
hi all, I'm new here, i use clucene a week ago, and i worked on the demo and found that it searches document. Now i want to finish the work like this : i have a log.txt, i want to search the key words and return the sentences that include the key_words. Can anyone help me? Best Regards |
From: Vitaly A. <vit...@gm...> - 2012-11-25 13:52:22
|
I am sorry I found the problem - different Character Set On Sun, Nov 25, 2012 at 3:09 PM, Vitaly Artemov <vit...@gm...>wrote: > Hi all, > I unable to build contrib library on Windows 7 64 bit. > I created clucene-contribs-lib solution(Visual Studio 2010) using CMAKE UI. > I added clucene-core.lib and clucene-shared.lib as additional dependencies. > But then I try to build it I get multiple link errors: > LanguageBasedAnalyzer.obj : error LNK2001: unresolved external symbol > "public: virtual class lucene::analysis::TokenStream * __cdecl > lucene::analysis::Analyzer::reusableTokenStream(char const *,class > lucene::util::Reader *)" (?reusableTokenStream@Analyzer@analysis@lucene > @@UEAAPEAVTokenStream@23@PEBDPEAVReader@util@3@@Z) ... > ... > ... > C:\Tools\Clucene\clucene-core-2.3.3.4\windows_x64_contrib\Release\clucene-contribs-lib.dll > : fatal error LNK1120: 34 unresolved externals > > It's seems that it for some reason clucene-contribs-lib can't find exports > from clucene-core.lib. > > Thanks in advance, Vitaly > |
From: Vitaly A. <vit...@gm...> - 2012-11-25 13:09:09
|
Hi all, I unable to build contrib library on Windows 7 64 bit. I created clucene-contribs-lib solution(Visual Studio 2010) using CMAKE UI. I added clucene-core.lib and clucene-shared.lib as additional dependencies. But then I try to build it I get multiple link errors: LanguageBasedAnalyzer.obj : error LNK2001: unresolved external symbol "public: virtual class lucene::analysis::TokenStream * __cdecl lucene::analysis::Analyzer::reusableTokenStream(char const *,class lucene::util::Reader *)" (?reusableTokenStream@Analyzer@analysis@lucene @@UEAAPEAVTokenStream@23@PEBDPEAVReader@util@3@@Z) ... ... ... C:\Tools\Clucene\clucene-core-2.3.3.4\windows_x64_contrib\Release\clucene-contribs-lib.dll : fatal error LNK1120: 34 unresolved externals It's seems that it for some reason clucene-contribs-lib can't find exports from clucene-core.lib. Thanks in advance, Vitaly |
From: Vitaly A. <vit...@gm...> - 2012-11-25 11:32:53
|
I checked CJKAnalyzer source and see only CJKTokenizer implementation in it. Is it means that I need to create specific analyzer to use it with CJKTokenizer? Thanks, Vitaly On Thu, Nov 22, 2012 at 1:13 PM, Freiholz Manuel <M.F...@ca...>wrote: > Hi,**** > > ** ** > > in my experience it’s the best way to create N-Grams for the Asian texts. > I think basic CJKAnalyzers already do it this way.**** > > ** ** > > Manuel**** > > ** ** > > *Von:* Vitaly Artemov [mailto:vit...@gm...] > *Gesendet:* Donnerstag, 22. November 2012 11:23 > *An:* clu...@li... > *Betreff:* Re: [CLucene-dev] Creating CLucene Index in a Database; > Support for Asian languages**** > > ** ** > > One more question about Asian languages: > I know that in Asian languages word boundaries are difficult issue. > How are you tokenize Asian texts? > Thank you, Vitaly**** > > On Thu, Nov 22, 2012 at 12:18 PM, Vitaly Artemov <vit...@gm...> > wrote:**** > > Thank you for your fast reply. > Can you please explain why Filesystem store better than Database. > We will use CLucene to index and search huge amount of data. > Vitaly**** > > ** ** > > On Thu, Nov 22, 2012 at 11:40 AM, Itamar Syn-Hershko <it...@co...> > wrote:**** > > inline**** > > ** ** > > On Thu, Nov 22, 2012 at 11:15 AM, Vitaly Artemov <vit...@gm...> > wrote:**** > > > Hello all, > I starting to evaluate Clucene engine for using in our product. > I have 2 questions. > > 1. Is It planned to add support(or it already exists) for creating index in > the Database instead of memory or filesystem? > I read that java Lucene has it by providing JdbcDirectory interface.** > ** > > ** ** > > Don't do that. Use the filesystem, it is much better for every aspect.**** > > **** > > > 2. I read in the FAQ that: > "CLucene is not limited to English, nor any other language. To index text > properly, you need to use an Analyzer appropriate for the language of the > text you are indexing. CLucene's default Analyzers work well for English. > There are a number of other Analyzers in "CLucene Sandbox", including > those for Chinese, Japanese, and Korean." > But "CLucene Sandbox" link is not works for some reason. Can you specify > link to Analyzers list?**** > > ** ** > > Take a look at CJKAnalyzer**** > > **** > > > Thanks in advance, Vitaly**** > > > > ------------------------------------------------------------------------------ > Monitor your physical, virtual and cloud infrastructure from a single > web console. Get in-depth insight into apps, servers, databases, vmware, > SAP, cloud infrastructure, etc. Download 30-day Free Trial. > Pricing starts from $795 for 25 servers or applications! > http://p.sf.net/sfu/zoho_dev2dev_nov > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers**** > > ** ** > > > > ------------------------------------------------------------------------------ > Monitor your physical, virtual and cloud infrastructure from a single > web console. Get in-depth insight into apps, servers, databases, vmware, > SAP, cloud infrastructure, etc. Download 30-day Free Trial. > Pricing starts from $795 for 25 servers or applications! > http://p.sf.net/sfu/zoho_dev2dev_nov > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers**** > > ** ** > > ** ** > > > ------------------------------------------------------------------------------ > Monitor your physical, virtual and cloud infrastructure from a single > web console. Get in-depth insight into apps, servers, databases, vmware, > SAP, cloud infrastructure, etc. Download 30-day Free Trial. > Pricing starts from $795 for 25 servers or applications! > http://p.sf.net/sfu/zoho_dev2dev_nov > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > > |
From: Freiholz M. <M.F...@ca...> - 2012-11-22 11:27:36
|
Hi, in my experience it's the best way to create N-Grams for the Asian texts. I think basic CJKAnalyzers already do it this way. Manuel Von: Vitaly Artemov [mailto:vit...@gm...] Gesendet: Donnerstag, 22. November 2012 11:23 An: clu...@li... Betreff: Re: [CLucene-dev] Creating CLucene Index in a Database; Support for Asian languages One more question about Asian languages: I know that in Asian languages word boundaries are difficult issue. How are you tokenize Asian texts? Thank you, Vitaly On Thu, Nov 22, 2012 at 12:18 PM, Vitaly Artemov <vit...@gm...<mailto:vit...@gm...>> wrote: Thank you for your fast reply. Can you please explain why Filesystem store better than Database. We will use CLucene to index and search huge amount of data. Vitaly On Thu, Nov 22, 2012 at 11:40 AM, Itamar Syn-Hershko <it...@co...<mailto:it...@co...>> wrote: inline On Thu, Nov 22, 2012 at 11:15 AM, Vitaly Artemov <vit...@gm...<mailto:vit...@gm...>> wrote: Hello all, I starting to evaluate Clucene engine for using in our product. I have 2 questions. 1. Is It planned to add support(or it already exists) for creating index in the Database instead of memory or filesystem? I read that java Lucene has it by providing JdbcDirectory interface. Don't do that. Use the filesystem, it is much better for every aspect. 2. I read in the FAQ that: "CLucene is not limited to English, nor any other language. To index text properly, you need to use an Analyzer appropriate for the language of the text you are indexing. CLucene's default Analyzers work well for English. There are a number of other Analyzers in "CLucene Sandbox", including those for Chinese, Japanese, and Korean." But "CLucene Sandbox" link is not works for some reason. Can you specify link to Analyzers list? Take a look at CJKAnalyzer Thanks in advance, Vitaly ------------------------------------------------------------------------------ Monitor your physical, virtual and cloud infrastructure from a single web console. Get in-depth insight into apps, servers, databases, vmware, SAP, cloud infrastructure, etc. Download 30-day Free Trial. Pricing starts from $795 for 25 servers or applications! http://p.sf.net/sfu/zoho_dev2dev_nov _______________________________________________ CLucene-developers mailing list CLu...@li...<mailto:CLu...@li...> https://lists.sourceforge.net/lists/listinfo/clucene-developers ------------------------------------------------------------------------------ Monitor your physical, virtual and cloud infrastructure from a single web console. Get in-depth insight into apps, servers, databases, vmware, SAP, cloud infrastructure, etc. Download 30-day Free Trial. Pricing starts from $795 for 25 servers or applications! http://p.sf.net/sfu/zoho_dev2dev_nov _______________________________________________ CLucene-developers mailing list CLu...@li...<mailto:CLu...@li...> https://lists.sourceforge.net/lists/listinfo/clucene-developers |
From: Vitaly A. <vit...@gm...> - 2012-11-22 10:22:55
|
One more question about Asian languages: I know that in Asian languages word boundaries are difficult issue. How are you tokenize Asian texts? Thank you, Vitaly On Thu, Nov 22, 2012 at 12:18 PM, Vitaly Artemov <vit...@gm...>wrote: > Thank you for your fast reply. > Can you please explain why Filesystem store better than Database. > We will use CLucene to index and search huge amount of data. > Vitaly > > > On Thu, Nov 22, 2012 at 11:40 AM, Itamar Syn-Hershko <it...@co...>wrote: > >> inline >> >> >> On Thu, Nov 22, 2012 at 11:15 AM, Vitaly Artemov <vit...@gm... >> > wrote: >> >>> >>> Hello all, >>> I starting to evaluate Clucene engine for using in our product. >>> I have 2 questions. >>> >>> 1. Is It planned to add support(or it already exists) for creating index >>> in >>> the Database instead of memory or filesystem? >>> I read that java Lucene has it by providing JdbcDirectory interface. >>> >> >> Don't do that. Use the filesystem, it is much better for every aspect. >> >> >>> >>> 2. I read in the FAQ that: >>> "CLucene is not limited to English, nor any other language. To index >>> text >>> properly, you need to use an Analyzer appropriate for the language of the >>> text you are indexing. CLucene's default Analyzers work well for English. >>> There are a number of other Analyzers in "CLucene Sandbox", including >>> those for Chinese, Japanese, and Korean." >>> But "CLucene Sandbox" link is not works for some reason. Can you >>> specify >>> link to Analyzers list? >>> >> >> Take a look at CJKAnalyzer >> >> >>> >>> Thanks in advance, Vitaly >>> >>> >>> ------------------------------------------------------------------------------ >>> Monitor your physical, virtual and cloud infrastructure from a single >>> web console. Get in-depth insight into apps, servers, databases, vmware, >>> SAP, cloud infrastructure, etc. Download 30-day Free Trial. >>> Pricing starts from $795 for 25 servers or applications! >>> http://p.sf.net/sfu/zoho_dev2dev_nov >>> _______________________________________________ >>> CLucene-developers mailing list >>> CLu...@li... >>> https://lists.sourceforge.net/lists/listinfo/clucene-developers >>> >>> >> >> >> ------------------------------------------------------------------------------ >> Monitor your physical, virtual and cloud infrastructure from a single >> web console. Get in-depth insight into apps, servers, databases, vmware, >> SAP, cloud infrastructure, etc. Download 30-day Free Trial. >> Pricing starts from $795 for 25 servers or applications! >> http://p.sf.net/sfu/zoho_dev2dev_nov >> _______________________________________________ >> CLucene-developers mailing list >> CLu...@li... >> https://lists.sourceforge.net/lists/listinfo/clucene-developers >> >> > |
From: Vitaly A. <vit...@gm...> - 2012-11-22 10:19:07
|
Thank you for your fast reply. Can you please explain why Filesystem store better than Database. We will use CLucene to index and search huge amount of data. Vitaly On Thu, Nov 22, 2012 at 11:40 AM, Itamar Syn-Hershko <it...@co...>wrote: > inline > > > On Thu, Nov 22, 2012 at 11:15 AM, Vitaly Artemov <vit...@gm...>wrote: > >> >> Hello all, >> I starting to evaluate Clucene engine for using in our product. >> I have 2 questions. >> >> 1. Is It planned to add support(or it already exists) for creating index >> in >> the Database instead of memory or filesystem? >> I read that java Lucene has it by providing JdbcDirectory interface. >> > > Don't do that. Use the filesystem, it is much better for every aspect. > > >> >> 2. I read in the FAQ that: >> "CLucene is not limited to English, nor any other language. To index text >> properly, you need to use an Analyzer appropriate for the language of the >> text you are indexing. CLucene's default Analyzers work well for English. >> There are a number of other Analyzers in "CLucene Sandbox", including >> those for Chinese, Japanese, and Korean." >> But "CLucene Sandbox" link is not works for some reason. Can you >> specify >> link to Analyzers list? >> > > Take a look at CJKAnalyzer > > >> >> Thanks in advance, Vitaly >> >> >> ------------------------------------------------------------------------------ >> Monitor your physical, virtual and cloud infrastructure from a single >> web console. Get in-depth insight into apps, servers, databases, vmware, >> SAP, cloud infrastructure, etc. Download 30-day Free Trial. >> Pricing starts from $795 for 25 servers or applications! >> http://p.sf.net/sfu/zoho_dev2dev_nov >> _______________________________________________ >> CLucene-developers mailing list >> CLu...@li... >> https://lists.sourceforge.net/lists/listinfo/clucene-developers >> >> > > > ------------------------------------------------------------------------------ > Monitor your physical, virtual and cloud infrastructure from a single > web console. Get in-depth insight into apps, servers, databases, vmware, > SAP, cloud infrastructure, etc. Download 30-day Free Trial. > Pricing starts from $795 for 25 servers or applications! > http://p.sf.net/sfu/zoho_dev2dev_nov > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > > |
From: Itamar Syn-H. <it...@co...> - 2012-11-22 09:40:12
|
inline On Thu, Nov 22, 2012 at 11:15 AM, Vitaly Artemov <vit...@gm...>wrote: > > Hello all, > I starting to evaluate Clucene engine for using in our product. > I have 2 questions. > > 1. Is It planned to add support(or it already exists) for creating index in > the Database instead of memory or filesystem? > I read that java Lucene has it by providing JdbcDirectory interface. > Don't do that. Use the filesystem, it is much better for every aspect. > > 2. I read in the FAQ that: > "CLucene is not limited to English, nor any other language. To index text > properly, you need to use an Analyzer appropriate for the language of the > text you are indexing. CLucene's default Analyzers work well for English. > There are a number of other Analyzers in "CLucene Sandbox", including > those for Chinese, Japanese, and Korean." > But "CLucene Sandbox" link is not works for some reason. Can you specify > link to Analyzers list? > Take a look at CJKAnalyzer > > Thanks in advance, Vitaly > > > ------------------------------------------------------------------------------ > Monitor your physical, virtual and cloud infrastructure from a single > web console. Get in-depth insight into apps, servers, databases, vmware, > SAP, cloud infrastructure, etc. Download 30-day Free Trial. > Pricing starts from $795 for 25 servers or applications! > http://p.sf.net/sfu/zoho_dev2dev_nov > _______________________________________________ > CLucene-developers mailing list > CLu...@li... > https://lists.sourceforge.net/lists/listinfo/clucene-developers > > |
From: Vitaly A. <vit...@gm...> - 2012-11-22 09:15:23
|
Hello all, I starting to evaluate Clucene engine for using in our product. I have 2 questions. 1. Is It planned to add support(or it already exists) for creating index in the Database instead of memory or filesystem? I read that java Lucene has it by providing JdbcDirectory interface. 2. I read in the FAQ that: "CLucene is not limited to English, nor any other language. To index text properly, you need to use an Analyzer appropriate for the language of the text you are indexing. CLucene's default Analyzers work well for English. There are a number of other Analyzers in "CLucene Sandbox", including those for Chinese, Japanese, and Korean." But "CLucene Sandbox" link is not works for some reason. Can you specify link to Analyzers list? Thanks in advance, Vitaly |
From: Kostka B. <ko...@to...> - 2012-11-15 15:23:06
|
Hi, ConstantScorer destructor deletes bits returned by filter->bits() method (called in constructor) without checking if filter->shouldDeleteBitSet() returns true. This causes double deletion of bits in case the filter is derived from AbstractCachingFilter. Proposed fix (in constantscorequery.cpp) attached |
From: Veit J. <nun...@go...> - 2012-10-25 20:47:16
|
Hi Paul, the Misc.h is part of the sublibrary clucene-shared. Looking into the CMakeLists.txt of this sublibrary, I discovered that the install command for adding the headers ist missing. I guess, somehing similar as in the CMakeLists.txt of the core (lines 224--233) has to be added to the CMakeLists.txt of shared. But I cannot try it in the moment. Maybe tomorrow. Best regards, Veit |
From: Paul G. | P. I. <pa...@pa...> - 2012-10-25 08:04:21
|
LS, We are trying to build/compile clucene for php ( http://pecl.php.net/package/clucene ) on a centos server ( 64bit ). For this we had first installed the clucene-core and clucene-core-devel packages. But no luck all errors. So we decided to build clucene ourselves. This went ok, but the compiling of clucene for php again failed. I think it has to do that there is no /usr/local/include/CLucene/util/Misc.h, as the error below states “’Misc’ has not been declared”. Can anybody help us out and see what is going wrong or how we can build the clucene including this Misc.h? The way we build clucene was Download tar.gz, unpacked Cd clucence mkdir build && cd build cmake .. make make install Then we went to clucene-0.0.9 ( pecl package ) Phpize ./configure Make ( here it went wrong see errors below ) /var/tmp/clucene/clucene.cpp:56: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp: In function âvoid zim_IndexSearcher___construct(int, zval*, zval**, zval*, int)â: /var/tmp/clucene/clucene.cpp:219: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:220: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:226: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:227: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:227: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:233: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:233: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:241: error: âMiscâ has not been declared /var/tmp/clucene/clucene.cpp: In function âvoid zim_IndexSearcher_search(int, zval*, zval**, zval*, int)â: /var/tmp/clucene/clucene.cpp:256: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:261: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:266: error: âMiscâ has not been declared /var/tmp/clucene/clucene.cpp: In function âvoid zim_IndexSearcher_close(int, zval*, zval**, zval*, int)â: /var/tmp/clucene/clucene.cpp:286: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp: In function âvoid zim_Hits___construct(int, zval*, zval**, zval*, int)â: /var/tmp/clucene/clucene.cpp:310: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:311: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:313: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:314: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:314: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:320: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:320: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp: In function âvoid zim_Hits_length(int, zval*, zval**, zval*, int)â: /var/tmp/clucene/clucene.cpp:331: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:332: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:332: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp: In function âvoid zim_Hits_get(int, zval*, zval**, zval*, int)â: /var/tmp/clucene/clucene.cpp:366: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:367: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:367: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:375: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:380: error: âMiscâ has not been declared /var/tmp/clucene/clucene.cpp:382: error: âMiscâ has not been declared /var/tmp/clucene/clucene.cpp: In function âvoid zim_Hits_id(int, zval*, zval**, zval*, int)â: /var/tmp/clucene/clucene.cpp:414: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:415: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:415: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:424: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp: In function âvoid zim_Hits_score(int, zval*, zval**, zval*, int)â: /var/tmp/clucene/clucene.cpp:449: warning: deprecated conversion from string constant to âchar*â /var/tmp/clucene/clucene.cpp:450: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:450: warning: âvoid php_set_error_handling(zend_error_handling_t, zend_class_entry*)â is deprecated (declared at /usr/include/php/main/php.h:292) /var/tmp/clucene/clucene.cpp:459: warning: deprecated conversion from string constant to âchar*â make: *** [clucene.lo] Error 1 Regards, Paul Groeneweg |
From: Ahmed <ci7...@gm...> - 2012-10-08 09:33:40
|
<html style="direction: ltr;"> <head> <meta content="text/html; charset=ISO-8859-1" http-equiv="Content-Type"> <style type="text/css">body p { margin-bottom: 0cm; margin-top: 0pt; } </style> </head> <body style="direction: ltr;" bidimailui-detected-decoding-type="latin-charset" bgcolor="#FFFFFF" text="#000000"> <div class="moz-cite-prefix">Yes, i'm using a filter<br> <br> Le 08/10/2012 06:48, Veit Jahns a écrit :<br> </div> <blockquote cite="mid:CALGR=ePG...@ma..." type="cite"> <pre wrap="">Hi Ahmed! 2012/10/5 Ahmed <a class="moz-txt-link-rfc2396E" href="mailto:ci7...@gm..."><ci7...@gm...></a>: </pre> <blockquote type="cite"> <pre wrap="">Hi Sometimes when i search in my index i get this exception "bit out of range", what does it mean? </pre> </blockquote> <pre wrap=""> Its the index out of boundaries error from the BitSet class. Do you use filters for searching? Best regards Veit ------------------------------------------------------------------------------ Don't let slow site performance ruin your business. Deploy New Relic APM Deploy New Relic app performance management and know exactly what is happening inside your Ruby, Python, PHP, Java, and .NET app Try New Relic at no cost today and get our sweet Data Nerd shirt too! <a class="moz-txt-link-freetext" href="http://p.sf.net/sfu/newrelic-dev2dev">http://p.sf.net/sfu/newrelic-dev2dev</a> _______________________________________________ CLucene-developers mailing list <a class="moz-txt-link-abbreviated" href="mailto:CLu...@li...">CLu...@li...</a> <a class="moz-txt-link-freetext" href="https://lists.sourceforge.net/lists/listinfo/clucene-developers">https://lists.sourceforge.net/lists/listinfo/clucene-developers</a> </pre> </blockquote> <br> </body> </html> |
From: Veit J. <nun...@go...> - 2012-10-08 06:48:53
|
Hi Ahmed! 2012/10/5 Ahmed <ci7...@gm...>: > Hi > Sometimes when i search in my index i get this exception "bit out of range", > what does it mean? Its the index out of boundaries error from the BitSet class. Do you use filters for searching? Best regards Veit |
From: Ahmed <ci7...@gm...> - 2012-10-05 08:54:09
|
<html style="direction: ltr;"> <head> <meta http-equiv="content-type" content="text/html; charset=ISO-8859-1"> <style type="text/css">body p { margin-bottom: 0cm; margin-top: 0pt; } </style> </head> <body style="direction: ltr;" bidimailui-detected-decoding-type="latin-charset" bgcolor="#FFFFFF" text="#000000"> Hi<br> Sometimes when i search in my index i get this exception "bit out of range", what does it mean?<br> thank you<br> </body> </html> |
From: Veit J. <nun...@go...> - 2012-08-15 09:58:55
|
Hi Atul! 2012/8/15 Atul Kulkarni <atu...@gm...>: > Hi All, > > I recently stumbled upon CLucene and am wondering if this is actively > developed anymore? CLucene is still active---more or less... For myself, I'm very busy with other projects distracting me from working on CLucene. > Also, is there a TODO list that I can look at to pickup a > bug or something like a bug tracker to help me pickup something simple to > start with? Firstly, there are some open tickets provided from users: http://sourceforge.net/tracker/?group_id=80013&atid=558446 Secondly, we also started to update CLucene to the one of the latest Lucene version. This port was supposed to be based on Lucene++, which is also a Lucene port. But it makes a heavy use of shared pointer. And we once wanted to make this the new CLucene, but reduce the usage of shared pointer and make it as fast as possible. Lucene++ is hosted at GitHub: https://github.com/luceneplusplus/LucenePlusPlus. Some tests and improvements are already done by Sergey. Can be found an Github also: https://github.com/drigh/LucenePlusPlus/ >From my point of view, it would be better to start with Lucene++. If you are intersted, you are invited to work on one of the issues. And I think I can spent some time to support, at least for answering your questions. Best regards, Veit |
From: Atul K. <atu...@gm...> - 2012-08-14 23:50:21
|
Hi All, I recently stumbled upon CLucene and am wondering if this is actively developed anymore? Also, is there a TODO list that I can look at to pickup a bug or something like a bug tracker to help me pickup something simple to start with? -- Regards, Atul Kulkarni |
From: Veit J. <nun...@go...> - 2012-07-20 22:22:55
|
Hi Mike, _search() is overloaded. There is also an implementation that takes a pointer to a Sort object as argument. I would look there, if these classes fits your need. But this depends also how do you put the dates in your index. As ISO 8601 or similar? Best regards, Veit |
From: Mike A. <mi...@au...> - 2012-07-20 11:42:15
|
I am doing a query and want to sort all the matched data by one of the fields (date). Whats the best way of doing this ? The search will potentially match a lot of rows - so for performance I've subclassed HitCollector and am adding all the returned documentIDs to a list of integers if that makes any difference : reader = IndexReader::open(...); q = QueryParser::parse(...); s = new IndexSearcher(reader); s->_search(q, NULL, &hc); class myHitCollector : public HitCollector { private : list<int32_t> hitList; public: int hitListCount; myHitCollector() : HitCollector() { clear(); hitListCount=0; } void clear() { hitList.clear(); } void push(int32_t doc) { hitList.push_back(doc); hitListCount++; } int pop() { int32_t doc=hitList.front(); hitList.pop_front(); hitListCount--; return doc; } void collect(const int32_t doc, const float_t score) { push(doc); } }; again - is this the best way ? Any help much appreciated! |