Release 0.2.0 - 20 Feb 2004
This release brings major reorganization of the code, grouping classes
into larger modules instead of the original Java style, as well as
rewriting several of the classes to be more Pythonic, removing
extraneous data structures and so forth; overall, the code has been
reduced by 20%. The public interface, indexer.py, has not changed;
other classes have not been changed significantly, other than being
moved to new modules.... read more
A minor release with mainly a bug fix in the indexer.Index wrapper.
Changes:
Changed default mergeFactor from 9 to 20 for better performance
Fixed the example in simple.py to use a Keyword for filename, instead of
stored, instead of a tokenized and stored Text field.
Tidied up SegmentInfos and FieldInfos to be more Pythonic.
Call close() on open searcher in indexer.Index.setupIndexer.
Minor release with fix for a windows-only bug.
Also indexer.Index now allow you to set the mergeFactor
for tuning Lupy's speed/space tradeoff.
-A-
Some minor changes to appease Python 2.3, although certain a couple of warnings about bit operations persisit. I'd appreciate help in clearing these up.
This release breaks some code:
Instead of field.Field.Keyword() you now use field.Keyword()
If you are using the Indexer wrapper, searches are now more accurate since it tokenizes the query first.
Performance improvements if you are using the indexer.Index wrapper.
Splitter is another Analyzer/Tokenizer derived from David Mertz's article in IBM Developerworks. It is faster than the existing Analyzer and more Pythonic, but it is *not* unicode. The indexer wrapper continues to use the Unicode tokenizer.
-A-
This realease fixes a bug introduced with the Unicode upgrade.
There is also a new wrapper to make indexing and search even easier.
from lupy.indexer import Index
i = Index('myindex', create=True)
i.index(text='some test text for Lupy')
i.find('"text text")
i.close()
Thanks to Martin Elaisson in Sweden, Lupy now correctly handles unicode.
Skl!