tag for release 2.18
update for RankLib 2.18 release
Added tag release-3.22 for changeset 9e50d2bd5aa6
Add ability to index Anserini's JSON format corpus file, bump release number
Update license
tag for release 2.17
update for RankLib 2.17 release
update for RankLib 2.17 release
Release 3.21 changes
Added tag release-3.21 for changeset 7843f2dbedad
Galago
Galago
Wiki Documentation can not be read Offline
Please see the new PDF file of Galago documentation in the Files tab, Documentation folder.
tag for release 2.16
update for RankLib 2.16 release
Removed tag release-20
Added tag release-3.20 for changeset da766f557e7f
Added tag release-20 for changeset 380da101eb18
Fix help text
Release 3.20 pom changes
Home
Updates to readme file
Galago
Galago
Galago
Galago
Galago
Changes to batch-search-with-reranker
Add batch-search-with-reranker
Add batch-search-with-reranker
Update the release of dependency-check
Pin Apache Thrift to 0.12.0
Martin, The SureFire output indicated the ProxyRetrievalTest failed with an unexpected exception, but to figure out what happened please run the "mvn package" step again with the -X flag. Capture the output in a file, and in that file find the part that starts with Running org.lemurproject.galago.core.retrieval.ProxyRetrievalTest and send me that, please. Thanks, Greg On Sun, Apr 25, 2021 at 5:59 AM Martin Vahi martinvahi@users.sourceforge.net wrote: Supposedly this file contains more information....
Added tag release-3.19 for changeset 42c6c5a9e4b4
Release 3.19 pom changes
tag for release 2.15
create tag directory
update for 2.15 release
Improve index version information
Update snowball and add arabic and tamil stemmers
Update snowball and add arabic and tamil stemmers
Thanks, Anil
Adding tag directory
update for 2.14 release
tag and release 3.18
Added tag release-3.18 for changeset 40d0a8780f4c
Alicia, galago doc produces un-stemmed terms. For dump-term-stats, use postings to get un-stemmed terms, use postings.krovetz or postings.porter to get stemmed terms: galago dump-term-stats indexes/myindex/postings | grep mckean mckean’s 1 1 mckeand 13 3 mckeans 9 7 mckean 128 59 mckeands 4 2 galago dump-term-stats indexes/myindex/postings.porter | grep mckean mckeand 17 3 mckean’ 1 1 mckean 137 60 galago dump-term-stats indexes/myindex/postings.krovetz | grep mckean mckean’s 1 1 mckeand 17 3 mckean...
Alicia, Yes, Galago lower-cases the terms and removes punctuation when it indexes. No, Galago does not have an option to return the tokens as a simple vector like you described. You will have to write a script to create the vector from the output you get from the DOC function. Greg