The ApacheLucene™ project develops open-source search software. The project releases a core search library, named Lucene™ core, as well as PyLucene, a Python binding for Lucene. Lucene Core is a Java library providing powerful indexing and search features, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. The PyLucene sub-project provides Python bindings for Lucene Core. The Apache Software Foundation provides support for the Apache community of open...
Lucene/Solr based search engine and workflow system
Important: This project has been moved to https://github.com/statsbiblioteket/summa/
Lucens (and Solr) based search engine with very flexible setup and workflow system. It supports incremental updates, hierarchical faceting and index lookup with low memory overhead.
Note: Although Summa is open source, the focus is on features used at Statsbiblioteket. No explicit resources has been allocated for support of external users.
This package contains different tools to add NLP capabilities for Lucene 4.x (it has been tested using Lucene version from 4.6.x to 4.8.1). Although it was originally developed for German, it is, mostly, language independent.
It allows the user to lemmatize words to be indexed, to weight termy ba their parts of speech (e.g. weighting nouns mor hevaily than pronouns), and to add synonyms taken from GermaNet or a list you provide to the search index and thereby increase recall of lucene.
We are making the Lucene Korean Analyzer. The Korean Language dictionary is most important element in Lucene Korean Analyzer. We are making the Korean Dictionary for lucene korean analyzer also.
216,000+ customers in over 135 countries grow their businesses with HubSpot
Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.
A Java implementation of a desktop search engine based on ApacheLucene. It indexes HTML-, XML-, OpenOffice- (Writer, Calc, Impress), MS Word-, and PDF- documents as well as plain text files. For other, arbitrary file types the file name can be indexed.
TiTLi: A Google-like, full-text search tool for relational databases. It is very fast and independent of vendor specific RDBMS's like mysql, oracle etc. It uses ApacheLucene for indexing and provides some advanced features like similarity search.
A searcher and indexer to allow easy and fast locating of relevent information from a large collection of research papers. A Java backend with a web based frontend. Based on the Lucene indexer and searcher