


File              Date        Author           Commit
lib               2014-07-19  Emmanuel Keller  [6f96ca] IcePDF patch for Open-JDK7
licenses          2013-06-10  Emmanuel Keller  [9d30f3] - Upgrade CXF to 2.7.5
shell             2014-03-29  Emmanuel Keller  [f34087] Support of specific classpath for external parser
src               2 days ago  Emmanuel Keller  [d09a2c] Implements #1233. Few sync improvement in sched...
.gitignore        2014-06-28  Emmanuel Keller  [bb88d8] Implements #710. Adding new unit tests.
CHANGELOG.txt.gz  2014-10-14  Emmanuel Keller  [a2c223] 1.5.8 change log
NOTICE.txt        2013-06-10  Emmanuel Keller  [9d30f3] - Upgrade CXF to 2.7.5
README.md         2014-06-29  Emmanuel Keller  [1be1c3] Update readme links
build.xml         2014-07-27  Emmanuel Keller  [ee4138] Final RPM
oss_logo.png      2010-12-21  Emmanuel Keller  [ce4b39] Work in progress on this new features:
oss_team.jpg      2009-10-29  Emmanuel Keller  [e9b776] New feature 2888788: Upgrade Lucene to release ...
pom.xml           2014-10-28  Emmanuel Keller  [97547f] Fix CXF pom

Read Me

OpenSearchServer

http://www.opensearchserver.com

Copyright Emmanuel Keller / Jaeksoft (2008-2013)
This software is licensed under the GPL v3.

OpenSearchServer is a powerful, enterprise-class search engine. Using the web user interface, the crawlers (web, file, database, ...) and the RESTful API, you can quickly and easily integrate advanced full-text search capabilities into your application. OpenSearchServer runs on Linux/Unix/BSD/Windows.

Quickstart

One requirement

You need a Java 6 (or newer) runtime on your server.
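The requirement above can be checked before going further. The following is a minimal sketch, not part of the OpenSearchServer distribution: the function names `java_major`, `installed_java_version` and `check_java` are hypothetical, and the parsing assumes the usual `java -version` output, which prints a quoted version string such as "1.6.0_45" or "11.0.2" on stderr.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with OpenSearchServer).

# Extract the major Java version from a version string such as
# "1.6.0_45" (Java 6) or "11.0.2" (Java 11).
java_major() {
  v=$1
  case "$v" in
    1.*) v=${v#1.} ;;   # legacy "1.x" scheme: the major version is the second field
  esac
  # Keep everything before the first '.', '_' or '-'.
  echo "${v%%[._-]*}"
}

# Read the quoted version from `java -version` (written to stderr).
installed_java_version() {
  java -version 2>&1 | sed -n 's/.*version "\([^"]*\)".*/\1/p' | head -n 1
}

# Succeed only if a Java runtime of version 6 or newer is on the PATH.
check_java() {
  if ! command -v java >/dev/null 2>&1; then
    echo "java not found on PATH" >&2
    return 1
  fi
  major=$(java_major "$(installed_java_version)")
  if [ "${major:-0}" -ge 6 ]; then
    echo "Java $major detected: OK"
  else
    echo "Java 6 or newer is required" >&2
    return 1
  fi
}
```

After sourcing these functions, running `check_java` reports whether the installed runtime meets the requirement.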

Download the latest ZIP or TAR.GZ archive:

http://www.opensearchserver.com/#download

Extract the archive to get the following files:

  • FILE opensearchserver.jar -> the main library
  • FILE README.md -> this file
  • DIR data -> will contain your index
  • DIR server -> will contain the server files
  • FILE start.sh -> Shell script to start the server on Unix
  • FILE start.bat -> Batch file to start the server on Windows
  • FILE NOTICE.txt -> the third-party license information
  • DIR LICENSES -> contains the detailed licenses

Edit the parameters

Optionally, you can change the following parameters in the start.sh/start.bat script:
- The allowed memory size
- The TCP port (9090 by default)

Start the server

cd opensearchserver
./start.sh

Use the web interface and/or the API

http://localhost:9090
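The server may take a moment to start listening. As a rough sketch, a script can poll the address above until it answers. The helper names `server_url` and `wait_for_server` are hypothetical, the retry count and delay are arbitrary defaults, and the sketch assumes `curl` is installed; only the default port 9090 comes from this README.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with OpenSearchServer).

# Build the base URL of the server; 9090 is the default port from this README.
server_url() {
  host=${1:-localhost}
  port=${2:-9090}
  echo "http://${host}:${port}"
}

# Poll the URL until it returns any HTTP response or the attempts run out.
# Usage: wait_for_server URL [ATTEMPTS] [DELAY_SECONDS]
wait_for_server() {
  url=$1
  attempts=${2:-30}
  delay=${3:-2}
  i=0
  while [ "$i" -lt "$attempts" ]; do
    # curl exits with 0 as soon as the server sends back a response.
    if curl -s -o /dev/null --max-time 2 "$url"; then
      return 0
    fi
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}
```

For example, `wait_for_server "$(server_url)" && echo "server is up"` waits for the default address. If the TCP port was changed in start.sh/start.bat, pass the host and port explicitly, as in `server_url localhost 8080`.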

Features

Search functions

  • Advanced full-text search features
  • Phonetic search
  • Advanced boolean search with query language
  • Clustered results with faceting and collapsing
  • Filter search using sub-requests (including negative filters)
  • Geolocation
  • Spell-checking
  • Relevance customization
  • Search suggestion facility (auto-completion)

Indexing

  • Supports 18 languages
  • Fields schema with analyzers in each language
  • Several filters: n-gram, lemmatization, shingle, diacritic stripping, …
  • Automatic language recognition
  • Named entity recognition
  • Word synonyms and expression synonyms
  • Export indexed terms with frequencies
  • Automatic classification

Supported documents

  • HTML / XHTML
  • MS Office documents (Word, Excel, PowerPoint, Visio, Publisher)
  • OpenOffice documents
  • Adobe PDF (with OCR)
  • RTF, plain text
  • Audio file metadata (WAV, MP3, AIFF, Ogg)
  • Torrent files
  • OCR over images

Crawlers

  • The web crawler for internet, extranet and intranet
  • The file system crawler for local and remote files (NFS, SMB/CIFS, FTP, FTPS, SWIFT)
  • The database crawler for all JDBC databases (MySQL, PostgreSQL, Oracle, SQL Server, …)
  • Inclusion and exclusion filters with wildcards
  • Session parameters removal
  • SQL join and linked files support
  • Screenshot capture
  • Sitemap import

General

  • REST API (XML and JSON)
  • SOAP Web Service
  • Monitoring module
  • Index replication
  • Scheduler for management of periodic tasks
  • WordPress plugin and Drupal module