Showing 84 open source projects for "search engine project on java"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Vespa

    Vespa

    The open big data serving engine

    Make AI-driven decisions using your data, in real-time. At any scale, with unbeatable performance. Vespa is a full-featured text search engine and supports both regular text search and fast approximate vector search (ANN). This makes it easy to create high-performing search applications at any scale, whether you want to use traditional techniques or a modern vector-based approach. You can even combine both approaches efficiently in the same query, something no other engine can do....
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    Firebird

    Firebird

    Firebird server, client and tools

    ...It has been used in production systems, under a variety of names, since 1981. To enhance the Firebird functionality, IBSurgeon has sponsored the development and now released for public use the free open source "IBSurgeon Full Text Search UDR" to perform full-text search queries within SQL and PSQL. UDR works with Firebird 3 and 4, for Windows, there are ready-to-use binaries, for Linux, it is necessary to build the UDR. The UDR is based on Lucene++ engine, with all the powerful features required for full-text search and with very fast performance (build as native C++ library). ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    Skiplist-CPP

    Skiplist-CPP

    A tiny KV storage based on skiplist written in C++ language

    Skiplist-CPP is a lightweight key-value storage engine implemented in C++ using a skip list as its core data structure. It showcases how a log-structured, ordered index can deliver fast inserts, lookups, and deletes while remaining simple to implement and reason about. The project supplies a compact codebase with a clear separation between the skip list implementation and the storage operations that use it. Beyond in-memory operations, it demonstrates persistence through data dump to disk...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    CDC Connectors for Apache Flink

    CDC Connectors for Apache Flink

    CDC Connectors for Apache Flink

    This project provides a set of source connectors for Apache Flink® directly ingesting changes coming from different databases using Change Data Capture(CDC). CDC Connectors for Apache Flink® is a set of source connectors for Apache Flink®, ingesting changes from different databases using change data capture (CDC). CDC Connectors for Apache Flink® integrates Debezium as the engine to capture data changes. So it can fully leverage the ability of Debezium. Supports reading database snapshot and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    GeoTools, the Java GIS toolkit

    GeoTools, the Java GIS toolkit

    Toolkit for working with and mapping geospatial data

    GeoTools is an open source (LGPL) Java code library which provides standards compliant methods for the manipulation of geospatial data. GeoTools is an Open Source Geospatial Foundation project. The GeoTools library data structures are based on Open Geospatial Consortium (OGC) specifications.
    Leader badge
    Downloads: 205 This Week
    Last Update:
    See Project
  • 6
    Hibernate

    Hibernate

    An object relational-mapping (ORM) library for Java

    The Hibernate projects offer a suite of powerful Java libraries to work with data. It is best known for Hibernate ORM, which provides relational persistence for Java models and is an implementation of the Jakarta Persistence specification. Hibernate projects do not consistently release binaries or documentation to SourceForge anymore. For up-to-date information, refer to the Hibernate website: * Hibernate ORM: https://hibernate.org/orm/ * Hibernate Validator:...
    Leader badge
    Downloads: 349 This Week
    Last Update:
    See Project
  • 7

    WhiteBear

    a set of foundation classes for database engines

    WhiteBear is a set of foundation classes for database engines written in Java. It supports object and relational database, is highly portable and may be used in embedded or server environment. Project has been created from scratch and plan to includes storage engine, server, SQL interpreter, ODBC, JDBC and JDO interface
    Leader badge
    Downloads: 192 This Week
    Last Update:
    See Project
  • 8
    Firebird

    Firebird

    A powerful, cross platform, SQL database system

    Firebird RDBMS offers ANSI SQL features & runs on Linux, Windows & several Unix platforms. Features excellent concurrency & performance & powerful stored procedure & trigger language. More information and downloads at https://firebirdsql.org/, issue tracker is https://github.com/FirebirdSQL/firebird/issues
    Leader badge
    Downloads: 3,642 This Week
    Last Update:
    See Project
  • 9
    Conductor

    Conductor

    Conductor is a microservices orchestration engine

    We built Conductor to help us orchestrate microservices-based process flows at Netflix with the following features, a distributed server ecosystem, which stores workflow state information efficiently. Allow creation of process/business flows in which each individual task can be implemented by the same/different microservices. Simple interface to connect workers, which execute the tasks in workflows. Full operational control over workflows with the ability to pause, resume, restart, retry and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Aestel

    Aestel

    Applications for data management

    "Information is data in action", and, consequently, having good quality data is essential. The AESTEL package contains two highly configurable applications for data management: A data loader and a reporting application, i.e. DataLoader and AEREA, respectively. The data loader application applies user-defined instructions to validate, process and load data. The reporting application provides a query builder and spreadsheet template designer. Both applications work with any relational data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    eXist-db

    eXist-db

    eXist-db is a feature rich Open Source native XML database

    eXist-db is a native XML database featuring efficient, index-based XQuery processing, extensions for keyword search, XUpdate support, XSLT support, XForms support, REST and tight integration with existing XML development tools. Moved to Github - https://www.github.com/exist-db/exist
    Downloads: 12 This Week
    Last Update:
    See Project
  • 12
    The Lemur Project

    The Lemur Project

    Search engine and data mining applications and ClueWeb datasets.

    The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software, including the Indri search engine in C++, the Galago search engine research framework in Java, the RankLib learning to rank library, ClueWeb09 and ClueWeb12 datasets and the Sifaka data mining application.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 13
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    DataCleaner

    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a solution platform for DQ solutions. It's core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    COPE is a persistence framework for Java. The persistent schema is specified within Java source code, no external files are needed. This project has been moved to github: http://exedio.github.io/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    ProC 3.0

    smart Workflow Engine

    ProC 3.0 is a scientific workflow engine to build, manage and execute workflows (pipelines) in heterogeneous environments, supporting GRID and other means of parallel processing. It includes a data management component (DMC) to transparently access databases for storage of results and automatically adds metadata to track the processing of data products, so that at every time a full processing history is available. The software was developed and used within the ESA Planck satellite mission.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    CommandCentre

    CommandCentre

    Command Centre

    Free to join for developers who support the Free Software Foundation cause. This is my research project from 2003, which I recently updated with acceleration- and console changes (Surround Sound, Doppler Effects, Device Updates, Physics) in a long weekend but project should be considered abandoned. Additionally, the Performer libraries have been abandoned by Silicon Graphics. Minor setup- and make reconfiguration bugs, no changes planned. The Command Centre is a Simulation Programming...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Conserve
    Conserve is no longer hosted at SourceForge, we have moved to GitHub: https://github.com/ConserveORM/main
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    ldi

    Lucene Oracle Integration using Data Cartridge API

    Lucene Domain Index is full integration of Lucene project running inside the Oracle database using Oracle JVM. The integration provides a transparent detection of row data changes and an SQL layer for doing search.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    LightAdmin

    Pluggable data administration UI interface for Java web applications

    The primary goal of the project is to speed up application development by bringing pluggable fully operational data management back-end for JPA based applications and to relieve your codebase for more important stuff.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Animal Shelter Manager is a complete computer solution for animal sanctuaries and shelters. Features complete animal management, document generation, full reporting, charts, internet publishing, pet search engine integration and more.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 22
    CLucene is a C++ port of Lucene: the high-performance, full-featured text search engine written in Java. CLucene is faster than lucene as it is written in C++.
    Leader badge
    Downloads: 111 This Week
    Last Update:
    See Project
  • 23
    Framework (scripts, configuration, code) to build free and public services around travel and leisure data. That project makes an extensive use of already existing data sources such as Geonames and dbPedia, and adds some glue around those (eg, links).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    SQL CSV Editor

    Allows the editing of CSV files using SQL commands

    This program allows a user to modify a CSV file using SQL commands. This program uses a temporary H2 database to contain the CSV file, and accept SQL commands. You can then save this back to a CSV file without any data being distorted. This project only supports CSV comma delimited files. Currently any other CSV files are not supported. When editing the CSV you are sending SQL statements to a H2 database with a single table named CSV. Example command: DELETE FROM CSV WHERE...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Odalis

    A PHP/mySQL framework to manage any kind of processes

    Odalis - the Open Data Linking System - is an Information Management Framework for professional use. Manage any kind of processes and move data from all different sources to any destinations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB