Showing 18 open source projects for "index and search"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 1
    PaperQA2

    PaperQA2

    High accuracy RAG for answering questions from scientific documents

    ... search index, and finally answer the user question with an LLM agent.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    IMS Open Corpus Workbench

    IMS Open Corpus Workbench

    Indexing and query tools for very large text corpora

    The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP, which can be used interactively in a terminal session, as a backend e.g. from a Perl script, or through the Web-based GUI CQPweb.
    Leader badge
    Downloads: 44 This Week
    Last Update:
    See Project
  • 3
    libpostal

    libpostal

    A C library for parsing/normalizing street addresses around the world

    ..., reviews). Yet even the simplest addresses are packed with local conventions, abbreviations and context, making them difficult to index/query effectively with traditional full-text search engines. This library helps convert the free-form addresses that humans use into clean normalized forms suitable for machine comparison and full-text indexing. Though libpostal is not itself a full geocoder, it can be used as a preprocessing step to make any geocoding application smarter, and simpler.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4

    DWDS/Dialing Concordance

    a collection of indexing and search tools for corpus linguists

    DWDS/Dialing Concordance (DDC) - a collection of index and search tools for corpus linguists
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • Deliver secure remote access with OpenVPN. Icon
    Deliver secure remote access with OpenVPN.

    Trusted by nearly 20,000 customers worldwide, and all major cloud providers.

    OpenVPN's products provide scalable, secure remote access — giving complete freedom to your employees to work outside the office while securely accessing SaaS, the internet, and company resources.
    Get started — no credit card required.
  • 5
    concordia

    concordia

    Powerful search library, best suited for computer-aided translation

    Concordia - Roman goddess of agreement. Concordance searcher - tool for translators who need their translations to "agree" with one standard. Concordia is a C++ library for fast text lookup in large corpora. It uses a RAM stored index, which takes up approximately 600MB of memory for a corpus of 2 million sentences. It is based on the idea of a suffix array, enhanced by the presence of other auxiliary data structures. The effects are stunning - Concordia is able to do simple substring...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    This package contains different tools to add NLP capabilities for Lucene 4.x (it has been tested using Lucene version from 4.6.x to 4.8.1). Although it was originally developed for German, it is, mostly, language independent. It allows the user to lemmatize words to be indexed, to weight termy ba their parts of speech (e.g. weighting nouns mor hevaily than pronouns), and to add synonyms taken from GermaNet or a list you provide to the search index and thereby increase recall of lucene.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    ldi

    Lucene Oracle Integration using Data Cartridge API

    Lucene Domain Index is full integration of Lucene project running inside the Oracle database using Oracle JVM. The integration provides a transparent detection of row data changes and an SQL layer for doing search.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    medicalcodes

    ICD10CM ICD10PCS NDC (Drug Codes USA) LOINC

    N.B This project is DEPRECATED. Medicalcodes is a web application. It contains more than 460.000 codes useful to various medical fields. Is written in php, javascript, d3.js and is using postgresql as back end. Supports the following Coding Systems: 1. ICD10cm (2014) - Diagnostic Codes. Synonyms search is possible via the Pharmkgb DB and CTD DB - Links to MESH index is provided 2. ICD10pcs (2014) - Procedures Codes 3. CMS Diagnosis 4. CMS procedures list 5. National Drugs...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Index biological data (genbank sheets, Uniprot...) in a Solr indexer, with index shard support and provides a query interface. Project goal is to create a virtual image with indexer and web interface to query and visualize biological data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • The #1 Embedded Analytics Solution for SaaS Teams. Icon
    The #1 Embedded Analytics Solution for SaaS Teams.

    Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

    Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.
    Try Developer Playground
  • 10
    MRS is a tool to quickly and easily store and index large flat file databanks and in a space efficient manner. It is currently used to index huge bioinformatics databanks but it is not limited to this area.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    CORPSE (CORPus SEarch) is a powerful search engine written in Java. The aim is to provide an efficient implementation of a word level inverted index search with various cool functions that can be used on very large corpora.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    The system searches synonyms (and related words) in Wikipedia. WikIDF generates index database of Wikipedia (for Russian, English, and German). The continuation of this project is "wikokit" at code.google.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    APoDIx is a Portable Bio-Database Index system. It retrieves biological database information from the website of Oxford Journals, save them in a local position, and provides a viewer for display and search the information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    TextMine is for the Perl hacker who is grappling with the problems of managing unstructured text from various sources. You can use these text mining tools to search the Web, index text, extract entities, categorize your e-mail, and summarize documents.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    geoLucene is an extension of Lucene that allows to effectively index and search documents that contain locational information (longitude/latitude). It uses R-tree as a spacial index. See http://www.gossamer-threads.com/lists/lucene/java-dev/53378.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Universal information crawler is a fast precise and reliable Internet crawler. Uicrawler is a program/automated script which browses the World Wide Web in a methodical, automated manner and creates the index of documents that it accesses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Albion is a digital document archive presentation package. Features include a searchable database, an integrated spell checker, and automatic thumbnail generation. As native Perl as possible, the database/photos/search-function can be delivered on CD!
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next