Showing 50 open source projects for "data"

View related business solutions
  • Ship AI Apps Faster with Vertex AI Icon
    Ship AI Apps Faster with Vertex AI

    Go from idea to deployed AI app without managing infrastructure. Vertex AI offers one platform for the entire AI development lifecycle.

    Ship AI apps and features faster with Vertex AI—your end-to-end AI platform. Access Gemini 3 and 200+ foundation models, fine-tune for your needs, and deploy with enterprise-grade MLOps. Build chatbots, agents, or custom models. New customers get $300 in free credit.
    Try Vertex AI Free
  • Easily Host LLMs and Web Apps on Cloud Run Icon
    Easily Host LLMs and Web Apps on Cloud Run

    Run everything from popular models with on-demand NVIDIA L4 GPUs to web apps without infrastructure management.

    Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure. Cloud Run gives you on-demand GPU access for hosting LLMs and running real-time AI—with 5-second cold starts and automatic scale-to-zero so you only pay for actual usage. New customers get $300 in free credit to start.
    Try Cloud Run Free
  • 1
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    ...It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 2
    SWR

    SWR

    React Hooks library for remote data fetching

    The name “SWR” is derived from stale-while-revalidate, a HTTP cache invalidation strategy popularized by HTTP RFC 5861. SWR is a strategy to first return the data from cache (stale), then send the fetch request (revalidate), and finally come with the up-to-date data. With SWR, components will get a stream of data updates constantly and automatically. And the UI will be always fast and reactive. With just one single line of code, you can simplify the logic of data fetching in your project, and also have all many amazing features out-of-the-box. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Meilisearch

    Meilisearch

    An open-source, lightning-fast, and hyper-relevant search engine

    ...Search-as-you-type returns answers in less than 50 milliseconds. That's faster than the blink of an eye! Deploy in a matter of minutes. Smart presets let you start searching through your data with zero configuration. Send data to Meilisearch however you want, no need to match a schema or convert your dataset to a compatible format. Everyone makes mistakes! If typos break your search experience, many users will leave thinking what they were looking for just wasn't there. Start searching through your dataset in less than 5 minutes and quickly connect your codebase to Meilisearch with our official libraries. ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 4
    truffleHog

    truffleHog

    Searches through git repositories for high entropy strings and secrets

    truffleHog searches through git repositories for high entropy strings and secrets, digging deep into commit history. TruffleHog runs behind the scenes to scan your environment for secrets like private keys and credentials, so you can protect your data before a breach occurs. Secrets can be found anywhere, so TruffleHog scans more than just code repositories, including SaaS and internally hosted software. With support for custom integrations and new integrations added all the time, you can secure your secrets across your entire environment. TruffleHog is developed by a team entirely comprised of career security experts. ...
    Downloads: 14 This Week
    Last Update:
    See Project
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • 5
    Fuse.js

    Fuse.js

    Lightweight fuzzy-search, in JavaScript

    ...These operators are used for filtering the data and getting precise results based on the given conditions.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    TNTSearch

    TNTSearch

    A fully featured full text search engine written in PHP

    TNTSearch is a full-text search engine written in PHP, designed to be integrated into Laravel and other PHP applications. It offers real-time, efficient indexing and searching of textual data using SQLite as its storage backend. TNTSearch is highly configurable and supports features like fuzzy searching, customizable ranking algorithms, and boolean search, making it a powerful tool for adding search functionality to websites and applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Laravel Scout Elasticsearch

    Laravel Scout Elasticsearch

    Search among multiple models with ElasticSearch and Laravel Scout

    The package provides the perfect starting point to integrate ElasticSearch into your Laravel application. It is carefully crafted to simplify the usage of ElasticSearch within the Laravel Framework. It’s built on top of the latest release of Laravel Scout, the official Laravel search package. Using this package, you are free to take advantage of all of Laravel Scout’s great features, and at the same time leverage the complete set of ElasticSearch’s search experience.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Katalog

    Katalog

    Catalog and Search files from permanent or removable drives

    Katalog is a desktop application to manage catalogs of disks and files: - Create catalogs from different sources or devices, - Search files even when the devices are disconnected, and find duplicates or differences - Organize your Collection of catalogs, Storage devices, and Virtual storage devices and get Statistics, - Data is stored in csv (tab separated) files for full control by the user, - Available in multiple languages - OpenSource and cross-platform (Linux Plasma and Windows 64 installer or portable). First use / tips - Simply start with the Create screen. Create your first catalog and experiment! - All data/catalog files are stored in the Settings/Collection folder. ...
    Leader badge
    Downloads: 421 This Week
    Last Update:
    See Project
  • 9

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you will find binary downloads and discussion (https://sourceforge.net/p/crgrep/discussion/) . The actual development and issue tracking can be found here: https://bitbucket.org/cryanfuse/crgrep
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Scout Elasticsearch Driver

    Scout Elasticsearch Driver

    Offers advanced functionality for searching data in Elasticsearch

    This package offers advanced functionality for searching and filtering data in Elasticsearch.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    List.js

    List.js

    Library for adding search, sort, filters and flexibility to tables

    Tiny, invisible and simple, yet powerful and incredibly fast vanilla JavaScript that adds search, sort, filters and flexibility to plain HTML lists, tables, or anything. List.js can be used in three different ways. It can be on existing HTML, it can create it's own HTML or a combination of both methods. Works both lists, tables and almost anything else. E.g. <div>,<ul>,<table>, etc. Simple templating system that adds possibility to add, edit, remove items. Perfect library for adding search,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    pyFileSearcher

    pyFileSearcher

    simple searching tool for big fileservers

    pyFileSearcher was designed to be lightweight, easy to use, but capable of handling a large volume of files tool. A tool that I personally could use on large corporate servers to find out - which files have taken all my space in the last few days? It's free, it's opensource, it's for linux and windows. The program is written in Python 3 using the Qt5.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    ftdetector

    File type detector library

    This project is a tool to detect file types by signatures and mime types. It uses hash tables to make the detection of a file type as fast as possible. The signature and mime types lists are stored at simple user-friendly files. This file type detector supports a lot of formats (image, archive, text, documents, audio, video, fonts and others). It also includes Microsoft OLE compound file types. The detector's algorythm has special features to detect text file types like (HTML, XML, JSON,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A catalog application for various media types - CD, DVD, NetDrives, USB flash keys, etc. It can import data from famous WhereIsIt Windows applicaion. In a word this is a try to make a WhereIsIt-like application for Linux.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    JarOMine

    JarOMine

    Quickly Search THOUSANDS of Archives!

    If you have ever tried to locate files, classes, and resources buried amongst an ever shifting locus of ZIP and / or JAR files, we feel your pain! After spending far too much time searching hundreds of archives for moving targets, I decided to write JarOMine. Originally designed for locating Java classes, JarOMine works equally well with ZIP archives, too.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    bnf2xml

    simple BNF parser makes xml markup of matches

    bnf2xml a simple BNF parser that takes text as input, searches according to a BNF query file, and outputs text marked up by the xml labels that show context. bnf2xml is as simple to use as any text binary ie, awk(1) grep(1). bnf2xml does not require C API because it outputs simple xml labeling. README is visible on file dl page. EXAMPLE: $ echo "hi" | bnf2xml patternfile <word><alph>h</alph><alph>i</alph></word> or <gas>hydrogen iodide</gas> patternfile says how to find...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    Primes

    Calculate primes by using extremely fast sorting

    ...Unfortunately it has turned out that going this way is even more slowly than trying to find primes by brute force. So it can only be used as a test with heavy load for the sorting algorithm, which can be used for sorting any kind of data. And as already mentioned, it's just the most efficient tree-based sorting algorithm that you can get. But furthermore this way of finding primes interestingly leaves a hard nut to crack for mathematicians: In very rare cases it finds numbers that are not primes. For all primes below one million this phenomenon arises in exactly two cases: 31213 which is 7 * 7 * 7 * 7 * 13 336141 which is 3 * 3 * 13 * 13 * 13 * 17 Who can explain, why?
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Horace

    data organising system for arbitrary files

    Uses alternate data streams to provide arbitrary tagging and searching of files within NTFS and other modern file systems supporting alternate data streams. Customisable vocabulary provides searchable standardised tagging system of file associations. Unlike most other file archiving systems, no additional database is required, and the system is robust, and persists file attributes irrespective of renaming / moving / copying / modifying etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    hf

    bash history file filter

    hf (history filter) filters bash command history in a way similar to a file search. You can see the results as you type, then you can run or edit the selected command. If you use the Ctrl+r bash shortcut a lot, this will definitely make your life easier. hf is a fork of hstr: https://github.com/dvorka/hstr
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Infofuze

    Data migration/conversion library based on STX and XSLT transformation

    Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Wukong

    Wukong

    Highly customizable full-text search engine

    Efficient indexing and searching (1M Weibo 500M data is indexed in 28 seconds, search response time is 1.65 milliseconds, and search QPS is 19K). Support Chinese word segmentation (concurrent word segmentation using the sego word segmentation package, speed 27MB/sec). Support to calculate the proximity distance of keywords in the text (token proximity). When a request to add a document to the index comes in, the main coroutine will send the text to be segmented to a word segmentation coroutine through a channel, and the coroutine will segment the text and send it to a word segmentation through another channel. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Meresco is both an OAI Data Provider and a Service Provider. SourceForge is only used to host the source control (subversion). Sources: http://sources.meresco.org/ Binaries: http://repository.cq2.org/ Mail: http://groups.google.com/group/meresco
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB