data free download - SourceForge

Showing 50 open source projects for "data"

1

Elasticsearch

A Distributed RESTful Search Engine

...It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.

Downloads: 11 This Week

Last Update: 2026-02-03
See Project
2

SWR

React Hooks library for remote data fetching

The name “SWR” is derived from stale-while-revalidate, an HTTP cache invalidation strategy popularized by HTTP RFC 5861. SWR first returns the data from cache (stale), then sends the fetch request (revalidate), and finally delivers the up-to-date data. With SWR, components get a stream of data updates constantly and automatically, so the UI is always fast and reactive. With a single line of code, you can simplify the data-fetching logic in your project and get many features out of the box. ...

Downloads: 1 This Week

Last Update: 2026-02-01
See Project
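
The stale-while-revalidate sequence described for SWR above (serve cached data, refetch, then deliver the fresh result) can be sketched roughly as follows. This is a minimal, synchronous illustration in Python, not the SWR library's API: the class name `StaleWhileRevalidateCache`, the `fetcher` callable, and the `on_update` callback are all hypothetical names chosen for the example, and the real library performs the revalidation asynchronously inside React hooks.

```python
from typing import Any, Callable, Dict


class StaleWhileRevalidateCache:
    """Hypothetical cache illustrating the stale-while-revalidate order of events."""

    def __init__(self, fetcher: Callable[[str], Any]) -> None:
        self._fetcher = fetcher          # assumed: any callable that loads data for a key
        self._store: Dict[str, Any] = {}

    def get(self, key: str, on_update: Callable[[Any], None]) -> Any:
        # Step 1 (stale): hand back whatever is cached, if anything.
        if key in self._store:
            on_update(self._store[key])
        # Step 2 (revalidate): fetch the current value from the source.
        fresh = self._fetcher(key)
        self._store[key] = fresh
        # Step 3: deliver the up-to-date value.
        on_update(fresh)
        return fresh


if __name__ == "__main__":
    responses = iter(["v1", "v2"])       # simulated server responses
    cache = StaleWhileRevalidateCache(lambda key: next(responses))
    cache.get("/api/user", print)        # first call: nothing cached yet
    cache.get("/api/user", print)        # second call: stale value first, then fresh
```

On the second call the consumer sees the cached value immediately and the refreshed value afterwards, which is the behaviour the description attributes to the strategy.
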
3

Meilisearch

An open-source, lightning-fast, and hyper-relevant search engine

...Search-as-you-type returns answers in less than 50 milliseconds. That's faster than the blink of an eye! Deploy in a matter of minutes. Smart presets let you start searching through your data with zero configuration. Send data to Meilisearch however you want, no need to match a schema or convert your dataset to a compatible format. Everyone makes mistakes! If typos break your search experience, many users will leave thinking what they were looking for just wasn't there. Start searching through your dataset in less than 5 minutes and quickly connect your codebase to Meilisearch with our official libraries. ...

Downloads: 11 This Week

Last Update: 4 days ago
See Project
4

truffleHog

Searches through git repositories for high entropy strings and secrets

truffleHog searches through git repositories for high entropy strings and secrets, digging deep into commit history. TruffleHog runs behind the scenes to scan your environment for secrets like private keys and credentials, so you can protect your data before a breach occurs. Secrets can be found anywhere, so TruffleHog scans more than just code repositories, including SaaS and internally hosted software. With support for custom integrations and new integrations added all the time, you can secure your secrets across your entire environment. TruffleHog is developed by a team composed entirely of career security experts. ...

Downloads: 14 This Week

Last Update: 1 day ago
See Project
5

Fuse.js

Lightweight fuzzy-search, in JavaScript

...These operators are used for filtering the data and getting precise results based on the given conditions.

Downloads: 3 This Week

Last Update: 2025-02-03
See Project
6

TNTSearch

A fully featured full text search engine written in PHP

TNTSearch is a full-text search engine written in PHP, designed to be integrated into Laravel and other PHP applications. It offers real-time, efficient indexing and searching of textual data using SQLite as its storage backend. TNTSearch is highly configurable and supports features like fuzzy searching, customizable ranking algorithms, and boolean search, making it a powerful tool for adding search functionality to websites and applications.

Downloads: 0 This Week

Last Update: 2025-08-25
See Project
7

Laravel Scout Elasticsearch

Search among multiple models with ElasticSearch and Laravel Scout

The package provides the perfect starting point to integrate ElasticSearch into your Laravel application. It is carefully crafted to simplify the usage of ElasticSearch within the Laravel Framework. It’s built on top of the latest release of Laravel Scout, the official Laravel search package. Using this package, you are free to take advantage of all of Laravel Scout’s great features, and at the same time leverage the complete set of ElasticSearch’s search experience.

Downloads: 0 This Week

Last Update: 2025-08-26
See Project
8

Katalog

Catalog and Search files from permanent or removable drives

Katalog is a desktop application to manage catalogs of disks and files:
- Create catalogs from different sources or devices.
- Search files even when the devices are disconnected, and find duplicates or differences.
- Organize your Collection of catalogs, Storage devices, and Virtual storage devices, and get Statistics.
- Data is stored in CSV (tab-separated) files for full control by the user.
- Available in multiple languages.
- Open source and cross-platform (Linux Plasma and Windows 64 installer or portable).

First use / tips:
- Simply start with the Create screen. Create your first catalog and experiment!
- All data/catalog files are stored in the Settings/Collection folder. ...

Downloads: 421 This Week

Last Update: 2026-02-10
See Project
9

UniversalTextExtractor

Command-line toolset for extracting text from files

Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.

Downloads: 0 This Week

Last Update: 2026-01-17
See Project
10

Common Resource Grep - crgrep

Common Resource Grep

CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. It is a command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, Maven dependencies and web resources. CRGREP will search resources within resources in any arbitrary combination or depth, for example text within a document within a zip archive, and so on. Here you will find binary downloads and discussion (https://sourceforge.net/p/crgrep/discussion/). The actual development and issue tracking can be found here: https://bitbucket.org/cryanfuse/crgrep

3 Reviews

Downloads: 1 This Week

Last Update: 2023-04-23
See Project
11

DynaQ

Innovative text document search. http://dynaq.opendfki.de for details.

The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop) search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for source code and documentation: http://dynaq.opendfki.de

Downloads: 0 This Week

Last Update: 2021-08-05
See Project
12

Scout Elasticsearch Driver

Offers advanced functionality for searching data in Elasticsearch

This package offers advanced functionality for searching and filtering data in Elasticsearch.

Downloads: 1 This Week

Last Update: 2024-04-16
See Project
13

List.js

Library for adding search, sort, filters and flexibility to tables

Tiny, invisible and simple, yet powerful and incredibly fast vanilla JavaScript that adds search, sort, filters and flexibility to plain HTML lists, tables, or anything. List.js can be used in three different ways: on existing HTML, by creating its own HTML, or with a combination of both methods. Works with lists, tables and almost anything else, e.g. <div>, <ul>, <table>, etc. A simple templating system adds the possibility to add, edit, and remove items. Perfect library for adding search,...

Downloads: 0 This Week

Last Update: 2021-06-01
See Project
14

pyFileSearcher

simple searching tool for big fileservers

pyFileSearcher was designed to be a lightweight, easy-to-use tool that is still capable of handling a large volume of files. It is a tool that I personally could use on large corporate servers to find out which files have taken all my space in the last few days. It's free, it's open source, and it runs on Linux and Windows. The program is written in Python 3 using Qt5.

Downloads: 0 This Week

Last Update: 2019-07-01
See Project
15

ftdetector

File type detector library

This project is a tool to detect file types by signatures and MIME types. It uses hash tables to make the detection of a file type as fast as possible. The signature and MIME type lists are stored in simple user-friendly files. This file type detector supports a lot of formats (image, archive, text, documents, audio, video, fonts and others). It also includes Microsoft OLE compound file types. The detector's algorithm has special features to detect text file types like (HTML, XML, JSON,...

Downloads: 0 This Week

Last Update: 2019-04-08
See Project
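
The signature lookup through a hash table that the ftdetector description mentions can be illustrated with a short Python sketch. This is an assumption-laden illustration, not ftdetector's own code or file format: the `SIGNATURES` table, the type labels, and the `detect_type` function are hypothetical, and only a handful of well-known magic numbers are included.

```python
from typing import Optional

# Hypothetical signature table: leading "magic" bytes mapped to a type label.
# A dict is a hash table, so each lookup is a constant-time operation on average.
SIGNATURES = {
    b"\x89PNG\r\n\x1a\n": "PNG image",
    b"%PDF-": "PDF document",
    b"PK\x03\x04": "ZIP archive",
    b"GIF89a": "GIF image",
    b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1": "OLE compound file",
}

# Distinct signature lengths, longest first, so longer signatures win.
_LENGTHS = sorted({len(sig) for sig in SIGNATURES}, reverse=True)


def detect_type(data: bytes) -> Optional[str]:
    """Return a type label for the given leading bytes, or None if unknown."""
    for length in _LENGTHS:
        label = SIGNATURES.get(data[:length])  # one hash lookup per length
        if label is not None:
            return label
    return None


if __name__ == "__main__":
    print(detect_type(b"%PDF-1.7\n"))
    print(detect_type(b"plain text"))
```

Hashing the file's leading bytes once per distinct signature length avoids comparing against every known signature in turn, which is one plausible reading of why a hash table makes detection fast.
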
16

QCat

A catalog application for various media types: CD, DVD, NetDrives, USB flash keys, etc. It can import data from the famous WhereIsIt Windows application. In a word, this is an attempt to make a WhereIsIt-like application for Linux.

Downloads: 0 This Week

Last Update: 2018-04-07
See Project
17

Aperture

Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.

6 Reviews

Downloads: 4 This Week

Last Update: 2017-11-04
See Project
18

JarOMine

Quickly Search THOUSANDS of Archives!

If you have ever tried to locate files, classes, and resources buried amongst an ever shifting locus of ZIP and / or JAR files, we feel your pain! After spending far too much time searching hundreds of archives for moving targets, I decided to write JarOMine. Originally designed for locating Java classes, JarOMine works equally well with ZIP archives, too.

Downloads: 0 This Week

Last Update: 2016-09-22
See Project
19

bnf2xml

simple BNF parser makes xml markup of matches

bnf2xml is a simple BNF parser that takes text as input, searches according to a BNF query file, and outputs text marked up by XML labels that show context. bnf2xml is as simple to use as any text binary such as awk(1) or grep(1). bnf2xml does not require a C API because it outputs simple XML labeling. The README is visible on the file download page.
EXAMPLE:
$ echo "hi" | bnf2xml patternfile
<word><alph>h</alph><alph>i</alph></word>
or
<gas>hydrogen iodide</gas>
patternfile says how to find...

Downloads: 0 This Week

Last Update: 2016-04-08
See Project
20

Primes

Calculate primes by using extremely fast sorting

...Unfortunately, it has turned out that going this way is even slower than trying to find primes by brute force. So it can only be used as a heavy-load test for the sorting algorithm, which can be used for sorting any kind of data. And as already mentioned, it's just the most efficient tree-based sorting algorithm that you can get. Furthermore, this way of finding primes interestingly leaves a hard nut to crack for mathematicians: in very rare cases it finds numbers that are not primes. For all candidates below one million this phenomenon arises in exactly two cases: 31213, which is 7 * 7 * 7 * 7 * 13, and 336141, which is 3 * 3 * 13 * 13 * 13 * 17. Who can explain why?

Downloads: 0 This Week

Last Update: 2016-04-18
See Project
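
The two factorizations quoted in the Primes description above can be checked with plain trial division. The sketch below is only a verification aid written for this listing; it is not the project's sorting-based method, and the function name `prime_factors` is a hypothetical choice.

```python
from typing import List


def prime_factors(n: int) -> List[int]:
    """Return the prime factors of n in ascending order, using trial division."""
    factors: List[int] = []
    divisor = 2
    while divisor * divisor <= n:
        while n % divisor == 0:      # strip out every copy of this divisor
            factors.append(divisor)
            n //= divisor
        divisor += 1
    if n > 1:                        # whatever remains is itself prime
        factors.append(n)
    return factors


if __name__ == "__main__":
    for candidate in (31213, 336141):
        print(candidate, "=", " * ".join(str(f) for f in prime_factors(candidate)))
```

Both numbers decompose exactly as stated, so neither is prime.
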
21

Horace

data organising system for arbitrary files

Uses alternate data streams to provide arbitrary tagging and searching of files within NTFS and other modern file systems supporting alternate data streams. Customisable vocabulary provides searchable standardised tagging system of file associations. Unlike most other file archiving systems, no additional database is required, and the system is robust, and persists file attributes irrespective of renaming / moving / copying / modifying etc.

Downloads: 0 This Week

Last Update: 2016-03-21
See Project
22

hf

bash history file filter

hf (history filter) filters bash command history in a way similar to a file search. You can see the results as you type, then you can run or edit the selected command. If you use the Ctrl+r bash shortcut a lot, this will definitely make your life easier. hf is a fork of hstr: https://github.com/dvorka/hstr

Downloads: 0 This Week

Last Update: 2016-03-15
See Project
23

Infofuze

Data migration/conversion library based on STX and XSLT transformation

Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.

Downloads: 0 This Week

Last Update: 2014-03-05
See Project
24

Wukong

Highly customizable full-text search engine

Efficient indexing and searching (1M Weibo posts, 500 MB of data, are indexed in 28 seconds; search response time is 1.65 milliseconds, and search QPS is 19K). Supports Chinese word segmentation (concurrent word segmentation using the sego word segmentation package, at a speed of 27 MB/sec). Supports calculating the proximity distance of keywords in the text (token proximity). When a request to add a document to the index comes in, the main coroutine sends the text to be segmented to a word segmentation coroutine through a channel, and that coroutine segments the text and sends it to an indexer coroutine through another channel. ...

Downloads: 0 This Week

Last Update: 2022-02-10
See Project
25

Meresco

Meresco is both an OAI Data Provider and a Service Provider. SourceForge is only used to host the source control (subversion). Sources: http://sources.meresco.org/ Binaries: http://repository.cq2.org/ Mail: http://groups.google.com/group/meresco

1 Review

Downloads: 0 This Week

Last Update: 2013-05-28
See Project