Showing 146 open source projects for "data"

View related business solutions
  • Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud Icon
    Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud

    Get back to your application and leave the database to us. Cloud SQL automatically handles backups, replication, and scaling.

    Cloud SQL is a fully managed relational database for MySQL, PostgreSQL, and SQL Server. We handle patching, backups, replication, encryption, and failover—so you can focus on your app. Migrate from on-prem or other clouds with free Database Migration Service. IDC found customers achieved 246% ROI. New customers get $300 in credits plus a 30-day free trial.
    Try Cloud SQL Free
  • Build AI Apps with Gemini 3 on Vertex AI Icon
    Build AI Apps with Gemini 3 on Vertex AI

    Access Google’s most capable multimodal models. Train, test, and deploy AI with 200+ foundation models on one platform.

    Vertex AI gives developers access to Gemini 3—Google’s most advanced reasoning and coding model—plus 200+ foundation models including Claude, Llama, and Gemma. Build generative AI apps with Vertex AI Studio, customize with fine-tuning, and deploy to production with enterprise-grade MLOps. New customers get $300 in free credits.
    Try Vertex AI Free
  • 1
    Logstash

    Logstash

    Centralize, transform and stash your data

    Logstash is a server-side data processing pipeline that dynamically ingests data from numerous sources, transforms it, and ships it to your favorite “stash” regardless of format or complexity. It supports and ingests data of all shapes, sizes and sources, dynamically transforms and prepares this data, and transports it to the output of your choice. Logstash is extensible, with over 200 plugins available to let you create and configure your pipeline how you choose.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 2
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    ...It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 3
    syslog-ng

    syslog-ng

    Log management solution that improves the performance of SIEM

    ...Instead of deploying multiple agents on hosts, organizations can unify their log data collection and management. syslog-ng Store Box provides automated archiving, tamper-proof encrypted storage, granular access controls to protect log data. The largest appliance can store up to 10TB of raw logs.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    fluentbit

    fluentbit

    Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX

    ...No more OOM errors! Integration with all your technology, cloud-native services, containers, streaming processors, and data backends. Fully event-driven design leverages the operating system API for performance and reliability. All operations to collect and deliver data are asynchronous.
    Downloads: 2 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    Grafana Loki

    Grafana Loki

    Multi-tenant log aggregation system inspired by Prometheus

    ...It does not index the contents of the logs, but rather a set of labels for each log stream. Unlike other logging systems, Loki is built around the idea of only indexing metadata about your logs: labels (just like Prometheus labels). Log data itself is then compressed and stored in chunks in object stores such as S3 or GCS, or even locally on the filesystem. A small index and highly compressed chunks simplifies the operation and significantly lowers the cost of Loki. Loki supports multi-tenancy so that data between tenants is completely separated. Multi-tenancy is achieved through a tenant ID (which is represented as an alphanumeric string). ...
    Downloads: 32 This Week
    Last Update:
    See Project
  • 6
    Beats

    Beats

    Lightweight data shippers of the Elastic Stack

    Beats is a platform of lightweight, single-use data shippers written in Go. It captures all sorts of operational data from hundreds or thousands of machines and systems and sends them to Logstash or Elasticsearch. Beats is lightweight in that it has a small installation footprint, uses limited system resources and has no runtime dependencies. It consists of: Auditbeat (for audit data), Filebeat (for log files), Functionbeat (serverless shipper), Heartbeat (for uptime monitoring), Metricbeat (for metrics), Packetbeat (for monitoring network data), and Winlogbeat (for Windows Event logs), with many other Beats created by the community.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Serilog

    Serilog

    Simple .NET logging with fully-structured events

    ...It is easy to set up, has a clean API, and is portable between recent .NET platforms. Unlike other logging libraries, Serilog is built with powerful structured event data in mind. Serilog message templates are a simple DSL extending .NET format strings. Parameters can be named, and their values are serialized as properties on the event for incredible searching and sorting flexibility. Serilog is a diagnostic logging library for .NET applications. It is easy to set up, has a clean API, and runs on all recent .NET platforms. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Netdata

    Netdata

    Open-source systems performance monitor

    Netdata is a well-crafted real time performance monitor to detect anomalies in your system infrastructure. Visualize many types of data including disk activity, SQL queries, website visitors and more. This tool is useful to monitor linux servers.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 9
    Osquery

    Osquery

    SQL operating system instrumentation and monitoring framework

    ...The tools make low-level operating system analytics and monitoring both performant and intuitive. Osquery exposes an operating system as a high-performance relational database. This allows you to write SQL queries to explore operating system data. With osquery, SQL tables represent abstract concepts such as running processes, loaded kernel modules, open network connections, browser plugins, hardware events or file hashes.Osquery queries your devices like a database. Osquery uses basic SQL commands to leverage a relational data-model to describe a device. Frequently, attackers will leave a malicious process running but delete the original binary on disk. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • Easily Host LLMs and Web Apps on Cloud Run Icon
    Easily Host LLMs and Web Apps on Cloud Run

    Run everything from popular models with on-demand NVIDIA L4 GPUs to web apps without infrastructure management.

    Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure. Cloud Run gives you on-demand GPU access for hosting LLMs and running real-time AI—with 5-second cold starts and automatic scale-to-zero so you only pay for actual usage. New customers get $300 in free credit to start.
    Try Cloud Run Free
  • 10
    mtail

    mtail

    Extract internal monitoring data from application logs

    Extract internal monitoring data from application logs for collection in a time-series database. mtail is a tool for extracting metrics from application logs to be exported into a timeseries database or timeseries calculator for alerting and dashboarding. It fills a monitoring niche by being the glue between applications that do not export their own internal state (other than via logs) and existing monitoring systems, such that system operators do not need to patch those applications to instrument them or writing custom extraction code for every such application. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    parseable

    parseable

    Parseable is a log analytics system written in Rust

    Parseable is a log analytics platform, built for the modern, cloud native era. Parseable uses a index-free mechanism to organize and query data allowing low latency, and high throughput ingestion and query.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PSR Log

    PSR Log

    An interface that describes a logger

    The main goal is to allow libraries to receive a Psr\Log\LoggerInterface object and write logs to it in a simple and universal way. The LoggerInterface exposes eight methods to write logs to the eight RFC 5424 levels (debug, info, notice, warning, error, critical, alert, emergency). Every method accepts a string as the message, or an object with a toString() method. Implementors MAY have special handling for the passed objects. If that is not the case, implementors MUST cast it to a string....
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Timbre

    Timbre

    Pure Clojure/Script logging library

    ...Getting even the simplest logging working can be comically hard, and it often gets worse at scale as your needs become more sophisticated. Timbre offers an all Clojure/Script alternative that's fast, deeply flexible, easy to configure with pure Clojure data, and that just works out the box. No XML. Easily filter logging calls by any combination of: level, namespace, and appended. Zero overhead compile-time level/ns elision. Useful built-in appenders for out-the-box Clj+Cljs logging. Powerful, easy-to-configure rate limits and async logging. Logs as Clojure values (v3+). Small, simple, cross-platform codebase. 3rd-party tools, appenders, etc. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Logbook

    Logbook

    An extensible Java library for HTTP request and response logging

    Logbook is an extensible Java library to enable complete request and response logging for different client- and server-side technologies. It satisfies a special need by a) allowing web application developers to log any HTTP traffic that an application receives or sends b) in a way that makes it easy to persist and analyze it later. This can be useful for traditional log analysis, meeting audit requirements or investigating individual historic traffic issues. Logbook is ready to use out of...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    morgan

    morgan

    HTTP request logger middleware for node.js

    ...The function is expected to return a string that will be the log line, or undefined / null to skip logging. Write log line on request instead of response. This means that a requests will be logged even if the server crashes, but data from the response (like the response code, content length, etc.) cannot be logged. To define a token, simply invoke morgan.token() with the name and a callback function.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    FASTER

    FASTER

    Fast persistent recoverable log and key-value store + cache

    ...It supports both sync and async interfaces, handles disk errors, and supports checksums. FASTER KV is a concurrent key-value store + cache (available in C# and C++) that is designed for point lookups and heavy updates. FASTER supports data larger than memory, by leveraging fast external storage (local or cloud). It also supports consistent recovery using a new checkpointing technique that lets applications trade-off performance for commit latency.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    perf2rrd

    Performance Data collector for Monitoring.

    perf2rrd is a program written in Java (tm) which captures performance data output by the Nagios (R) or Icinga monitoring systems and stores it in round robin database files as implememented by RRDtool.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18

    Syslogd2

    High capacity syslog data collection, filtering, and management.

    Syslogd2 is a syslog daemon that has been completely re-imagined specifically for use in network environments. It is multi-threaded, scalable and versatile with features designed for both network and host managers. Each Syslogd2 binary is customized from a set of over 20 features at compile-time. It can support input from text files, named-pipes, Linux kernel and user-defined Linux and (both IPv4 and IPv6) IP sockets (both UDP and TCP). It provides a pre-loadable name-cache that can...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    DBD2

    A multi-threaded, multi-database tcp-based database insertion app.

    ...The client sends whatever information the user puts on the command-line to the back-end daemon via TCP. Upon successful transmission, the client exits. On failure, it backs off and retries until its life-time timer expires. The back-end daemon accepts data from Syslogd2 or its own client and inserts that data (and envelope information) into one or more databases using MySql client library or dbi drivers (also on SourceForge). The client needs no configuration (all input is via command-line). It is intended for hosts to make database entries without needing database client or interface code. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    SageMaker Experiments Python SDK

    SageMaker Experiments Python SDK

    Experiment tracking and metric logging for Amazon SageMaker notebooks

    ...Each step in the workflow is described by a Trial Component. There is no relationship between Trial Components such as ordering. Trial Component: A description of a single step in a machine learning workflow. For example data cleaning, feature extraction, model training, model evaluation, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Netflow

    Netflow

    free netflow analyzer

    This is a set of perl and php scripts that are used to collect, process, store and display netflow data.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    NDW - Network Data Wiping Hard Disks HDD

    NDW - Network Data Wiping Hard Disks HDD

    Wipe erase data from hdds on the network, unlimited hard disks

    ...As soon as the workstation is powered on, it boots off the network via the server and begins wiping all the attached hard disk drives. Once all the hard drives are wiped, data related to each hard disk is stored in the central server’s database. Read WIKI Automated unattended network based data erasure wiping Comprehensive Reporting Web GUI reporting Generates certificate of data erasure Barcodes Labels Graphs Pie / Bar / Line Charts Hard Disks bad sectors & health reporting Hard Disk SMART Data collection Database Driven Distributed System Hard Disk Drives health reports Part Open Source * Friendly GUI Graphical User Interface Mass Hard Disk Drive Destruction Data Wiping HDD Data Eraser Storage Array Data Wipe
    Downloads: 10 This Week
    Last Update:
    See Project
  • 24

    mqottrace

    An mqtt client to subscribe to owntracks events

    For owntracks, see https://owntracks.org/ mqottrace collects location data from the mqtt server and writes it into an sqlite3 database. You can convert the data into gpx tracks with mqott2gpx.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ddrescueview

    ddrescueview

    Graphical viewer for GNU ddrescue mapfiles

    This small tool allows the user to graphically examine ddrescue's mapfiles in a user friendly GUI application. The Main window displays a block grid with each block's color representing the block types it contains. Many people know this type of view from defragmentation programs. The program is written in Object Pascal using the Lazarus IDE. It can be compiled to run on Windows, GNU/Linux and probably OS X, too. The downloads include binaries for the respective target platform and the...
    Leader badge
    Downloads: 64 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.
Try Free →