Showing 30539 open source projects for "data"

View related business solutions
  • Outgrown Windows Task Scheduler? Icon
    Outgrown Windows Task Scheduler?

    Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

    Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.
    Download Free Tool
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    jsPDF

    jsPDF

    HTML5 client solution for generating PDFs

    The leading HTML5 client solution for generating PDFs. Perfect for event tickets, reports, certificates, you name it! PDFs are ubiquitous across the web, with virtually every enterprise relying on them to share documents. We created jsPDF to solve a major problem with how pdf files were being generated. We decided to make it open-source to allow a community of developers to expand on it.
    Downloads: 34 This Week
    Last Update:
    See Project
  • 2
    Scrapy

    Scrapy

    A fast, high-level web crawling and web scraping framework

    ...It can be used for data mining, monitoring and automated testing.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 3
    Logstash

    Logstash

    Centralize, transform and stash your data

    Logstash is a server-side data processing pipeline that dynamically ingests data from numerous sources, transforms it, and ships it to your favorite “stash” regardless of format or complexity. It supports and ingests data of all shapes, sizes and sources, dynamically transforms and prepares this data, and transports it to the output of your choice. Logstash is extensible, with over 200 plugins available to let you create and configure your pipeline how you choose.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    AKHQ

    AKHQ

    Kafka GUI for Apache Kafka to manage topics, topics data, etc.

    Kafka GUI for Apache Kafka to manage topics, topics data, consumers group, schema registry, connect and more. Enabling your teams to search and explore data in a unified console, while supporting its administration and integration within your ecosystem. Multi-Cluster vision into a central console, available in Multi-Cloud environments. Enabling users to access, search and get insights from your topics, including Live Tail.
    Downloads: 15 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Union Pandera

    Union Pandera

    Light-weight, flexible, expressive statistical data testing library

    The open-source framework for precision data testing for data scientists and ML engineers. Pandera provides a simple, flexible, and extensible data-testing framework for validating not only your data but also the functions that produce them. A simple, zero-configuration data testing framework for data scientists and ML engineers seeking correctness. Access a comprehensive suite of built-in tests, or easily create your own validation rules for your specific use cases. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Nuclio

    Nuclio

    High-Performance Serverless event and data processing platform

    Nuclio is an open source and managed serverless platform used to minimize development and maintenance overhead and automate the deployment of data-science-based applications. Real-time performance running up to 400,000 function invocations per second. Portable across low laptops, edge, on-prem and multi-cloud deployments. The first serverless platform supporting GPUs for optimized utilization and sharing. Automated deployment to production in a few clicks from Jupyter notebook. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    LibreTube

    LibreTube

    An alternative frontend for YouTube, for Android

    YouTube has an extremely invasive privacy policy that relies on using user data in unethical ways. They store a lot of your personal data - ranging from ideas, music taste, content, political opinions, and much more than you think. This project is aimed at improving the users' privacy by being independent from Google and bypassing their data collection. Therefore, the app is using the Piped API, which uses proxies to circumvent Google's data collection and includes some other additional features.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 8
    Matomo

    Matomo

    Alternative to Google Analytics that gives you full control over data

    Google Analytics alternative that protects your data and your customers' privacy. Take back control with Matomo – a powerful web analytics platform that gives you 100% data ownership. You could lose your customers’ trust and risk damaging your reputation if people learn their data is used for Google’s “own purposes”. By choosing the ethical alternative, Matomo, you won’t make privacy sacrifices or compromise your site.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 9
    nango

    nango

    A single API for all your integrations.

    ...It should be the only API you need to integrate to your app. Nango is an open-source solution for integrating third-party APIs with applications, simplifying API authentication, data syncing, and management.
    Downloads: 3 This Week
    Last Update:
    See Project
  • A unique web-based, end-to-end innovative LOS for residential mortgage lending Icon
    A unique web-based, end-to-end innovative LOS for residential mortgage lending

    Lenders, Banks, Credit Unions, and Brokers in need of a cloud solution to accelerate their lending portfolio

    LendingPad is a cloud-native, enterprise loan origination system (LOS) built to modernize mortgage lending for lenders, brokers, bankers and credit unions. Designed by mortgage professionals, the platform emphasizes speed, clarity and ease of use — helping teams close loans faster while delivering a superior borrower experience.
    Learn More
  • 10
    theHarvester

    theHarvester

    E-mails, subdomains and names

    ...Use it for open source intelligence (OSINT) gathering to help determine a company's external threat landscape on the internet. The tool gathers emails, names, subdomains, IPs and URLs using multiple public data sources.
    Downloads: 55 This Week
    Last Update:
    See Project
  • 11
    Node exporter

    Node exporter

    Exporter for machine metrics

    Power your metrics and alerting with a leading open-source monitoring solution. Prometheus implements a highly dimensional data model. Time series are identified by a metric name and a set of key-value pairs. PromQL allows slicing and dicing of collected time series data in order to generate ad-hoc graphs, tables, and alerts. Prometheus has multiple modes for visualizing data: a built-in expression browser, Grafana integration, and a console template language. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 12
    OpenRefine

    OpenRefine

    A free, open source, powerful tool for working with messy data

    OpenRefine is a powerful Java-based tool designed to work with messy data and improve it. With OpenRefine you can load data, understand it, clean it up, transform it, reconcile it, and augment it with web services and external data. It allows you to do this all from a web browser and in the convenience and privacy of your own computer. OpenRefine keeps all data securely in your computer by running a small server on it, using your web browser to interact with it. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    v2ray-rules-dat

    v2ray-rules-dat

    V2Ray routing rules file enhanced version, which can replace V2Ray

    v2ray-rules-dat is a repository that compiles and distributes enhanced rule data (domain lists, geo-IP/geo-domain data, block/proxy/detect lists) intended for use with tools like V2Ray, Xray-core, and similar network/proxy frameworks. The dataset serves as an alternative or supplement to official geoip/ geosite data files, often providing more up-to-date, community-curated entries — enabling better routing, blocking, or traffic management when using those proxy tools. ...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 14
    pgModeler

    pgModeler

    Open-source data modeling tool designed for PostgreSQL

    pgModeler is an open-source data modeling tool for PostgreSQL, enabling visual creation and management of database schemas. It supports reverse engineering from existing databases, model validation, and SQL export, providing a full-featured GUI for database design. pgModeler is suited for developers, DBAs, and analysts who want to streamline schema creation and documentation.
    Downloads: 63 This Week
    Last Update:
    See Project
  • 15
    Bayesian Statistics

    Bayesian Statistics

    This repository holds slides and code for a full Bayesian statistics

    ...Bayesian statistics is an approach to inferential statistics based on Bayes' theorem, where available knowledge about parameters in a statistical model is updated with the information in observed data. The background knowledge is expressed as a prior distribution and combined with observational data in the form of a likelihood function to determine the posterior distribution. The posterior can also be used for making predictions about future events. Bayesian statistics is a departure from classical inferential statistics that prohibits probability statements about parameters and is based on asymptotically sampling infinite samples from a theoretical population and finding parameter values that maximize the likelihood function. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 16
    JSON Crack

    JSON Crack

    Innovative and open-source visualization application

    JSON Crack is a free, open-source data visualization app capable of visualizing data formats such as JSON, YAML, XML, CSV and more, into interactive graphs. With its intuitive and user-friendly interface, JSON Crack makes it easy to explore, analyze, and understand even the most complex data structures. Whether you're a developer working on a large-scale project or a data enthusiast looking to uncover hidden insights, JSON Crack has the tools and features you need to unlock the full potential of your data.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    pointblank

    pointblank

    Data quality assessment and metadata reporting for data frames

    With the pointblank package it’s really easy to methodically validate your data whether in the form of data frames or as database tables. On top of the validation toolset, the package gives you the means to provide and keep up-to-date with the information that defines your tables. For table validation, the agent object works with a large collection of simple (yet powerful!) validation functions. We can enable much more sophisticated validation checks by using custom expressions, segmenting the data, and by selective mutations of the target table. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    IoTDB

    IoTDB

    Apache IoTDB

    Apache IoTDB (Database for Internet of Things) is an IoT native database with high performance for data management and analysis, deployable on the edge and the cloud. Due to its light-weight architecture, high performance and rich feature set together with its deep integration with Apache Hadoop, Spark and Flink, Apache IoTDB can meet the requirements of massive data storage, high-speed data ingestion and complex data analysis in the IoT industrial fields. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Apexcharts.js

    Apexcharts.js

    Interactive JavaScript Charts built on SVG

    ...You can create a combination of different charts, sync them and give your desired look with unlimited possibilities. Below is an example of synchronized charts with github style. Zoom, Pan, Scroll through data. Make selections and load other charts using those selections. An example showing some interactivity. Another approach to Drill down charts where one selection updates the data of other charts. Annotations allows you to write custom text on specific values or on axes values. Valuable to expand the visual appeal of your chart and make it more informative.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 20
    Redash

    Redash

    Connect to any data source, easily visualize and share your data

    Redash is an essential tool to help you make sense of your data. It allows everyone, regardless of level of technical know-how to harness the power of data. SQL users connect, query, visualize and share data easily and efficiently, allowing everyone in their organization to use the data. Redash combines the power and comfort of an SQL client with the collaborative benefits of a cloud-based service.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    Npgsql

    Npgsql

    Npgsql is the .NET data provider for PostgreSQL

    ...Npgsql is the open source .NET data provider for PostgreSQL. It allows you to connect and interact with PostgreSQL server using .NET.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 22
    QSV

    QSV

    Blazing-fast Data-Wrangling toolkit

    qsv is a fast, command-line CSV data toolkit written in Rust that extends the capabilities of xsv. It’s designed to make working with CSV files at scale easy and efficient, offering over 40 powerful subcommands for tasks like querying, sampling, splitting, deduplicating, and more. qsv is ideal for data engineers, analysts, and developers who need high-performance CSV manipulation on the command line.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    CTGAN

    CTGAN

    Conditional GAN for generating synthetic tabular data

    CTGAN is a collection of Deep Learning based synthetic data generators for single table data, which are able to learn from real data and generate synthetic data with high fidelity. If you're just getting started with synthetic data, we recommend installing the SDV library which provides user-friendly APIs for accessing CTGAN. The SDV library provides wrappers for preprocessing your data as well as additional usability features like constraints. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    XCharts

    XCharts

    A charting and data visualization library for Unity

    A charting and data visualization library for Unity. Unity data visualization chart plugin. A UGUIpowerful, easy-to-use, parameter-configurable data visualization chart plug-in. It supports ten built-in charts. A powerful, easy-to-use, configurable charting and data visualization library for Unity. Visual configuration of parameters, real-time preview of effects, and pure code drawing without additional resources.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    ADX MCP Server

    ADX MCP Server

    A Model Context Protocol (MCP) server that enables AI assistants

    The Azure Data Explorer MCP Server is a Model Context Protocol (MCP) server that enables AI assistants to query and analyze Azure Data Explorer databases through standardized interfaces. It allows the execution of Kusto Query Language (KQL) queries and exploration of data within Azure Data Explorer clusters. ​
    Downloads: 5 This Week
    Last Update:
    See Project