data free download - SourceForge

Showing 30500 open source projects for "data"

View related business solutions

Outgrown Windows Task Scheduler?
Free diagnostic identifies where your workflow is breaking down—with instant analysis of your scheduling environment.

Windows Task Scheduler wasn't built for complex, cross-platform automation. Get a free diagnostic that shows exactly where things are failing and provides remediation recommendations. Interactive HTML report delivered in minutes.

Download Free Tool
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
1

data.table

Extends base R’s data for high-performance data manipulation

data.table is an R package that extends base R’s data.frame for high-performance data manipulation. It offers concise syntax, blazing speed, and memory-efficient operations. It supports fast file reading/writing, joins, grouping, reshaping, and updates by reference. It is heavily used in large data workflows, big data in R, production pipelines, etc. Extremely efficient grouping/aggregation/summarization; can handle very large datasets (hundreds of millions to billions of rows) in memory (if available). ...

Downloads: 2 This Week

Last Update: 2026-01-27
See Project
2

Data Formulator

Create rich visualizations with AI

To create rich visualizations, data analysts often need to iterate back and forth among data processing and chart specification to achieve their goals. To achieve this, analysts need not only proficiency in data transformation and visualization tools but also efforts to manage the branching history consisting of many different versions of data and charts. Recent LLM-powered AI systems have greatly improved visualization authoring experiences, for example by mitigating manual data transformation barriers via LLMs' code generation ability. ...

Downloads: 1 This Week

Last Update: 2026-01-27
See Project
3

Laravel Data

Powerful data objects for Laravel

This package enables the creation of rich data objects which can be used in various ways. Using this package you only need to describe your data once.

Downloads: 4 This Week

Last Update: 2026-01-28
See Project
4

MDN data

This repository contains general data for Web technologies

This repository contains general data for Web technologies and is maintained by the MDN team at Mozilla.

Downloads: 4 This Week

Last Update: 7 days ago
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

Explorer

Series (one-dimensional) and dataframes (two-dimensional)

Explorer brings series (one-dimensional) and data frames (two-dimensional) to Elixir for fast data exploration.

Downloads: 0 This Week

Last Update: 2025-08-17
See Project
6

data-diff

Efficiently diff rows across two different databases

We're excited to announce the launch of a new open-source product, data-diff that makes comparing datasets across databases fast at any scale. data-diff automates data quality checks for data replication and migration. In modern data platforms, data is constantly moving between systems, and at the modern data volume and complexity, systems go out of sync all the time. Until now, there has not been any tooling to ensure that when the data is correctly copied. ...

Downloads: 0 This Week

Last Update: 2024-02-20
See Project
7

Orange Data Mining

Orange: Interactive data analysis

Open source machine learning and data visualization. Build data analysis workflows visually, with a large, diverse toolbox. Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Even your multidimensional data can become sensible in 2D, especially with clever attribute ranking and selections. ...

Downloads: 47 This Week

Last Update: 2025-12-20
See Project
8

Micronaut Data

Ahead of Time Data Repositories

...The problem is worse when combined with Hibernate which maintains its own meta-model as you end up with duplicate meta-models. Micronaut Data instead moves this model into the compiler. Both GORM and Spring Data use regular expressions and pattern matching in combination with runtime generated proxies to translate a method definition on a Java interface into a query at runtime. No such runtime translation exists in Micronaut Data and this work is carried out by the Micronaut compiler at compilation time.

Downloads: 1 This Week

Last Update: 2026-01-26
See Project
9

Profile Data

Analyze computation-communication overlap in V3/R1

profile-data is a repository that publishes profiling traces and metrics from DeepSeek’s training and inference infrastructure (especially during DeepSeek-V3 / R1 experiments). The profiling data targets insights into computation-communication overlap, pipeline scheduling (e.g. DualPipe), and how MoE / EP / parallelism strategies interact in real systems.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)

Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.

Learn More
10

atk4/data

Data Access PHP Framework for SQL & high-latency databases

ATK Data is a data persistence and modeling framework for PHP, developed as part of the Agile Toolkit. It provides a high-level abstraction for working with databases, making it easier to define and manipulate data models with minimal boilerplate code. It supports various SQL and NoSQL databases and integrates seamlessly with Agile UI and other PHP frameworks.

Downloads: 0 This Week

Last Update: 2025-04-22
See Project
11

Form-Data

A module to create readable `"multipart/form-data"` streams

A library to create readable "multipart/form-data" streams. Can be used to submit forms and file uploads to other web applications.

Downloads: 0 This Week

Last Update: 2025-07-17
See Project
12

Data-Juicer

Data processing for and with foundation models

Data-Juicer is an open-source data processing and augmentation framework designed to enhance the quality and diversity of datasets for machine learning tasks. It includes a modular pipeline for scalable data transformation.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
13

Dynamic Data

Reactive collections based on Rx.Net

...However, typical applications are much more complicated and may apply a filter, transform the original dto and apply a sort. Even with these simple everyday operations, the complexity of the code is quickly magnified. Dynamic data has been developed to remove the tedious code of dynamically maintaining collections. It has grown to become functionally very rich with at least 60 collection-based operations which amongst other things enable filtering, sorting, grouping, joining different sources, transforms, binding, pagination, data virtualization, expiration, disposal management plus more.

Downloads: 0 This Week

Last Update: 2025-06-03
See Project
14

Azure Data Studio

A data management tool that enables working with other SQL tools

Azure Data Studio is a cross-platform database tool for data professionals who use on-premises and cloud data platforms on Windows, macOS, and Linux. Azure Data Studio offers a modern editor experience with IntelliSense, code snippets, source control integration, and an integrated terminal. It's engineered with the data platform user in mind, with the built-in charting of query result sets and customizable dashboards.

Downloads: 7 This Week

Last Update: 2025-06-12
See Project
15

sq data wrangler

sq data wrangler

sq is a command line tool that provides jq-style access to structured data sources: SQL databases, or document formats like CSV or Excel. sq executes jq-like queries, or database-native SQL. It can join across sources: join a CSV file to a Postgres table, or MySQL with Excel. sq outputs to a multitude of formats including JSON, Excel, CSV, HTML, Markdown and XML, and can insert query results directly to a SQL database. sq can also inspect sources to view metadata about the source structure (tables, columns, size). ...

Downloads: 2 This Week

Last Update: 6 days ago
See Project
16

Synthetic Data Kit

Tool for generating high quality Synthetic datasets

Synthetic Data Kit is a CLI-centric toolkit for generating high-quality synthetic datasets to fine-tune Llama models, with an emphasis on producing reasoning traces and QA pairs that line up with modern instruction-tuning formats. It ships an opinionated, modular workflow that covers ingesting heterogeneous sources (documents, transcripts), prompting models to create labeled examples, and exporting to fine-tuning schemas with minimal glue code.

Downloads: 1 This Week

Last Update: 2025-10-25
See Project
17

NYC Taxi Data

Import public NYC taxi and for-hire vehicle (Uber, Lyft)

The nyc-taxi-data repository is a rich dataset and exploratory project around New York City taxi trip records. It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). ...

Downloads: 2 This Week

Last Update: 2025-10-01
See Project
18

browser-compat-data

This repository contains compatibility data for Web technologies

The browser-compat-data ("BCD") project contains machine-readable browser (and JavaScript runtime) compatibility data for Web technologies, such as Web APIs, JavaScript features, CSS properties, and more. Our goal is to document accurate compatibility data for Web technologies, so web developers may write cross-browser compatible websites more easily. BCD is used in web apps and software such as MDN Web Docs, CanIUse, Visual Studio Code, WebStorm and more.

Downloads: 2 This Week

Last Update: 6 days ago
See Project
19

Data Annotator for Machine Learning

Data annotator for machine learning

Data annotator for machine learning allows you to centrally create, manage and administer annotation projects for machine learning. Data Annotator for Machine Learning (DAML) is an application that helps machine learning teams facilitate the creation and management of annotations. Active learning with uncertain sampling to query unlabeled data.

Downloads: 0 This Week

Last Update: 2024-03-11
See Project
20

AWESOME DATA SCIENCE

Awesome Data Science repository to learn and apply for real world

An open source Data Science repository to learn and apply towards solving real world problems. This is a shortcut path to start studying Data Science. Just follow the steps to answer the questions, "What is Data Science and what should I study to learn Data Science?" Data Science is one of the hottest topics on the Computer and Internet farmland nowadays.

Downloads: 0 This Week

Last Update: 6 days ago
See Project
21

Spring Data Neo4j

Provide support to increase developer productivity in Java

...The template programming model is equivalent to other Spring templates and builds the basis for interaction with the graph and is also used for the Spring Data repository support. Spring Data Neo4j is a core part of the Spring Data project which aims to provide convenient data access for NoSQL databases. Spring Data builds on Spring Framework, check the spring.io web-site for a wealth of reference documentation. If you are just starting out with Spring, try one of the guides.

Downloads: 0 This Week

Last Update: 2026-01-16
See Project
22

Spring Data REST

Simplifies building hypermedia-driven REST web services

Spring Data REST is part of the umbrella Spring Data project and makes it easy to build hypermedia-driven REST web services on top of Spring Data repositories. Spring Data REST builds on top of Spring Data repositories, analyzes your application’s domain model and exposes hypermedia-driven HTTP resources for aggregates contained in the model.

Downloads: 0 This Week

Last Update: 2026-01-16
See Project
23

Spring Data MongoDB

Provide support to increase developer productivity in Java

The primary goal of the Spring Data project is to make it easier to build Spring-powered applications that use new data access technologies such as non-relational databases, map-reduce frameworks, and cloud-based data services. The Spring Data MongoDB project aims to provide a familiar and consistent Spring-based programming model for new datastores while retaining store-specific features and capabilities.

Downloads: 0 This Week

Last Update: 2026-01-16
See Project
24

Spring Data Redis

Provides support to increase developer productivity in Java

Provides support to increase developer productivity in Java when using Redis, a key-value store. Uses familiar Spring concepts such as a template class for core API usage and lightweight repository-style data access. The primary goal of the Spring Data project is to make it easier to build Spring-powered applications that use new data access technologies such as non-relational databases, map-reduce frameworks, and cloud-based data services. Connection package as low-level abstraction across multiple Redis drivers (Lettuce and Jedis). Exception translation to Spring’s portable Data Access exception hierarchy for Redis driver exceptions. ...

Downloads: 0 This Week

Last Update: 2026-01-16
See Project
25

Spring Data JPA

Simplifies the development of creating a JPA-based data access layer

Spring Data JPA, part of the larger Spring Data family, makes it easy to easily implement JPA-based repositories. This module deals with enhanced support for JPA-based data access layers. It makes it easier to build Spring-powered applications that use data access technologies. Implementing a data access layer of an application has been cumbersome for quite a while.

Downloads: 0 This Week

Last Update: 2026-01-16
See Project