Page 3 | you free download

Showing 110 open source projects for "you"

View related business solutions

Data Management Python Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

Old File Delete

Clean up old files with a single click.

OldFileDelete (OFD) is a lightweight and efficient utility designed for those who value minimalism and order. The app helps you instantly clear selected folders of accumulated digital clutter. Featuring a modern flat design, the interface is intuitive: simply select a folder, specify the number of days, and the program will find and remove outdated files. No complex settings—just cleanliness and speed.

Downloads: 0 This Week

Last Update: 2026-01-05
See Project
2

seaborn

Statistical data visualization in Python

Seaborn is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics. Seaborn helps you explore and understand your data. Its plotting functions operate on dataframes and arrays containing whole datasets and internally perform the necessary semantic mapping and statistical aggregation to produce informative plots. Its dataset-oriented, declarative API lets you focus on what the different elements of your plots mean, rather than on the details of how to draw them. ...

Downloads: 0 This Week

Last Update: 2024-01-25
See Project
3

dbt-re-data

re_data - fix data issues before your users & CEO would discover them

re_data is an open-source data reliability framework for the modern data stack. Currently, re_data focuses on observing the dbt project (together with underlaying data warehouse - Postgres, BigQuery, Snowflake, Redshift). Data transformations in re_data are implemented and exposed as models & macros in this dbt package. Gather all relevant outputs about your data in one place using our cloud. Invite your team and debug it easily from there. Go back in time, and see your past metadata. Set up...

Downloads: 0 This Week

Last Update: 2023-12-21
See Project
4

Ubix Linux

The Pocket Datalab

...Ubix Linux is easily accessible, resource-efficient and completely portable on a simple USB key. Ubix Linux is a perfect toolset for learning data analysis and artificial intelligence basics on small to medium datasets. You can find additional information, technical guidance, and user credentials on the project website https://ubix-linux.sourceforge.io/ or on the project subreddit https://reddit.com/r/UbixLinux.

Downloads: 1 This Week

Last Update: 2024-03-08
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
5

PipeRider

Code review for data in dbt

You can compare two previously generated reports or use a single command to compare the differences between the current branch and the main branch. The latter is designed specifically for code review scenarios. In our pull requests on GitHub, we not only want to know which files have been changed, but also the impact of these changes on the data.

Downloads: 0 This Week

Last Update: 2023-11-22
See Project
6

SageMaker Inference Toolkit

Serve machine learning models within a Docker container

Serve machine learning models within a Docker container using Amazon SageMaker. Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. Once you have a trained model, you can include it in a Docker container that runs your inference code. A container provides an effectively isolated environment, ensuring a consistent runtime regardless of where the container is deployed. ...

Downloads: 0 This Week

Last Update: 2023-10-25
See Project
7

scArches

Reference mapping for single-cell genomics

Single-cell architecture surgery (scArches) is a package for reference-based analysis of single-cell data. scArches allows your single-cell query data to be analyzed by integrating it into a reference atlas. By mapping your data into an integrated reference you can transfer cell-type annotation from reference to query, identify disease states by mapping to healthy atlas, and advanced applications such as imputing missing data modalities or spatial locations.

Downloads: 0 This Week

Last Update: 2023-06-13
See Project
8

TexGen

TexGen is a geometric textile modelling software package to be used for obtaining engineering properties of woven textiles and textile composites. Citing TexGen We would be grateful if you could acknowledge use of TexGen where appropriate and suggest using one of the following references: L P Brown and A C Long. "Modelling the geometry of textile reinforcements for composites: TexGen", Chapter 8 in "Composite reinforcements for optimum performance (Second Edition)", ed. P Boisse, Woodhead Publishing Ltd, 2021, ISBN: 978-0-12-819005-0. https://doi.org/10.1016/B978-0-12-819005-0.00008-3 Lin, H., Brown, L. ...

2 Reviews

Downloads: 70 This Week

Last Update: 2025-06-10
See Project
9

Tributary

Streaming reactive and dataflow graphs in Python

Tributary is a library for constructing dataflow graphs in Python. Unlike many other DAG libraries in Python (airflow, luigi, prefect, dagster, dask, kedro, etc), tributary is not designed with data/etl pipelines or scheduling in mind. Instead, tributary is more similar to libraries like mdf, loman, pyungo, streamz, or pyfunctional, in that it is designed to be used as the implementation for a data model. One such example is the greeks library, which leverages tributary to build data models...

Downloads: 0 This Week

Last Update: 2023-06-12
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
10

Swiple

Swiple enables you to easily observe, understand, validate data

...Experience hassle-free integration with Swiple's zero-infrastructure and zero-code setup. Seamlessly incorporate data quality checks into your existing workflows without any coding or infrastructure changes, allowing you to focus on what matters most - your data. Save engineers weeks of time generating data quality checks. Swiple analyzes your dataset and builds data quality checks based on what is observed in your data. You just pick the ones you want.

Downloads: 0 This Week

Last Update: 2023-06-12
See Project
11

missingno

Missing data visualization module for Python

Messy datasets? Missing values? missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset. Just pip install missingno to get started. This quickstart uses a sample of the NYPD Motor Vehicle Collisions Dataset dataset. The msno.matrix nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion. ...

Downloads: 1 This Week

Last Update: 2023-02-26
See Project
12

Visdom

A tool for creating, organizing, and sharing data visualizations

...Organize your visualization space programmatically or through the UI to create dashboards for live data, inspect results of experiments, or debug experimental code. Visdom has a simple set of features that can be composed for various use-cases. The UI begins as a blank slate, you can populate it with plots, images, and text. These appear in windows that you can drag, drop, resize, and destroy. The windows live in envs and the state of envs is stored across sessions. You can download the content of windows, including your plots in svg.

Downloads: 0 This Week

Last Update: 2023-02-08
See Project
13

SQLBucket

Lightweight library to write, orchestrate and test your SQL ETL

...It gives the possibility to set variables and introduces some control flow using the fantastic Jinja2 library. It also implements a very simplistic unit and integration test framework where you can validate the results of your ETL in the form of SQL checks. With SQLBucket, you can apply TDD principles when writing data pipelines. To start working, you need to instantiate your SQLBucket core object with the project_folder parameter. That folder will contain all your SQL ETL. The python file where you create your SQLBucket object is also a good place to instantiate your command line interface.

Downloads: 0 This Week

Last Update: 2023-06-12
See Project
14

rocket-bi

An open-source web-based self-service BI for analytical databases

...It enables data analysts and business users alike to easily integrate different data sources, perform advanced data analysis, ad hoc, and more. With an easy-to-use editor, you can create personalized reports, build interactive business dashboards and generate actionable business insights. Rocket.BI also allows collaboration as working together with other people in the organization.

Downloads: 1 This Week

Last Update: 2023-01-19
See Project
15

Bloxs

Build dashboards in Jupyter Notebook with numeric and chart boxes

Bloxs is a simple Python package that helps you display information in an attractive way (formed in blocks). Perfect for building dashboards, reports and apps in the notebook.

Downloads: 0 This Week

Last Update: 2023-12-21
See Project
16

AWS Step Functions Data Science SDK

For building machine learning (ML) workflows and pipelines on AWS

The AWS Step Functions Data Science SDK is an open-source library that allows data scientists to easily create workflows that process and publish machine learning models using Amazon SageMaker and AWS Step Functions. You can create machine learning workflows in Python that orchestrate AWS infrastructure at scale, without having to provision and integrate the AWS services separately. The best way to quickly review how the AWS Step Functions Data Science SDK works is to review the related example notebooks. These notebooks provide code and descriptions for creating and running workflows in AWS Step Functions Using the AWS Step Functions Data Science SDK. ...

Downloads: 0 This Week

Last Update: 2022-07-07
See Project
17

XISMuS

X-Ray Imaging Software for Multiple Samples

...IMPORTANT FIXES in respect to base v2.0.0 version: v.2.5.0 introduces the Differential Attenuation and Cube Viewer utilities, and migrates user database to *.json files v2.4.3 fixes a with K element in the fit-approx method v2.4.3 fixes and issue where saving plots with fit-approx or a auto-wizard could freeze the software v2.4.2 introduces Image Viewer to Mosaic v2.4.1 fixes an issue in merging H5 or EDF datasets with Mosaic Full changelog at https://linssab.github.io/history X-Ray Fluorescence Imaging Software for Multiple Samples is an open source software to manipulate and study macro-X-Ray Fluorescence (MA-XRF) datasets. XISMuS also works as a sample management tool, where you can easily change between datasets (samples) and compare, cross-interact and normalize them.

Downloads: 0 This Week

Last Update: 2022-03-13
See Project
18

StreamAlert

StreamAlert is a serverless, realtime data analysis framework

StreamAlert is a serverless, real-time data analysis framework that empowers you to ingest, analyze, and alert on data from any environment, using data sources and alerting logic you define. Computer security teams use StreamAlert to scan terabytes of log data every day for incident detection and response. Incoming log data will be classified and processed by the rules engine. Alerts are then sent to one or more outputs.

Downloads: 0 This Week

Last Update: 2022-04-01
See Project
19

ML workspace

All-in-one web-based IDE specialized for machine learning

All-in-one web-based development environment for machine learning. The ML workspace is an all-in-one web-based IDE specialized for machine learning and data science. It is simple to deploy and gets you started within minutes to productively built ML solutions on your own machines. This workspace is the ultimate tool for developers preloaded with a variety of popular data science libraries (e.g., Tensorflow, PyTorch, Keras, Sklearn) and dev tools (e.g., Jupyter, VS Code, Tensorboard) perfectly configured, optimized, and integrated. ...

Downloads: 2 This Week

Last Update: 2022-07-12
See Project
20

Jupytab

Display in Tableau data from Jupyter notebooks

Jupytab allows you to explore in Tableau data which is generated dynamically by a Jupyter Notebook. You can thus create Tableau data sources in a very flexible way using all the power of Python. This is achieved by having Tableau access data through a web server created by Jupytab.

Downloads: 0 This Week

Last Update: 2024-01-09
See Project
21

MMdnn

Tools to help users inter-operate among deep learning frameworks

...The "MM" stands for model management, and "dnn" is the acronym of deep neural network. We implement a universal converter to convert DL models between frameworks, which means you can train a model with one framework and deploy it with another. During the model conversion, we generate some code snippets to simplify later retraining or inference. We provide a model collection to help you find some popular models. We provide a model visualizer to display the network architecture more intuitively. We provide some guidelines to help you deploy DL models to another hardware platform.

Downloads: 0 This Week

Last Update: 2021-09-30
See Project
22

repo2docker GitHub Action

A GitHub action to build data science environment images

Trigger repo2docker to build a Jupyter enabled Docker image from your GitHub repository and push this image to a Docker registry of your choice. This will automatically attempt to build an environment from configuration files found in your repository. Images generated by this action are automatically tagged with both latest and <SHA> corresponding to the relevant commit SHA on GitHub. Both tags are pushed to the Docker registry specified by the user. If an existing image with the latest tag...

Downloads: 0 This Week

Last Update: 2024-01-08
See Project
23

SageMaker Containers

Create SageMaker-compatible Docker containers

Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code. A container provides an effectively isolated environment, ensuring a consistent runtime and reliable training process.

Downloads: 0 This Week

Last Update: 2022-07-12
See Project
24

NYCOpenData-Profiling-Analysis

Open Data Profiling, Quality and Analysis on NYC OpenData dataset

Open data often comes with little or no metadata. You will profile a large collection of open data sets and derive metadata that can be used for data discovery, querying, and identification of data quality problems. For each column, identify and summarize the semantic types present in the column. These can be generic types (e.g., city, state) or collection-specific types (NYU school names, NYC agency).

Downloads: 0 This Week

Last Update: 2023-06-12
See Project
25

nonechucks

Deal with bad samples in your dataset dynamically

nonechucks is a library that provides wrappers for PyTorch's datasets, samplers and transforms to allow for dropping unwanted or invalid samples dynamically. What if you have a dataset of 1000s of images, out of which a few dozen images are unreadable because the image files are corrupted? Or what if your dataset is a folder full of scanned PDFs that you have to OCRize, and then run a language detector on the resulting text, because you want only the ones that are in English? Or maybe you have an AlternateIndexSampler, and you want to be able to move to dataset[6] after dataset[4] fails while attempting to load! ...

Downloads: 0 This Week

Last Update: 2023-06-12
See Project