Showing 3125 open source projects for "data"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • Easily Host LLMs and Web Apps on Cloud Run Icon
    Easily Host LLMs and Web Apps on Cloud Run

    Run everything from popular models with on-demand NVIDIA L4 GPUs to web apps without infrastructure management.

    Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure. Cloud Run gives you on-demand GPU access for hosting LLMs and running real-time AI—with 5-second cold starts and automatic scale-to-zero so you only pay for actual usage. New customers get $300 in free credit to start.
    Try Cloud Run Free
  • 1
    Data Science Notes

    Data Science Notes

    Curated collection of data science learning materials

    Data Science Notes is a large, curated collection of data science learning materials, with explanations, code snippets, and structured notes across the typical end-to-end workflow. It spans foundational math and statistics through data wrangling, visualization, machine learning, and practical project organization. The content emphasizes hands-on understanding by pairing narrative notes with runnable examples, making it useful for both self-study and classroom settings. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    pylatexenc

    pylatexenc

    Simple LaTeX parser providing latex-to-unicode and unicode-to-latex

    Simple LaTeX parser providing latex-to-unicode and unicode-to-latex conversion. Python 3.4 or 2.7. The library is designed to be as backward-compatible as reasonably possible and is able to run on old Python versions should it be necessary. (Use the setup.py script directly if you have Python 3.7, poetry doesn't seem to work with old Python versions.) The pylatexenc.latexencode module provides a function unicode_to_latex() which converts a Unicode string into LaTeX text and escape sequences....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Denoiser

    Denoiser

    Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

    ...It uses a causal encoder-decoder architecture with skip connections, optimized with losses defined both in the time domain and frequency domain to better suppress noise while preserving speech. Unlike models that operate on spectrograms alone, this design enables lower latency and coherent waveform output. The implementation includes data augmentation techniques applied to the raw waveforms (e.g. noise mixing, reverberation) to improve model robustness and generalization to diverse noise types. The project supports both offline denoising (batch inference) and live audio processing (e.g. via loopback audio interfaces), making it practical for real-time use in calls or recording. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Big List of Naughty Strings

    Big List of Naughty Strings

    List of strings which have a high probability of causing issues

    The Big List of Naughty Strings is a community-maintained catalog of “gotcha” inputs that commonly break software, from unusual Unicode to SQL and script injection payloads. It exists so developers and QA engineers can easily test edge cases that normal test data would miss, such as zero-width characters, right-to-left marks, emojis, foreign alphabets, and long or malformed strings. By throwing these strings at forms, APIs, databases, and UIs, teams can discover encoding bugs, sanitizer gaps, rendering issues, and security oversights early. The list is language-agnostic and repository-friendly, meaning you can consume it from CI pipelines or local scripts with minimal setup. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Text Gen

    Text Gen

    Almost state of art text generation library

    ...Text gen is a python library that allow you build a custom text generation model with ease. Something sweet built with Tensorflow and Pytorch(coming soon). Load your data, your data must be in a text format. Download the example data from the example folder. Tune your model to know the best optimizer, activation method to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    speedtest-cli

    speedtest-cli

    Command line interface for testing internet bandwidth using speedtest

    ...Test the internet connection of your Linux desktop, a remote server or even lower-powered devices such as the Raspberry Pi with the Speedtest Server Network. Set up automated scripts to collect connection performance data, including trends over time.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    earthengine-py-notebooks

    earthengine-py-notebooks

    A collection of 360+ Jupyter Python notebook examples

    ...Users can quickly adapt the examples for their own remote sensing, environmental monitoring, or spatial data science projects, and can run the code in environments like Google Colab.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    MLOps Course

    MLOps Course

    Learn how to design, develop, deploy and iterate on ML apps

    The MLOps Course by Goku Mohandas is an open-source curriculum that teaches how to combine machine learning with solid software engineering to build production-grade ML applications. It is structured around the full lifecycle: data pipelines, modeling, experiment tracking, deployment, testing, monitoring, and iteration. The repository itself contains configuration, code examples, and links to accompanying lessons hosted on the Made With ML site, which provide detailed narrative explanations and diagrams. Instead of focusing only on model training, the course emphasizes best practices like modular code design, CI/CD, containerization, reproducibility, and responsible ML (including monitoring and feedback loops). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    XLM (Cross-lingual Language Model)

    XLM (Cross-lingual Language Model)

    PyTorch original implementation of Cross-lingual Language Model

    XLM (Cross-lingual Language Model) is a family of multilingual pretraining methods that align representations across languages to enable strong zero-shot transfer. It popularized objectives like Masked Language Modeling (MLM) across many languages and Translation Language Modeling (TLM) that jointly trains on parallel sentence pairs to tighten cross-lingual alignment. Using a shared subword vocabulary, XLM learns language-agnostic features that work well for classification and sequence...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go from Data Warehouse to Data and AI platform with BigQuery Icon
    Go from Data Warehouse to Data and AI platform with BigQuery

    Build, train, and run ML models with simple SQL. Automate data prep, analysis, and predictions with built-in AI assistance from Gemini.

    BigQuery is more than a data warehouse—it's an autonomous data-to-AI platform. Use familiar SQL to train ML models, run time-series forecasts, and generate AI-powered insights with native Gemini integration. Built-in agents handle data engineering and data science workflows automatically. Get $300 in free credit, query 1 TB, and store 10 GB free monthly.
    Try BigQuery Free
  • 10
    BMC

    BMC

    Notes on Scientific Computing for Biomechanics

    This repository is a collection of lecture notes and code on scientific computing and data analysis for Biomechanics and Motor Control.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Semantic Segmentation in PyTorch

    Semantic Segmentation in PyTorch

    Semantic segmentation models, datasets & losses implemented in PyTorch

    Semantic segmentation models, datasets and losses implemented in PyTorch. PyTorch and Torchvision needs to be installed before running the scripts, together with PIL and opencv for data-preprocessing and tqdm for showing the training progress. PyTorch v1.1 is supported (using the new supported tensoboard); can work with earlier versions, but instead of using tensoboard, use tensoboardX. Poly learning rate, where the learning rate is scaled down linearly from the starting value down to zero during training. Considered as the go-to scheduler for semantic segmentation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PS-Drone

    PS-Drone

    Programming a Parrot AR.Drone 2.0 with Python - The Easy Way

    The PS-Drone-API is a full featured SDK, written in and for Python, for Parrot's AR.Drone 2.0. It was designed to be easy to learn, but it offers the full set of the possibilities of the AR.Drone 2.0, including Sensor-Data (aka NavData), Configuration and full Video-support. The video function is not restricted to mere viewing, it is also possible to analyze video images data using OpenCV2. Obviously, the PS-Drone is perfect for teaching purposes; however, even the requirements for professional purposes can be satisfied. PS-Drone comes with a tutorial, explaining its most important commands and the drone's most important sensor values. ...
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    Keepsake

    Keepsake

    Version control for machine learning

    Keepsake is a Python library that uploads files and metadata (like hyperparameters) to Amazon S3 or Google Cloud Storage. You can get the data back out using the command-line interface or a notebook.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    sar2html
    Sar2html is web based frontend for performance monitoring. It converts sar binary data to graphical format and keep historical data in it's database. Project homepage is here: https://github.com/cemtan/sar2html.git Supported Operating Systems: HPUX 11.11, 11.23, 11,31 Solaris 5.9, 5.10, 5.11 Redhat 3, 4, 5, 6, 7 Suse 8, 9, 10, 11, 12 Ubuntu 18, 20 If you have customers facing performance problems on operating systems listed above you may send sar2ascii to collect performance data. ...
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    Pension-Calculator

    Pension-Calculator

    A simple retirement benefit calculator app written in python with GUI.

    A simple retirement benefit calculator app written in python with GUI specifically for West Bengal State Govt. employees (India). This app is not associated with Govt. of West Bengal. This app is for informational purpose only. This project is open source under GNU public license v3.0
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    MetaHash

    MetaHash

    #MetaHash is a next-generation network based on the Blockchain 4.0

    ...The #MetaHash project implements a solution that simultaneously possesses all the qualities of Blockchain new generation, namely: - Decentralization, which ensures absolute trust of users to the network; - Full synchronization, thanks to which each user has instant access to the most relevant data; - А transaction speed of up to 3 seconds, which provides unconditional ease of system using and a guarantee of the full operation of decentralized applications; - High bandwidth, which allows the network to keep transaction prices low due to the fact that a similar amount of resources can confirm more transactions per unit time; - Low commissions, due to which a huge number of potential customers will be interested in using the network for transfer and storage of payment means and technical data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    flatten_json

    flatten_json

    Flatten JSON in Python

    Flattens JSON objects in Python. flatten_json flattens the hierarchy in your object which can be useful if you want to force your objects into a table.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    pyment

    pyment

    Format and convert Python docstrings and generates patches

    Create, update or convert docstrings in existing Python files, managing several styles. This Python3 program intends to help Python programmers to enhance inside code documentation using docstrings. It is useful for code not well documented, or code without docstrings, or some not yet or partially documented code, or a mix of all of this. It can be helpful also to harmonize or change a project docstring style format. It will parse one or several python scripts and retrieve existing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ipfs-api-mount

    ipfs-api-mount

    Mount IPFS directory as local FS

    Mount IPFS directory as local FS. go-ipfs daemon has this function but as of version 0.9.1 it's slow. ipfs-api-mount aims to be more efficient. For sequential access to random data it's ~3 times slower than ipfs cat but also ~20 times faster than cating files mounted by go-ipfs. It's supposed that FS mounted by go-ipfs daemon is slow because of file structure being accessed in every read. By adding caching one can improve performance a lot. Apart from mounting one specified CID you can also mount whole IPFS namespace. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    TRACARDI - Customer Data Platform

    TRACARDI - Customer Data Platform

    TRACARDI free open-source customer data platform

    TRACARDI is easy to use and free GUI for Apache Unomi. Unomi is an open source Customer Data Platform that allows anyone to collect user profiles and manage them in a very robust way. TRACARDI with is API first approach enables you to collect data from multiple channels. Regardless if it is web site, mobile app or CRM system open Api let you send data for further processing. Integrate data into one consistent user profile.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Objectron

    Objectron

    A dataset of short, object-centric video clips

    The Objectron dataset is a collection of short, object-centric video clips, which are accompanied by AR session metadata that includes camera poses, sparse point-clouds and characterization of the planar surfaces in the surrounding environment. In each video, the camera moves around the object, capturing it from different angles. The data also contain manually annotated 3D bounding boxes for each object, which describe the object’s position, orientation, and dimensions. The dataset consists of 15K annotated video clips supplemented with over 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    BerryNet

    BerryNet

    Deep learning gateway on Raspberry Pi and other edge devices

    ...Further, multiple edge devices can create a distributed AIoT network. At DT42, we believe that bringing deep learning to edge devices is the trend towards the future. It not only saves costs of data transmission and storage but also makes devices able to respond according to the events shown in the images or videos without connecting to the cloud. One of the applications of this intelligent gateway is to use the camera to monitor the place you care about. For example, Figure 3 shows the analyzed results from the camera hosted in the DT42 office. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Ansible Config Management Database

    Ansible Config Management Database

    Generate host overview from ansible fact gathering output

    Ansible-cmdb takes the output of Ansible's fact gathering and converts it into a static HTML overview page (and other things) containing system configuration information. It supports multiple types of output (HTML, CSV, SQL, etc) and extends information gathered by Ansible with custom data. For each host, it also shows the groups, host variables, custom variables and machine-local facts. Some templates, such as txt_table and html_fancy, support columns. If a template supports columns, you can use the --columns / -c command-line option to specify which columns to show. The --columns take a comma-separated list of columns (no spaces!) ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    WiFiSafe

    Small utility software conceived to share WiFi connections between com

    ...You have successfully shared the password-protected WiFi connections from the first computer into the second one. Warning: the software will create two folders, "profiles" and "temp", do not remove those folders during use. The encrypted data will be stored within the profiles folder
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    matrislib is a general data processing library, containing : - files reading/writting - data plotting - general process on signals and images. It also contain an "app" package that gives the user a full Software based on matrislib. This software is called "matris" and provide an GUI to make data processing. User guide is in written in French. Licensed under the terms of the CECILL License (see matrislib.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB