Showing 629 open source projects for "data"

  • 1
    SageMaker Containers

    Create SageMaker-compatible Docker containers

    Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code. A container provides an effectively isolated environment, ensuring a consistent runtime and reliable training process.
    Downloads: 0 This Week
  • 2
    Brand new cheatsheets and handouts

    Matplotlib 3.1 cheat sheet

    ...It lays out common use cases (plot types, styling, figure configuration, saving/exporting, subplot layout, etc.) in a concise and organized format — often serving as a “cheat sheet” for rapid look-up. For practitioners working on data-heavy projects, dashboards, or research code where plotting is frequent, it helps speed up development by reducing context-switching and documentation navigation overhead. It is especially useful when you know roughly what you want (e.g. “I need a scatter + histogram marginal plot”) but don’t remember the exact Matplotlib call.
    Downloads: 1 This Week
  • 3
    Tensor2Tensor

    Library of deep learning models and datasets

    Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...
    Downloads: 1 This Week
  • 4
    Albedo

    A recommender system for discovering GitHub repos

    ...It treats repositories and developers as a graph of interactions and applies large-scale matrix factorization to model affinities, with Apache Spark providing the distributed data processing. The project focuses on implicit feedback—stars, watches, and other engagement metrics—so it can build useful recommendations without explicit ratings. A reproducible setup and Makefile-driven workflow streamline tasks like spinning up services, loading datasets, training models, and generating candidate lists. Because it’s built around Spark’s scalable primitives, Albedo can experiment on substantial snapshots of GitHub metadata rather than toy corpora. ...
    Downloads: 0 This Week
  • 5
    Alfred-Workflow

    Full-featured library for writing Alfred 3 & 4 workflows

    Alfred-Workflow is a Python helper library for Alfred 2, 3 and 4 workflow authors, developed and hosted on GitHub. Alfred workflows typically take user input, fetch data from the Web or elsewhere, filter it, and display results to the user. Alfred-Workflow takes care of a lot of the details for you, allowing you to concentrate your efforts on your workflow’s functionality. Alfred-Workflow supports macOS 10.7+ (Python 2.7). Easily launch background tasks (daemons) to keep your workflow responsive. ...
    Downloads: 0 This Week
  • 6
    Coronavirus Tracker API

    API for tracking the global coronavirus (COVID-19, SARS-CoV-2)

    Provides up-to-date data about the coronavirus outbreak, including the numbers of confirmed cases, deaths, and recoveries. Supports multiple data sources; 3 are currently available. Select one with the source parameter, for example ?source=nyt. The NYT source also provides a time series: to view timelines of cases by US county, use ?source=nyt&timelines=true.
    Downloads: 0 This Week
  • 7
    TensorFlow Object Counting API

    The TensorFlow Object Counting API is an open source framework

    ...Please get in touch if you need a professional object detection, tracking, and counting project with high accuracy and reliability. You can train TensorFlow models with your own training data to build your own custom object counter system. If you want to learn how to do it, please check one of the sample projects, which cover some of the theory of transfer learning and show how to apply it in useful projects. Development is in progress: the API will be updated soon, and a more capable and lightweight API will be available in this repo. ...
    Downloads: 1 This Week
  • 8
    Pinject

    A pythonic dependency injection library

    ...Because bindings are just Python functions and classes, refactoring remains straightforward and the DI graph is easy to reason about. Pinject is particularly useful for medium-to-large services where configuration, logging, data clients, and business logic need clean separation without resorting to manual plumbing.
    Downloads: 1 This Week
  • 9
    An open source IEC 61131-3 integrated development environment, providing PLCopen SoftPLC programming, CANopen I/O, and an SVG-based HMI.
    Downloads: 0 This Week
  • 10
    gditools

    A Python program/library aimed at GD-ROM image files.

    This Python program/library is designed to handle GD-ROM image (GDI) files. It can be used to list files, extract data, generate a sorttxt file, extract the bootstrap (IP.BIN) file, and more. This project can be used in standalone mode, in interactive mode, or as a library in another Python program (check the 'addons' folder to learn how). For your convenience, you can use the optional gditools.py GUI program supplied in the Files section. To use this project you must install the Python 2.7.x branch release binaries. ...
    Downloads: 16 This Week
  • 11
    jsonfield

    A reusable Django model field for storing ad-hoc JSON data

    jsonfield is a reusable model field that allows you to store validated JSON, automatically handling serialization to and from the database. To use, add jsonfield.JSONField to one of your models. Note: django.contrib.postgres now supports PostgreSQL's jsonb type, which includes extended querying capabilities. If you're an end user of PostgreSQL and want full-featured JSON support, then it is recommended that you use the built-in JSONField. However, jsonfield is still useful when your app...
    Downloads: 0 This Week
  • 12
    Slacker

    Full-featured Python interface for the Slack API

    Slacker is a full-featured Python interface for the Slack API.
    Downloads: 0 This Week
  • 13
    CrypTen

    A framework for Privacy Preserving Machine Learning

    ...Although not yet production-ready, CrypTen focuses on advancing real-world secure ML applications, such as training and inference over private datasets, without exposing sensitive data.
    Downloads: 0 This Week
  • 14
    The TRANSIMS Studio application is an integrated development environment for the TRansportation ANalysis and SIMulation System (TRANSIMS). Components include a runtime environment for executing TRANSIMS in parallel, as well as a full-featured GUI.
    Downloads: 1 This Week
  • 15
    DEBay

    Deconvolutes qPCR data to estimate cell-type-specific gene expression

    DEBay (Deconvolution of Ensemble through Bayes-approach) estimates cell type-specific gene expression by deconvolution of quantitative PCR data from a mixed population. It will be useful in experiments where the segregation of different cell types in a sample is arduous, but the proportion of different cell types in the sample can be measured. DEBay uses the population distribution data and the qPCR data to calculate the relative expression of the target gene in different cell types in the sample. ...
    Downloads: 0 This Week
  • 16
    Graph Nets library

    Build Graph Nets in TensorFlow

    Graph Nets, developed by Google DeepMind, is a Python library designed for constructing and training graph neural networks (GNNs) using TensorFlow and Sonnet. It provides a high-level, flexible framework for building neural architectures that operate directly on graph-structured data. A graph network takes graphs as inputs, consisting of edges, nodes, and global attributes, and produces updated graphs with modified feature representations at each level. This library implements the foundational ideas from DeepMind’s paper “Relational Inductive Biases, Deep Learning, and Graph Networks”, offering tools to explore relational reasoning and message-passing neural networks. ...
    Downloads: 2 This Week
  • 17
    Python basic brick interface
    This project is a set of useful classes for managing data from the database to the user interface. The panel is created from a dictionary that matches the DB table.
    Downloads: 1 This Week
  • 18
    RecNN

    Reinforced Recommendation toolkit built around PyTorch 1.7

    This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.
    Downloads: 1 This Week
  • 19

    Python DB-API 2.0 module for ADO

    Use Microsoft ADO to read/write most databases.

    Python module that makes it easy to use Microsoft ADO for connecting with databases and other data sources. For CPython users, this module is included as part of pywin32 (https://github.com/mhammond/pywin32), and continued support will take place there. This fork will be maintained for IronPython users. Documentation is available at http://adodbapi.sourceforge.net/quick_reference.pdf or in quick_reference.odb in the .zip
    Downloads: 4 This Week
  • 20
    PixieDust

    Python Helper library for Jupyter Notebooks

    PixieDust is an open source Python helper library that works as an add-on to Jupyter notebooks to improve the user experience of working with data. It also fills a gap for users who have no access to configuration files when a notebook is hosted on the cloud.
    Downloads: 0 This Week
  • 21

    Optimized Storage for temporal Data

    open Optimized Storage of time series data

    Beta version. Base class for optimized storage of time series data. Works with any kind of relational database. Cross-platform, with support for multiple languages (C++, C#, Java). Conditional storage based on value variation, controlled by the DeltaValue and DeltaTime parameters. Data can be retrieved without loss.
    Downloads: 0 This Week
  • 22
    DocBook Authoring and Publishing Suite

    DocBook Publishing Made Easy

    The DAPS project has moved to https://github.com/openSUSE/daps. The SUSE XSL Stylesheets have moved to https://github.com/openSUSE/suse-xsl. To join the discussion, go to https://github.com/openSUSE/daps/discussions.
    Downloads: 0 This Week
  • 23
    Twint

    An advanced Twitter scraping & OSINT tool written in Python

    Twint is an advanced open-source Twitter scraping and OSINT tool written in Python that extracts tweets, user data, followers, likes, and more—without relying on Twitter’s API—making it highly useful for researchers, analysts, and hobbyists who want to bypass rate limits and access public Twitter data.
    Downloads: 0 This Week
  • 24
    yapydata

    Lower-Layer unified data - JSON, XML, YAML + INI, CFG, properties

    yapydata (Yet Another Python Data) provides a unified interface for accessing various data syntaxes. It encapsulates the underlying libraries behind a common API, with the canonical internal data held as a JSON-compatible Python in-memory structure. It is intended in particular for the lower layers of the software stack, including setup tools, and therefore uses only standard libraries whenever possible.
    Downloads: 0 This Week
  • 25
    abu

    Abu quantitative trading system (stocks, options, futures, bitcoin)

    The Abu quantitative system integrates AI and big data with a K-line pattern system, a classic indicator system, a trend analysis system, a time series dimension system, a statistical probability system, and a traditional moving average system to conduct in-depth quantitative analysis of investment instruments. It lets users skip the stage of writing complex quantification code, which makes it more suitable for ordinary users, and moves toward the era of vectorization 2.0.
    Downloads: 0 This Week