Showing 629 open source projects for "data"

  • 1
    Grow.dev

    A declarative website generator designed for high-quality websites

    Grow.dev is a static site generator optimized for building highly interactive, localized microsites. Grow.dev focuses on providing optimal workflows and developer ergonomics for creating projects that are highly maintainable in the long term. Grow.dev encourages a strong but simple separation of content and presentation and makes maintaining content in different locales and environments a snap.
    Downloads: 0 This Week
  • 2
    Fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python

    Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks. We provide reference implementations of various sequence modeling papers. Recent work by Microsoft and Google has shown that data parallel training can be made significantly more efficient by sharding the model parameters and optimizer state across data parallel workers. These ideas are encapsulated in the new FullyShardedDataParallel (FSDP) wrapper provided by fairscale. Fairseq can be extended through user-supplied plug-ins. Models define the neural network architecture and encapsulate all of the learnable parameters. ...
    Downloads: 0 This Week
  • 3
    Whisper Library

    Whisper is a file-based time-series database format for Graphite

    Whisper is one of three components within the Graphite project. Whisper is a fixed-size database, similar in design and purpose to RRD (round-robin database). It provides fast, reliable storage of numeric data over time. Whisper allows higher-resolution (seconds per point) recent data to degrade into lower resolutions for long-term retention of historical data. An included utility copies data from src into dst where it is missing. Unlike whisper-merge, it does not overwrite data already present in the target file; it only adds the missing data (i.e. where the gaps in the target file are). ...
    Downloads: 0 This Week
  • 4
    Blankly

    Easily build, backtest and deploy your algo in just a few lines

    Blankly is a live trading engine, backtest runner and development framework wrapped into one powerful open-source package. Models can be instantly backtested, paper traded, sandbox tested and run live by simply changing a single line. We built blankly for every type of quant including training & running ML models in the same environment, cross-exchange/cross-symbol arbitrage, and even long/short positions on stocks (all with built-in WebSockets). Blankly is the first framework to enable...
    Downloads: 0 This Week
  • 5
    m2cgen

    Transform ML models into native code

    ...Transform ML models into native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies. Some models force input data to be of a particular type during the prediction phase in their native Python libraries. Currently, m2cgen works only with the float64 (double) data type. You can try casting your input data to another type manually and checking the results again. Also, small differences can arise from the specific implementation of floating-point arithmetic in the target language.
    Downloads: 0 This Week
  • 6
    PySchool

    Installable / Portable Python Distribution for Everyone.

    PySchool is a free and open-source Python distribution intended primarily for students learning Python and data analysis, but it can also be used by scientists, engineers, and data scientists. It includes more than 150 Python packages (full edition) including numpy, pandas, scipy, sympy, keras, scikit-learn, matplotlib, seaborn, beautifulsoup4...
    Downloads: 1,361 This Week
  • 7
    AugLy

    A data augmentations library for audio, image, text, and video

    ...AugLy is a great library to utilize for augmenting your data in model training, or to evaluate the robustness gaps of your model! We designed AugLy to include many specific data augmentations that users perform in real life on internet platforms like Facebook's -- for example making an image into a meme, overlaying text/emojis on images/videos, reposting a screenshot from social media. While AugLy contains more generic data augmentations as well, it will be particularly useful to you if you're working on a problem like copy detection, hate speech detection, etc.
    Downloads: 0 This Week
  • 8
    Binarytree

    Python library for studying Binary Trees

    Binarytree is a Python library that lets you generate, visualize, inspect and manipulate binary trees. Skip the tedious work of setting up test data, and dive straight into practicing algorithms. Heaps and BSTs (binary search trees) are also supported. Binarytree supports another representation which is more compact but without the indexing properties. Traverse trees using different algorithms.
    Downloads: 0 This Week
  • 9
    AWS Step Functions Data Science SDK

    For building machine learning (ML) workflows and pipelines on AWS

    The AWS Step Functions Data Science SDK is an open-source library that allows data scientists to easily create workflows that process and publish machine learning models using Amazon SageMaker and AWS Step Functions. You can create machine learning workflows in Python that orchestrate AWS infrastructure at scale, without having to provision and integrate the AWS services separately.
    Downloads: 0 This Week
  • 10
    q - Text as Data

    Run SQL directly on CSV or TSV files

    q is a command line tool that allows direct execution of SQL-like queries on CSVs/TSVs (and any other tabular text files). q treats ordinary files as database tables, and supports all SQL constructs, such as WHERE, GROUP BY, JOINs etc. It supports automatic column name and column type detection, and provides full support for multiple encodings. Use -e data-encoding to set the input data encoding, -Q query-encoding to set the query encoding, and -E output-encoding to set the output encoding. Sensible defaults are in place for all three parameters. Files with BOM: Files which contain a BOM (Byte Order Mark) are not properly supported inside Python's csv module. q contains a workaround that allows reading UTF-8 files which contain a BOM: use -e utf-8-sig for this. ...
    Downloads: 0 This Week
  • 11
    Smart Contract Sanctuary

    A home for Ethereum smart contracts

    ...Contains smart contract sources for various networks, grouped by the first two chars of the contract address. A scriptable semantic grep utility for Solidity (crunch numbers, find specific contracts, extract data). Semgrep is a fast, open-source, static analysis tool for finding bugs and enforcing code standards at editor, commit, and CI time, and now supports Solidity! A powerful online code search service that can be used to search the sanctuary without cloning.
    Downloads: 0 This Week
  • 12
    Interpret-Text

    State-of-the-art explainers for text-based machine learning models

    A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard. Interpret-Text builds on Interpret, an open source python package for training interpretable models and helping to explain blackbox machine learning systems. We have added extensions to support text models. Interpret-Text incorporates community-developed interpretability techniques for NLP models and a visualization dashboard to view the results....
    Downloads: 0 This Week
  • 13
    StreamAlert

    StreamAlert is a serverless, real-time data analysis framework

    StreamAlert is a serverless, real-time data analysis framework that empowers you to ingest, analyze, and alert on data from any environment, using data sources and alerting logic you define. Computer security teams use StreamAlert to scan terabytes of log data every day for incident detection and response. Incoming log data will be classified and processed by the rules engine.
    Downloads: 0 This Week
  • 14
    Photonix Photo Manager

    A modern, web-based photo management server

    ...If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for other contributors to get involved. You can move some photos into the folder data/photos and they should get detected and imported immediately. Once you have finished trying out the system you can edit the volume in the docker-compose.yml file where it says ./data/photos to mount wherever you usually keep photos. The system database, thumbnails and other cache data are stored separately from the photos, so they shouldn't pollute your photo folder. ...
    Downloads: 0 This Week
  • 15
    PersonGen

    A minor project in Python that uses the RandomUser API.

    A small Python program that uses the RandomUser API to generate random person data.
    Downloads: 0 This Week
  • 16
    TensorFlow Examples

    TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

    ...For clarity and educational value, each example is accompanied by explanatory comments or markdown cells to illustrate what the code does and why — a design that makes it especially suitable for self-learners or students following along with real data. Besides raw implementations, the repo often shows best practices using higher-level constructs (e.g. dataset pipelines, estimators, layers) which reflect modern TensorFlow workflows rather than only textbook-style code.
    Downloads: 0 This Week
  • 17
    itamm

    Tool to design and share enterprise solutions, services and processes

    The tool is for people who design, analyze, optimize and develop processes, services and solution architectures. IT(A)-MM is a tool for designing models of solutions, services and enterprise processes. It lets you visualize data using the popular BPMN and ArchiMate notations. It also has its own extensible notation for visualizing enterprise environment objects. IT(A)-MM is easy to use and can be used wherever you are. Using IT(A)-MM can be the first step toward deploying AIAS "A-STACK" in the enterprise environment. AIAS "A-STACK" is a complex tool for enterprise architecture management, IT infrastructure monitoring, and metadata management. ...
    Downloads: 0 This Week
  • 18
    ReconSpider

    Most Advanced Open Source Intelligence (OSINT) Framework

    ...ReconSpider can be used by Infosec Researchers, Penetration Testers, Bug Hunters, and Cyber Crime Investigators to find deep information about their target. ReconSpider aggregates all the raw data, visualizes it on a dashboard, and facilitates alerting and monitoring on the data. ReconSpider also combines the capabilities of Wave, Photon and Recon Dog to do a comprehensive enumeration of attack surfaces. Reconnaissance is a mission to obtain information, by various detection methods, about the activities and resources of an enemy or potential enemy, or the geographic characteristics of a particular area. ...
    Downloads: 13 This Week
  • 19
    PyTorchVideo

    A deep learning library for video understanding research

    ...The library includes efficient implementations of state-of-the-art architectures such as SlowFast, X3D, and MViT, optimized for both research prototyping and production inference. It supports video I/O pipelines, data augmentation, distributed training, and mixed precision computation for large-scale experiments. PyTorchVideo also connects seamlessly with other Meta AI tools such as Detectron2 and PyTorch3D for multimodal video analysis. Designed to accelerate research and deployment, it serves as a unified framework for reproducible, high-performance video AI development.
    Downloads: 0 This Week
  • 20
    restless

    A lightweight REST miniframework for Python

    ...Works great with Django, Flask, Pyramid & Tornado, but should be useful for many other Python web frameworks. Based on the lessons learned from Tastypie & other REST libraries. Restless tries to be RESTful by default, but flexible enough. The main Resource class has data methods (that you implement) for all the main RESTful actions. It also uses HTTP status codes as correctly as possible. Restless is BYOD (bring your own data) and hence, works with almost any ORM/data source. If you can import a module to work with the data & can represent it as JSON, Restless can work with it. Restless is small & easy to keep in your head. ...
    Downloads: 0 This Week
  • 21
    ReinventCommunity

    Jupyter Notebook tutorials for REINVENT 3.2

    This repository is a collection of useful Jupyter notebooks, code snippets and example JSON files illustrating the use of REINVENT 3.2.
    Downloads: 0 This Week
  • 22
    ML workspace

    All-in-one web-based IDE specialized for machine learning

    All-in-one web-based development environment for machine learning. The ML workspace is an all-in-one web-based IDE specialized for machine learning and data science. It is simple to deploy and gets you started within minutes to productively build ML solutions on your own machines. This workspace is the ultimate tool for developers, preloaded with a variety of popular data science libraries (e.g., Tensorflow, PyTorch, Keras, Sklearn) and dev tools (e.g., Jupyter, VS Code, Tensorboard) perfectly configured, optimized, and integrated. ...
    Downloads: 0 This Week
  • 23
    Arraymancer

    A fast, ergonomic and portable tensor library in Nim

    Arraymancer is a tensor and deep learning library for the Nim programming language, designed for high-performance numerical computations and machine learning applications.
    Downloads: 0 This Week
  • 24
    TransPose

    PyTorch Implementation for "TransPose, Keypoint localization

    TransPose is a human pose estimation model based on a CNN feature extractor, a Transformer encoder, and a prediction head. Given an image, the attention layers built into the Transformer can efficiently capture long-range spatial relationships between keypoints and explain which dependencies the predicted keypoint locations rely on most.
    Downloads: 4 This Week
  • 25
    Pytorch Points 3D

    Pytorch framework for doing deep learning on point clouds

    ...Core implementation of common components for point cloud deep learning, greatly simplifying the creation of new models. Four base convolution classes simplify the implementation of new convolutions. Each base class supports a different data format.
    Downloads: 0 This Week