Showing 3125 open source projects for "data"

  • 1
    MatchZoo

    Facilitating the design, comparison and sharing of deep text models

    ...Generate pair-wise training data on the fly and evaluate model performance using customized callbacks on validation data. MatchZoo depends on Keras and TensorFlow.
    Downloads: 0 This Week
  • 2
    YouTube-8M

    Starter code for working with the YouTube-8M dataset

    youtube-8m is Google’s open source starter code and reference implementation for training and evaluating machine learning models on the YouTube-8M dataset, one of the largest video understanding datasets publicly released. The repository provides a complete pipeline for video-level and frame-level modeling using TensorFlow, including data reading, model training, evaluation, and inference. It was developed to support the YouTube-8M Video Understanding Challenge (hosted on Kaggle and featured at ICCV 2019), enabling researchers and practitioners to benchmark video classification models on large-scale datasets with millions of labeled videos. The code demonstrates how to process frame-level features, train logistic and deep learning models, evaluate them using metrics like global Average Precision (gAP) and mean Average Precision (mAP), and export trained models for MediaPipe inference.
    Downloads: 1 This Week
  • 3
    VkCoolLoader

    Download music from 'VKontakte' without any problems! Windows-only

    ...This simple, free Windows-only app lets you download any track you like. WARNING: this app needs your login and password to download tracks. The app's creator guarantees that your personal data is not stored, transferred to third parties, processed, or used in any other way.
    Downloads: 0 This Week
  • 4
    AutoBench

    A utility for extracting data from benchmark sites

    This program extracts the latest CPU, GPU, drive, and RAM performance scores and rankings from benchmark sites. The output is saved as CSV, XLSX, and XLS files. Each CPU, GPU, drive, and RAM entry is recorded by model name and score.
    Downloads: 0 This Week
  • 5
    yapydata

    Lower-Layer unified data - JSON, XML, YAML + INI, CFG, properties

    yapydata (Yet Another Python Data) provides a unified interface for accessing various data syntaxes. It wraps the underlying libraries behind a common API whose canonical internal representation is a JSON-compatible in-memory Python structure. It is intended in particular for the lower layers of the software stack, including setup-tools, and therefore uses only standard libraries whenever possible.
    Downloads: 0 This Week
  • 6
    BotSlayer

    BotSlayer Community Edition

    ...The system is easily installed and configured in the cloud to monitor bot activity around a standing user-defined query. All you need is a Twitter developer app key to fetch data from the Twitter streaming API.
    Downloads: 0 This Week
  • 7
    PyTorch-BigGraph

    Generate embeddings from large-scale graph-structured data

    PyTorch-BigGraph (PBG) is a system for learning embeddings on massive graphs—think billions of nodes and edges—using partitioning and distributed training to keep memory and compute tractable. It shards entities into partitions and buckets edges so that each training pass only touches a small slice of parameters, which drastically reduces peak RAM and enables horizontal scaling across machines. PBG supports multi-relation graphs (knowledge graphs) with relation-specific scoring functions,...
    Downloads: 0 This Week
  • 8
    abu

    Abu quantitative trading system (stocks, options, futures, bitcoin)

    The Abu quantitative system combines an integrated AI big data system, a K-line pattern system, a classic indicator system, a trend analysis system, a time series dimension system, a statistical probability system, and a traditional moving average system to conduct in-depth quantitative analysis of investment instruments. It lets users skip the complex coding stage of quantification entirely, making it more suitable for ordinary users, and moves towards the era of vectorization 2.0.
    Downloads: 0 This Week
  • 9
    NCVTK: A VTK-based tool to visualize data stored in the NetCDF file format.
    Downloads: 0 This Week
  • 10
    NiftyNet

    An open-source convolutional neural networks platform for research

    ...NiftyNet’s modular structure is designed for sharing networks and pre-trained models. Using this modular structure you can get started with established pre-trained networks using built-in tools. Adapt existing networks to your imaging data. Quickly build new solutions to your own image analysis problems. NiftyNet currently supports medical image segmentation and generative adversarial networks. NiftyNet is not intended for clinical use.
    Downloads: 0 This Week
  • 11
    setuplib

    Extensions for setuptools - detailed information on entry points

    The setuplib package provides core functions for querying meta information and installation repositories of Python packages. It provides query and filter options on the installed packages and the available information, and displays the result data in various formats, e.g. as a table, list, JSON, XML, YAML, or CSV. The provided commands and extension points integrate seamlessly into the standard setuptools and/or distutils. setuplib is a member of the setuplib group of extension components for the common setup.py installer. The targeted users are mainly developers, advanced users, and system administrators.
    Downloads: 2 This Week
  • 12
    yapyutils

    Utilities for platform-independent low-level system APIs.

    ...These are used, e.g., for extensions of setuptools and distutils, reducing package dependencies and avoiding circular dependencies whenever possible by using standard packages and classes only. The more complex and complete data packages are provided for higher application layer functionality. The current release contains: yapyutils.modules, a utility to locate and load modules by a given name and/or file system path name, based on the sys.path variable; and yapyutils.files, for searching and locating files, e.g. modules and configuration files...
    Downloads: 0 This Week
  • 13
    platformids

    OS and Distribution Release Enumeration

    ...This enables the development of portable generic code for arbitrary platforms in IT and IoT landscapes consisting of heterogeneous physical and virtual runtime environments. The hierarchical bitmask vectors it introduces enable fast and efficient selection of platform-specific code and data for operating systems and distributions, with routines for specific platform releases. The supported version numbering comprises various release schemes, such as classical version numbers with variable segments and optional release names (AlpineLinux-3.8.1, CentOS-6.10, Debian-9.6, Fedora31, OS-X-10.6.8, Ubuntu-18.04, armbian-5.76, cygwin-2.9.0, opensuse-15.1, raspbian-9.4, slackware-14.2, solaris-11.3), as well as variations of numbering schemes and continuous deployment (CentOS-7.6-1810, NT-6.3.9600, archlinux-2018.12.01, kali-linux-2019.1, NT-10.0.1809).
    Downloads: 0 This Week
  • 14
    I3D models trained on Kinetics

    Convolutional neural network model for video classification

    ...The project provides TensorFlow and Sonnet-based implementations, pretrained checkpoints, and example scripts for evaluating or fine-tuning models. It also offers sample data, including preprocessed video frames and optical flow arrays, to demonstrate how to run inference and visualize outputs.
    Downloads: 1 This Week
  • 15
    nonechucks

    Deal with bad samples in your dataset dynamically

    ...Or what if your dataset is a folder full of scanned PDFs that you have to OCRize, and then run a language detector on the resulting text, because you want only the ones that are in English? Or maybe you have an AlternateIndexSampler, and you want to be able to move to dataset[6] after dataset[4] fails while attempting to load! PyTorch's data processing module expects you to rid your dataset of any unwanted or invalid samples before you feed them into its pipeline, and provides no easy way to define a "fallback policy" in case such samples are encountered during dataset iteration.
    Downloads: 0 This Week
  • 16
    WALKOFF

    A flexible, easy-to-use automation framework

    ...WALKOFF puts the tools in your hands to easily automate the tedious repetitive tasks dragging your operations down. Act smarter with WALKOFF by automatically gathering data, analyzing data, or visualizing data customized to your requirements. Act faster with WALKOFF by integrating the capabilities you already own to dynamically respond on your terms to your fast-moving environment. Drag and drop workflow editor. Sharable apps and workflows. Deploy on Windows or Linux. Python 2.7 and 3.4+. Scale to your needs. ...
    Downloads: 0 This Week
  • 17
    An open source framework for LC-MS based proteomics and metabolomics. OpenMS offers data structures and algorithms for the processing of mass spectrometry data. The library is written in C++. Our source code and wiki live on GitHub (https://github.com/OpenMS/OpenMS).
    Downloads: 1 This Week
  • 18
    SkpyBackup
    ...https://github.com/SkpyBackup/SkpyBackup
    Using: https://skpy.t.allofti.me, https://pandas.pydata.org, https://pypi.org/project/auto-py-to-exe, https://visualstudio.microsoft.com/pt-br/vs/community
    Get this data from your contact list: id, name, location, language, avatar, mood, phones, birthday, authorised, blocked, favourite
    Downloads: 0 This Week
  • 19
    Nebula docs

    Documentation repo of nebula orchestration system

    Nebula is an open-source distributed Docker orchestrator designed for massive scale (tens of thousands of servers/worker devices). Unlike Mesos/Swarm/Kubernetes, it allows workers to be distributed over high-latency connections (such as the internet) while the pods (containers) are managed centrally, with changes taking effect (almost) immediately. This makes Nebula ideal for managing a vast cluster of servers/devices across the globe; example use cases include appliances and virtual appliances located at clients' data centers, edge computing, and POS systems. Ever wondered how you're going to push an update to that smart fridge your company is working on when it's running on thousands of devices around the globe? Wish you could have the assurance that your service will always use the latest code/envvars/etc. in all of its edge locations? Want the ability to stop/start a globally distributed service with a single command? ...
    Downloads: 0 This Week
  • 20
    Coursebook

    Introductory Systems Programming Textbook for University of Illinois

    Welcome to the systems programming coursebook! This repository houses a high-quality, open-source introductory systems programming textbook used by the CS 341: System Programming course at the University of Illinois at Urbana-Champaign. The book assumes that you have taken a programming language course and are familiar with assembly instructions. All of the code and instruction will be in C, as it is the de facto language of the Linux kernel.
    Downloads: 0 This Week
  • 21
    Pydicom by examples

    Basic and intermediate examples of DICOM library with Jupyter

    Basic and intermediate examples for reading, modifying, and writing DICOM files with Python code in Jupyter. To install Jupyter, see https://jupyter.org/install. All examples are based on Pydicom, an open source library: https://pydicom.github.io/
    Downloads: 1 This Week
  • 22
    CCZero (中国象棋Zero)

    Implement AlphaZero/AlphaGo Zero methods on Chinese chess

    ChineseChess-AlphaZero is a project that implements the AlphaZero algorithm for the game of Chinese Chess (Xiangqi). It adapts DeepMind’s AlphaZero method—combining neural networks and Monte Carlo Tree Search (MCTS)—to learn and play Chinese Chess without prior human data. The system includes self-play, training, and evaluation pipelines tailored to Xiangqi's unique game mechanics.
    Downloads: 1 This Week
  • 23
    InferSent

    InferSent sentence embeddings

    InferSent is a supervised sentence embedding method that learns universal representations from Natural Language Inference data and transfers well to many downstream tasks. It uses a BiLSTM encoder with max-pooling to produce fixed-length sentence vectors that capture semantics beyond bag-of-words statistics. Trained on large NLI datasets, the embeddings generalize across tasks like sentiment analysis, entailment, paraphrase detection, and semantic similarity with simple linear classifiers. ...
    Downloads: 0 This Week
  • 24
    MLBox

    MLBox is a powerful Automated Machine Learning Python library

    MLBox is a powerful Automated Machine Learning Python library. Fast reading and distributed data preprocessing/cleaning/formatting. Highly robust feature selection and leak detection. Accurate hyper-parameter optimization in high-dimensional space. State-of-the-art predictive models for classification and regression (Deep Learning, Stacking, LightGBM,...). Prediction with model interpretation. MLBox has been developed and used by many active community members.
    Downloads: 0 This Week
  • 25
    Olex2 is visualisation software for small-molecule crystallography developed at Durham University/EPSRC. It provides comprehensive tools for crystallographic model manipulation for the end user and an extensible development framework for programmers. The project has been supported by Olexsys Ltd since 2010.
    Downloads: 0 This Week