Showing 3125 open source projects for "data"

View related business solutions
  • Run Any Workload on Compute Engine VMs Icon
    Run Any Workload on Compute Engine VMs

    From dev environments to AI training, choose preset or custom VMs with 1–96 vCPUs and industry-leading 99.95% uptime SLA.

    Compute Engine delivers high-performance virtual machines for web apps, databases, containers, and AI workloads. Choose from general-purpose, compute-optimized, or GPU/TPU-accelerated machine types—or build custom VMs to match your exact specs. With live migration and automatic failover, your workloads stay online. New customers get $300 in free credits.
    Try Compute Engine
  • $300 in Free Credit for Your Google Cloud Projects Icon
    $300 in Free Credit for Your Google Cloud Projects

    Build, test, and explore on Google Cloud with $300 in free credit. No hidden charges. No surprise bills.

    Launch your next project with $300 in free Google Cloud credit—no hidden charges. Test, build, and deploy without risk. Use your credit across the Google Cloud platform to find what works best for your needs. After your credits are used, continue building with free monthly usage products. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    xTuring

    xTuring

    Easily build, customize and control your own LLMs

    xTuring is an open-source AI personalization software. xTuring makes it easy to build and control LLMs by providing a simple interface to personalize LLMs to your own data and application. xTuring provides fast, efficient and simple fine-tuning of LLMs, such as LLaMA, GPT-J, Galactica, and more. By providing an easy-to-use interface for fine-tuning LLMs to your own data and application, xTuring makes it simple to build, customize and control LLMs. The entire process can be done inside your computer or in your private cloud, ensuring data privacy and security.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Elementary Algorithms

    Elementary Algorithms

    Book of elementary algorithms and data structures

    This book introduces elementary algorithms and data structure. It includes side-by-side comparison of purely functional realization and their imperative counterpart. From 2020/12, I started re-writing this book. The PDF can be downloaded for preview (EN, 中文). The 1st edition in Chinese (中文) was published in 2017. I recently switched my focus to the Mathematics of programming, the new book is also available in (github).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Parveshdhull AutoTyper

    Parveshdhull AutoTyper

    A Data Entry Tool for Windows and Linux

    Sometimes we have to write content in programs where copy-paste is not allowed, like in data entry software Notepad RT. There are many tools available online but almost all of them only provide trial versions. And requires big payment for continued access. And even if they are free, it is not wise to give complete access to a keyboard to any third-party software. So I wrote this simple-short python script that reads content from a text file then simulates keyboard typing.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    nodejsscan

    nodejsscan

    nodejsscan is a static security code scanner for Node.js applications

    Static security code scanner (SAST) for Node.js applications powered by libsast and semgrep. nodejsscan is a static security code scanner for Node.js applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cut Data Warehouse Costs up to 54% with BigQuery Icon
    Cut Data Warehouse Costs up to 54% with BigQuery

    Migrate from Snowflake, Databricks, or Redshift with free migration tools. Exabyte scale without the Exabyte price.

    BigQuery delivers up to 54% lower TCO than cloud alternatives. Migrate from legacy or competing warehouses using free BigQuery Migration Service with automated SQL translation. Get serverless scale with no infrastructure to manage, compressed storage, and flexible pricing—pay per query or commit for deeper discounts. New customers get $300 in free credit.
    Try BigQuery Free
  • 5

    ViReMa

    Viral Recombination Mapper

    ...Feb 2014 - Our paper (Open Access) is available at Nucleic Acids Research: "Discovery of functional genomic motifs in viruses with ViReMa–a Virus Recombination Mapper–for analysis of next-generation sequencing data" http://nar.oxfordjournals.org/content/42/2/e11 This is an on-going project and updates will be regularly posted. Please get in touch with any questions, problems or suggestions.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6

    Python Laboratory Operations Toolkit

    many useful snippets for using python in a laboratory

    A toolkit of Python software useful in a laboratory data acquisition and analysis environment. Includes support for such protocols as VXI-11 (and its extension, LXI), Vernier LabPro (now very old), and National Instruments DSTP (now very old). Also includes data analysis and modelling tidbits. Python3 updates are on the way in the very near future for the biggest packages. the vxi11 package is fully up-to-date, although see the blog post about python 3.13
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Originally a reimplementation of OpenGroupware's ZideStore. While compatible with legacy ZideStore Coils provides a sophisticated workflow system with ETL and integration capabilities and superior WebDAV/CalDAV features and compatibility. The workflow engine suppports processes described in BPML and provides integration with a variety of services include SSH, LPD, LDAP, and relation databases.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Lightning-Hydra-Template

    Lightning-Hydra-Template

    PyTorch Lightning + Hydra. A very user-friendly template

    ...A collection of best practices for efficient workflow and reproducibility. Thoroughly commented - you can use this repo as a reference and educational resource. Not fitted for data engineering - the template configuration setup is not designed for building data processing pipelines that depend on each other. PyTorch Lightning, a lightweight PyTorch wrapper for high-performance AI research. Think of it as a framework for organizing your PyTorch code. Hydra, a framework for elegantly configuring complex applications. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    NuPIC

    NuPIC

    Numenta platform for intelligent computing

    ...At the core of HTM are time-based continuous learning algorithms that store and recall spatial and temporal patterns. NuPIC is suited to a variety of problems, particularly anomaly detection and prediction of streaming data sources. For more information, see numenta.org or the NuPIC Forum. If you want to build the dependent nupic.bindings from source, you should build and install from nupic.core prior to installing nupic (since a PyPI release will be installed if nupic.bindings isn't yet installed). To install from local source code, run from the repository root. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud Icon
    Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud

    Get back to your application and leave the database to us. Cloud SQL automatically handles backups, replication, and scaling.

    Cloud SQL is a fully managed relational database for MySQL, PostgreSQL, and SQL Server. We handle patching, backups, replication, encryption, and failover—so you can focus on your app. Migrate from on-prem or other clouds with free Database Migration Service. IDC found customers achieved 246% ROI. New customers get $300 in credits plus a 30-day free trial.
    Try Cloud SQL Free
  • 10
    Detic

    Detic

    Code release for "Detecting Twenty-thousand Classes

    Detic (“Detecting Twenty-thousand Classes using Image-level Supervision”) is a large-vocabulary object detector that scales beyond fully annotated datasets by leveraging image-level labels. It decouples localization from classification, training a strong box localizer on standard detection data while learning classifiers from weak supervision and large image-tag corpora. A shared region proposal backbone feeds a flexible classification head that can expand to tens of thousands of categories without exhaustive box annotations. The system supports zero- or few-shot extension to novel categories via semantic embeddings and class name supervision, making “open-world” detection practical. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Srilanka-Holidays

    Sri Lankan official holidays open-source API & data

    An open-source API to get Sri Lankan holidays and a list of Sri Lankan holidays in several file formats for easy integration with your applications. Holidays are updated every year and currently contains holidays starting from 2021. - Reliable and accurate local data updated every year - Available as both API and downloadable files in CSV, JSON, XML, ICS - Holiday categorizations (Public, Bank, Mercantile, Poya, etc.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    TradingGym

    TradingGym

    Trading backtesting environment for training reinforcement learning

    TradingGym is a toolkit (in Python) for creating trading and backtesting environments, especially for reinforcement learning agents, but also for simpler rule-based algorithms. It follows a design inspired by OpenAI Gym, offering various environments, data formats (tick data and OHLC), and tools to simulate trading with costs, position limits, observation windows etc. Licensed under MIT. This training environment was originally designed for tickdata, but also supports OHLC data format. WIP. The list contains the feature columns to use in the trading status.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MathTranslate

    MathTranslate

    translate scientific papers in latex, especially arxiv papers

    This is a project to translate LaTeX documents, especially scientific papers, from any language to any language. LaTeX expressions like math expressions are perfectly kept unchanged. LaTeX documents can finally be compiled into PDF files. Especially it can be directly applied to translate arXiv papers since it provides the LaTeX source code of most of the papers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    AnyTrading

    AnyTrading

    The most simple, flexible, and comprehensive OpenAI Gym trading

    gym-anytrading is an OpenAI Gym-compatible environment designed for developing and testing reinforcement learning algorithms on trading strategies. It simulates trading environments for financial markets, including stocks and forex.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    LangChain Apps on Production with Jina

    LangChain Apps on Production with Jina

    Langchain Apps on Production with Jina & FastAPI

    ...You can benefit from the scalability and serverless architecture of the cloud without sacrificing the ease and convenience of local development. And if you prefer, you can also deploy your LangChain apps on your own infrastructure to ensure data privacy. With long chain-serve, you can craft REST/WebSocket APIs, spin up LLM-powered conversational Slack bots, or wrap your LangChain apps into FastAPI packages on the cloud or on-premises.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    funNLP

    funNLP

    Resources, corpora, and tools for Chinese natural language processing

    ...The project is highly community-oriented, frequently updated with contributions and new resources, and it’s widely used in both academic and applied NLP research. Its value lies in providing not just tools but also curated, domain-specific data, which can be hard to find elsewhere.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Hermes -- a set of libraries and tools for the support of scientific simulation and data archival.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Odoo ERP CRM System For Local Intranets

    Odoo ERP CRM System For Local Intranets

    Web,Sales,Finance,Inventory,Manufacture,HR,Marketing,Services,Efficacy

    This VM is created for 2 reasons: 1. Very little initial setup work required to make a ERP live, within minutes. 2. This system should keep running for Years, without requiring Updates / Breakages. If you are new to Virtual Machines, then please watch the Video below ( taken from my other project. just replace td with ec wherever mentioned ) After starting this VM, please login to its administration panel with: Website Address: https://ec.local/ ( Accept Any Warnings ) Admin...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 19
    find-similar

    find-similar

    User-friendly library to find similar objects

    The mission of the FindSimilar project is to provide a powerful and versatile open source library that empowers developers to efficiently find similar objects and perform comparisons across a variety of data types. Whether dealing with texts, images, audio, or more, our project aims to simplify the process of identifying similarities and enhancing decision-making. https://github.com/findsimilar/find-similar - GitHub repo http://demo.findsimilar.org/ - Demo project and tutorial https://docs.findsimilar.org/ - Documentation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Consistency Models

    Consistency Models

    Official repo for consistency models

    consistency_models is the repository for Consistency Models, a new family of generative models introduced by OpenAI that aim to generate high-quality samples by mapping noise directly into data — circumventing the need for lengthy diffusion chains. It builds on and extends diffusion model frameworks (e.g. based on the guided-diffusion codebase), adding techniques like consistency distillation and consistency training to enable fast, often one-step, sample generation. The repo is implemented in PyTorch and includes support for large-scale experiments on datasets like ImageNet-64 and LSUN variants. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    showyourwork

    showyourwork

    A workflow for reproducible and open scientific articles

    showyourwork is a workflow management tool for open source scientific articles. If you want to make your research article reproducible, extensible, transparent, or just downright awesome, you’ve found the right tool. showyourwork automates your entire workflow, packaging it into a self-contained recipe that anyone can follow and, at the click of a button, use to reproduce your results.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Shynet

    Shynet

    Modern, privacy-friendly, and detailed web analytics

    ...They require handing all of your visitors' info to a third-party company They use cookies to track visitors across sessions, so you need to have those annoying cookie notices. They collect so much personal data that even the NSA is jealous. They are closed source and/or expensive, often with limited data portability. They are hard to use. Shynet has none of these caveats. You host it yourself, so the data is yours. It works without cookies, so you don't need any intrusive cookie notices. It collects just enough data to be useful, but not enough to be creepy. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Crawlab

    Crawlab

    Distributed web crawler admin platform for spiders management

    ...Tasks are scheduled by the task scheduler module in the master node, and received by the task handler module in worker nodes, which executes these tasks in task runners. Task runners are actually processes running spider or crawler programs, and can also send data through gRPC (integrated in SDK) to other data sources, e.g. MongoDB.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    DeepH-pack

    DeepH-pack

    Deep neural networks for density functional theory Hamiltonian

    DeepH-pack is the official implementation of the DeepH (Deep Hamiltonian) method described in the paper Deep-learning density functional theory Hamiltonian for efficient ab initio electronic-structure calculation and in the Research Briefing. DeepH-pack supports DFT results made by ABACUS, OpenMX, FHI-aims or SIESTA and will support HONPAS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    macara

    macara

    A converter for seamless transformation of files, data, and media ...

    This application consolidates various scripts, including an AI feature (rembg), into a singular platform. The design of this software is evolutionary, allowing for the seamless integration of additional scripts, menus, or windows as needed. Serving as a versatile tool, it facilitates efficient file management, especially when handling a substantial volume of images, whether sorting by name or other attributes. These scripts are crafted to complement generative art AI technologies like Dall-e...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB