Showing 18 open source projects for "data"

View related business solutions
  • Build on Google Cloud with $300 in Free Credit Icon
    Build on Google Cloud with $300 in Free Credit

    New to Google Cloud? Get $300 in free credit to explore Compute Engine, BigQuery, Cloud Run, Vertex AI, and 150+ other products.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query exabytes in BigQuery, or build AI apps with Vertex AI and Gemini. Once your credits are used, keep building with 20+ products with free monthly usage, including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. Sign up to start building right away.
    Start Free Trial
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • 1
    Label Studio

    Label Studio

    Label Studio is a multi-type data labeling and annotation tool

    ...Support for multiple data types including images, audio, text, HTML, time-series, and video.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 2
    Diffgram

    Diffgram

    Training data (data labeling, annotation, workflow) for all data types

    From ingesting data to exploring it, annotating it, and managing workflows. Diffgram is a single application that will improve your data labeling and bring all aspects of training data under a single roof. Diffgram is world’s first truly open source training data platform that focuses on giving its users an unlimited experience. This is aimed to reduce your data labeling bills and increase your Training Data Quality.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Computer Vision Annotation Tool (CVAT)

    Computer Vision Annotation Tool (CVAT)

    Interactive video and image annotation tool for computer vision

    ...It offers many powerful features, including automatic annotation using deep learning models, interpolation of bounding boxes between key frames, LDAP and more. It is being used by its own professional data annotation team to annotate millions of objects with different properties. The UX and UI were also specially developed by the team for computer vision tasks. CVAT supports several annotation formats. Format selection can be done after clicking on the Upload annotation and Dump annotation buttons.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 4
    Cleanlab

    Cleanlab

    The standard data-centric AI package for data quality and ML

    cleanlab helps you clean data and labels by automatically detecting issues in a ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models. cleanlab cleans your data's labels via state-of-the-art confident learning algorithms, published in this paper and blog.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 99.99% Uptime for MySQL and PostgreSQL on Google Cloud Icon
    99.99% Uptime for MySQL and PostgreSQL on Google Cloud

    Enterprise Plus edition delivers sub-second maintenance downtime and 2x read/write performance. Built for critical apps.

    Cloud SQL Enterprise Plus gives you a 99.99% availability SLA with near-zero downtime maintenance—typically under 10 seconds. Get 2x better read/write performance, intelligent data caching, and 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server with built-in vector search for gen AI apps. New customers get $300 in free credit.
    Try Cloud SQL Free
  • 5
    DotVVM

    DotVVM

    Open source MVVM framework for Web Apps

    ...The controls render simple HTML which can be styled easily. MVVM pattern and data-binding expressions are used to access the UI components.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Toloka-Kit

    Toloka-Kit

    Toloka-Kit is a Python library for working with Toloka API

    ...There’s no need to validate JSON files and work with them directly. Support of both synchronous and asynchronous (via async/await) executions. Streaming support: build complex pipelines which send and receive data in real-time. For example, you can pass data between two related projects: one for data labeling, and another for its validation. AutoQuality feature which automatically finds the best fitting quality control rules for your project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Lightly

    Lightly

    A python library for self-supervised learning on images

    ...We, at Lightly, are passionate engineers who want to make deep learning more efficient. That's why - together with our community - we want to popularize the use of self-supervised methods to understand and curate raw image data. Our solution can be applied before any data annotation step and the learned representations can be used to visualize and analyze datasets. This allows selecting the best core set of samples for model training through advanced filtering. We provide PyTorch, PyTorch Lightning and PyTorch Lightning distributed examples for each of the models to kickstart your project. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    bbox-visualizer

    bbox-visualizer

    Make drawing and labeling bounding boxes easy as cake

    Make drawing and labeling bounding boxes easy as cake. This package helps users draw bounding boxes around objects, without doing the clumsy math that you'd need to do for positioning the labels. It also has a few different types of visualizations you can use for labeling objects after identifying them. There are optional functions that can draw multiple bounding boxes and/or write multiple labels on the same image, but it is advisable to use the above functions in a loop in order to have...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Adala

    Adala

    Adala: Autonomous DAta (Labeling) Agent framework

    Adala is a data-centric AI framework focused on dataset curation, annotation, and validation. It helps AI teams manage high-quality training datasets by providing tools for data auditing, error detection, and quality assessment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship AI Apps Faster with Vertex AI Icon
    Ship AI Apps Faster with Vertex AI

    Go from idea to deployed AI app without managing infrastructure. Vertex AI offers one platform for the entire AI development lifecycle.

    Ship AI apps and features faster with Vertex AI—your end-to-end AI platform. Access Gemini 3 and 200+ foundation models, fine-tune for your needs, and deploy with enterprise-grade MLOps. Build chatbots, agents, or custom models. New customers get $300 in free credit.
    Try Vertex AI Free
  • 10
    Email to Calendar Event ETE

    Email to Calendar Event ETE

    The python App/Skrypt automaticly add important events into calendar.

    It is use AI running localy and model you can choose. Supproted two API first is as default is Llama, second if full LM Studio api. Skrypt have a tool for automatic add to scheduler or cron-not tested enought. Scrypt now not working with Microsoft outlook and Google gmail, for certifications and api polici reasons . Fuly tested on Seznam.cz* services provider, if you have difrent provier with same type of security or autentification it will be working. *Email is using standart...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Acharya

    Acharya

    A Data Centric annotation tool for your Named Entity Recognition

    ...Import/export datasets in multiple formats, train a model and use it to aid in the annotation process. Setup an MLOps pipeline to experiment with different algorithms on the same data and increase their accuracy and performance in a data-centric way. Gain insights about your training & test data, distribution of annotated entities, and decide how to curate your data for better accuracy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Compose

    Compose

    A machine learning tool for automated prediction engineering

    ...Then subsequent cutoff times are spaced apart using gaps. Starting from each cutoff time, a window determines the amount of data, also referred to as a data slice, to pass into a labeling function.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MANTI

    MANTI

    MANTI - Mastering Advanced N-Termini Interpretation

    ...It congregates information from different MaxQuant or DiaNN/MSFragger output files into a master file suitable explicitly for protein neo-termini analyses. The central anchor for the data congregation is the modificationSpecificPeptides.txt or diann-output.pr_matrix.tsv file - additional data is inferred from different other source files from the corresponding folder. Maybe also useful for normal proteomics purposes but this script is heavily optimized for protein neo-termini identification and validation. A graphical interface is available as Yoğurtlu_MANTI (a Perl/Tk script) + execut. application versions for Win1x without the need to have Perl installed locally. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    MANTI.pl / muda.pl

    MANTI.pl / muda.pl

    muda.pl - MQ unified data assembler

    ...It congregates information from 4 different MaxQuant output files into a master file suitable explicitly for protein neo-termini analyses. The central anchor for the data congregation is the modificationSpecificPeptides.txt file - additional data is inferred from different other source files from the MaxQuant txt folder but the starting point for the data assembly is solely the modificationSpecificPeptides.txt file. Maybe also useful for normal proteomics purposes but this script is heavily optimized for protein neo-termini identification and validation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MetaErg

    MetaErg

    Metagenome Annotation Pipeline

    MetaErg is a stand-alone and fully automated metagenome and metaproteome annotation pipeline published at: https://www.frontiersin.org/articles/10.3389/fgene.2019.00999/full. If you are using this pipeline for your work, please cite: Dong X and Strous M (2019) An Integrated Pipeline for Annotation and Visualization of Metagenomic Contigs. Front. Genet. 10:999. doi: 10.3389/fgene.2019.00999 The instructions on configuring and running the MetaErg pipeline is available at GitHub...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    BioRec:Bird Census field data annotation

    Recognizing biological data from a notebook.

    This project helps to digitize field data for a certain Bird Census method. Namely, bird census based on personal inspection or small (~10 km^2) regions with recording birds' position and behaviour on paper. This project makes it easy to annotate such field data and to make this data available for statistical analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Speechalyzer

    Speechalyzer

    Process large speech data wrt transcription, labeling and annotation

    Speechalyzer: a tool for the daily work of a 'speech worker' It is optimized to process large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client server based framework in Java and interfaces software for speech recognition, synthesis, speech classification and quality evaluation. The application is mainly the processing of training data for speech recognition and classification models and performing benchmarking tests on speech-to-text, text-to-speech and speech classification software systems.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    RNA-Seq Data Annotation Pipeline
    We developed a RNA-Seq Data Annotation Pipeline named RNADAP, which measure genes expression in isoform level, work with high speed and less memory usage. Besides, our pipeline can be compatible with results from different mapping software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.
Try Free →