Data Labeling Tools for Mac

View 17 business solutions

Browse free open source Data Labeling tools and projects for Mac below. Use the toggles on the left to filter open source Data Labeling tools by OS, license, language, programming language, and project status.

  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 1
    Computer Vision Annotation Tool (CVAT)

    Computer Vision Annotation Tool (CVAT)

    Interactive video and image annotation tool for computer vision

    Computer Vision Annotation Tool (CVAT) is a free and open source, interactive online tool for annotating videos and images for Computer Vision algorithms. It offers many powerful features, including automatic annotation using deep learning models, interpolation of bounding boxes between key frames, LDAP and more. It is being used by its own professional data annotation team to annotate millions of objects with different properties. The UX and UI were also specially developed by the team for computer vision tasks. CVAT supports several annotation formats. Format selection can be done after clicking on the Upload annotation and Dump annotation buttons.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 2
    Label Studio

    Label Studio

    Label Studio is a multi-type data labeling and annotation tool

    The most flexible data annotation tool. Quickly installable. Build custom UIs or use pre-built labeling templates. Detect objects on image, bboxes, polygons, circular, and keypoints supported. Partition image into multiple segments. Use ML models to pre-label and optimize the process. Label Studio is an open-source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats. It can be used to prepare raw data or improve existing training data to get more accurate ML models. The frontend part of Label Studio app lies in the frontend/ folder and written in React JSX. Multi-user labeling sign up and login, when you create an annotation it's tied to your account. Configurable label formats let you customize the visual interface to meet your specific labeling needs. Support for multiple data types including images, audio, text, HTML, time-series, and video.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 3
    Cleanlab

    Cleanlab

    The standard data-centric AI package for data quality and ML

    cleanlab helps you clean data and labels by automatically detecting issues in a ML dataset. To facilitate machine learning with messy, real-world data, this data-centric AI package uses your existing models to estimate dataset problems that can be fixed to train even better models. cleanlab cleans your data's labels via state-of-the-art confident learning algorithms, published in this paper and blog. See some of the datasets cleaned with cleanlab at labelerrors.com. This package helps you find label issues and other data issues, so you can train reliable ML models. All features of cleanlab work with any dataset and any model. Yes, any model: PyTorch, Tensorflow, Keras, JAX, HuggingFace, OpenAI, XGBoost, scikit-learn, etc. If you use a sklearn-compatible classifier, all cleanlab methods work out-of-the-box.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Diffgram

    Diffgram

    Training data (data labeling, annotation, workflow) for all data types

    From ingesting data to exploring it, annotating it, and managing workflows. Diffgram is a single application that will improve your data labeling and bring all aspects of training data under a single roof. Diffgram is world’s first truly open source training data platform that focuses on giving its users an unlimited experience. This is aimed to reduce your data labeling bills and increase your Training Data Quality. Training Data is the art of supervising machines through data. This includes the activities of annotation, which produces structured data; ready to be consumed by a machine learning model. Annotation is required because raw media is considered to be unstructured and not usable without it. That’s why training data is required for many modern machine learning use cases including computer vision, natural language processing and speech recognition.
    Downloads: 5 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Compose

    Compose

    A machine learning tool for automated prediction engineering

    Compose is a machine learning tool for automated prediction engineering. It allows you to structure prediction problems and generate labels for supervised learning. An end user defines an outcome of interest by writing a labeling function, then runs a search to automatically extract training examples from historical data. Its result is then provided to Featuretools for automated feature engineering and subsequently to EvalML for automated machine learning. Prediction problems are structured by using a label maker and a labeling function. The label maker automatically extracts data along the time index to generate labels. The process starts by setting the first cutoff time after the minimum amount of data. Then subsequent cutoff times are spaced apart using gaps. Starting from each cutoff time, a window determines the amount of data, also referred to as a data slice, to pass into a labeling function.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    Toloka-Kit

    Toloka-Kit

    Toloka-Kit is a Python library for working with Toloka API

    Toloka-Kit is a Python library for working with Toloka API. The API allows you to build scalable and fully automated human-in-the-loop ML pipelines, and integrate them into your processes. The toolkit makes integration easier. You can use it with Jupyter Notebooks. Support for all common Toloka use cases: creating projects, adding pools, uploading tasks, and so on. Toloka entities are represented as Python classes. You can use them instead of accessing the API using JSON representations. There’s no need to validate JSON files and work with them directly. Support of both synchronous and asynchronous (via async/await) executions. Streaming support: build complex pipelines which send and receive data in real-time. For example, you can pass data between two related projects: one for data labeling, and another for its validation. AutoQuality feature which automatically finds the best fitting quality control rules for your project.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    bbox-visualizer

    bbox-visualizer

    Make drawing and labeling bounding boxes easy as cake

    Make drawing and labeling bounding boxes easy as cake. This package helps users draw bounding boxes around objects, without doing the clumsy math that you'd need to do for positioning the labels. It also has a few different types of visualizations you can use for labeling objects after identifying them. There are optional functions that can draw multiple bounding boxes and/or write multiple labels on the same image, but it is advisable to use the above functions in a loop in order to have full control over your visualizations.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Lightly

    Lightly

    A python library for self-supervised learning on images

    A python library for self-supervised learning on images. We, at Lightly, are passionate engineers who want to make deep learning more efficient. That's why - together with our community - we want to popularize the use of self-supervised methods to understand and curate raw image data. Our solution can be applied before any data annotation step and the learned representations can be used to visualize and analyze datasets. This allows selecting the best core set of samples for model training through advanced filtering. We provide PyTorch, PyTorch Lightning and PyTorch Lightning distributed examples for each of the models to kickstart your project. Lightly requires Python 3.6+ but we recommend using Python 3.7+. We recommend installing Lightly in a Linux or OSX environment. With lightly, you can use the latest self-supervised learning methods in a modular way using the full power of PyTorch. Experiment with different backbones, models, and loss functions.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    DotVVM

    DotVVM

    Open source MVVM framework for Web Apps

    DotVVM is an open-source framework for ASP.NET. It lets you create web apps using the MVVM pattern, with just C# and HTML. DotVVM can be used to build new ASP.NET Core web apps, or to modernize legacy ASP.NET apps and migrate them to .NET 5. Save your time with GridView, FileUpload and other components shipped with the framework. Don't spend the time building an API. Just load data from the database and use data-binding to display them. DotVVM needs less than 100 kB of JavaScript code. It's smaller than other ASP.NET-based frameworks. DotVVM offers a free Visual Studio extension giving you all the comfort you are used to. DotVVM comes with ready-made components you can use in your HTML files. The state and user interactions are handled in view models - C# classes. The controls render simple HTML which can be styled easily. MVVM pattern and data-binding expressions are used to access the UI components.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    Acharya

    Acharya

    A Data Centric annotation tool for your Named Entity Recognition

    A data-centric annotation tool to increase the accuracy of your Named Entity Recognition projects which helps rapidly identify and fix labeling errors in your dataset. Import/export datasets in multiple formats, train a model and use it to aid in the annotation process. Setup an MLOps pipeline to experiment with different algorithms on the same data and increase their accuracy and performance in a data-centric way. Installation and Setup for Acharya are not required, Acharya runs the initial setup when run for the first time. Rapidly identify and fix labeling errors in your dataset. Import/export datasets in multiple formats, train a model and use it to aid in the annotation process. Setup an MLOps pipeline to experiment with different algorithms on the same data and increase their accuracy and performance in a data-centric way. Gain insights about your training & test data, distribution of annotated entities, and decide how to curate your data for better accuracy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Adala

    Adala

    Adala: Autonomous DAta (Labeling) Agent framework

    Adala is a data-centric AI framework focused on dataset curation, annotation, and validation. It helps AI teams manage high-quality training datasets by providing tools for data auditing, error detection, and quality assessment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    BioRec:Bird Census field data annotation

    Recognizing biological data from a notebook.

    This project helps to digitize field data for a certain Bird Census method. Namely, bird census based on personal inspection or small (~10 km^2) regions with recording birds' position and behaviour on paper. This project makes it easy to annotate such field data and to make this data available for statistical analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MANTI

    MANTI

    MANTI - Mastering Advanced N-Termini Interpretation

    MANTI is a one-stop shop N-termini annotation & evaluation solution. MANTI was previously (un)known as muda.pl ahead of v3.7, the project was renamed to MANTI.pl with v3.7 on 2019-06-24. It congregates information from different MaxQuant or DiaNN/MSFragger output files into a master file suitable explicitly for protein neo-termini analyses. The central anchor for the data congregation is the modificationSpecificPeptides.txt or diann-output.pr_matrix.tsv file - additional data is inferred from different other source files from the corresponding folder. Maybe also useful for normal proteomics purposes but this script is heavily optimized for protein neo-termini identification and validation. A graphical interface is available as Yoğurtlu_MANTI (a Perl/Tk script) + execut. application versions for Win1x without the need to have Perl installed locally. For a very detailed explanation of script parameters and the evaluation strategy, please consult the extensive manual PDF
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MANTI.pl / muda.pl

    MANTI.pl / muda.pl

    muda.pl - MQ unified data assembler

    -------- ATTENTION START: RENAMING muda.pl was renamed to MANTI.pl with v3.7, project development can be tracked on the MANTI project page on sourceforge.net. Old versions remain here for archival purposes. -------- ATTENTION END muda.pl is an evaluation script (written in Perl) without great dependencies. It congregates information from 4 different MaxQuant output files into a master file suitable explicitly for protein neo-termini analyses. The central anchor for the data congregation is the modificationSpecificPeptides.txt file - additional data is inferred from different other source files from the MaxQuant txt folder but the starting point for the data assembly is solely the modificationSpecificPeptides.txt file. Maybe also useful for normal proteomics purposes but this script is heavily optimized for protein neo-termini identification and validation. For a more thorough explanation of script parameters and evaluation strategy, please consult the extensive manual PDF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    RNA-Seq Data Annotation Pipeline
    We developed a RNA-Seq Data Annotation Pipeline named RNADAP, which measure genes expression in isoform level, work with high speed and less memory usage. Besides, our pipeline can be compatible with results from different mapping software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Speechalyzer

    Speechalyzer

    Process large speech data wrt transcription, labeling and annotation

    Speechalyzer: a tool for the daily work of a 'speech worker' It is optimized to process large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client server based framework in Java and interfaces software for speech recognition, synthesis, speech classification and quality evaluation. The application is mainly the processing of training data for speech recognition and classification models and performing benchmarking tests on speech-to-text, text-to-speech and speech classification software systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo