Showing 442 open source projects for "model-builder"

  • 1
    smclarify

    Fairness aware machine learning. Bias detection and mitigation

    ...A facet can have value(s) that designates that sample as "sensitive". Bias detection and mitigation for datasets and models. The label is a column or feature which is the target for training a machine learning model. The label can have value(s) that designates that sample as having a "positive" outcome. A bias measure is a function that returns a bias metric. A bias metric is a numerical value indicating the level of bias detected as determined by a particular bias measure. A collection of bias metrics for a given dataset or a combination of a dataset and model.
    Downloads: 0 This Week
  • 2
    FFCV

    Fast Forward Computer Vision (and other ML workloads!)

    ffcv is a drop-in data loading system that dramatically increases data throughput in model training. From gridding to benchmarking to fast research iteration, there are many reasons to want faster model training. Below we present premade codebases for training on ImageNet and CIFAR, including both (a) extensible codebases and (b) numerous premade training configurations.
    Downloads: 1 This Week
  • 3
    OGB

    Benchmark datasets, data loaders, and evaluators for graph machine

    The Open Graph Benchmark (OGB) is a collection of realistic, large-scale, and diverse benchmark datasets for machine learning on graphs. OGB datasets are automatically downloaded, processed, and split using the OGB Data Loader. Model performance can then be evaluated in a unified manner using the OGB Evaluator. OGB is a community-driven initiative in active development, and we expect the benchmark datasets to evolve. The datasets vary in size and cover a variety of graph machine learning tasks, including prediction of node, link, and graph properties. ...
    Downloads: 0 This Week
  • 4
    LightFM

    A Python implementation of LightFM, a hybrid recommendation algorithm

    LightFM is a Python implementation of a number of popular recommendation algorithms for both implicit and explicit feedback, including efficient implementation of BPR and WARP ranking losses. It's easy to use, fast (via multithreaded model estimation), and produces high-quality results. It also makes it possible to incorporate both item and user metadata into the traditional matrix factorization algorithms. It represents each user and item as the sum of the latent representations of their features, thus allowing recommendations to generalize to new items (via item features) and to new users (via user features).
    Downloads: 0 This Week
  • 5
    Keras Attention Mechanism

    Attention mechanism Implementation for Keras

    ...The attention is expected to be the highest after the delimiters. An overview of the training is shown below, where the top represents the attention map and the bottom the ground truth. As the training progresses, the model learns the task and the attention map converges to the ground truth. We consider many 1D sequences of the same length. The task is to find the maximum of each sequence. We give the full sequence processed by the RNN layer to the attention layer. We expect the attention layer to focus on the maximum of each sequence.
    Downloads: 0 This Week
  • 6
    Merlion

    A Machine Learning Framework for Time Series Intelligence

    Merlion is a Python library for time series intelligence. It provides an end-to-end machine learning framework that includes loading and transforming data, building and training models, post-processing model outputs, and evaluating model performance. It supports various time series learning tasks, including forecasting, anomaly detection, and change point detection for both univariate and multivariate time series. This library aims to provide engineers and researchers a one-stop solution to rapidly develop models for their specific time series needs, and benchmark them across multiple time series datasets.
    Downloads: 0 This Week
  • 7
    Sockeye

    Sequence-to-sequence framework, focused on Neural Machine Translation

    ...It implements distributed training and optimized inference for state-of-the-art models, powering Amazon Translate and other MT applications. For a quickstart guide to training a standard NMT model on any size of data, see the WMT 2014 English-German tutorial. If you are interested in collaborating or have any questions, please submit a pull request or issue. You can also send questions to sockeye-dev-at-amazon-dot-com. Developers may be interested in our developer guidelines. Starting with version 3.0.0, Sockeye is also based on PyTorch. ...
    Downloads: 0 This Week
  • 8
    BERTScore

    BERT score for text generation

    Automatic evaluation metric described in the paper BERTScore: Evaluating Text Generation with BERT (ICLR 2020). We now support about 130 models (see this spreadsheet for their correlations with human evaluation). Currently, the best model is microsoft/deberta-xlarge-mnli; please consider using it instead of the default roberta-large for the best correlation with human evaluation.
    Downloads: 0 This Week
  • 9
    Knet

    Koç University deep learning framework

    Knet.jl is a deep learning package implemented in Julia, so you should be able to run it on any machine that can run Julia. It has been extensively tested on Linux machines with NVIDIA GPUs and CUDA libraries, and it has been reported to work on OSX and Windows. If you would like to try it on your own computer, please follow the instructions on Installation. If you would like to try working with a GPU and do not have access to one, take a look at Using Amazon AWS or Using Microsoft Azure. If...
    Downloads: 0 This Week
  • 10
    ML Visuals

    ML Visuals contains figures and templates which you can reuse

    ML Visuals is an open-source project that provides a collection of reusable diagrams, templates, and visual resources designed to improve communication in machine learning research and education. The repository contains professional-quality figures that illustrate machine learning concepts such as neural networks, optimization methods, model architectures, and common deep learning techniques. These visuals are intended to help researchers, educators, and students create clearer presentations, blog posts, and scientific papers. The project is maintained as a collaborative community effort where contributors can add new diagrams or visual components. Many of the visuals are designed using editable formats such as Google Slides, making it easy for users to customize them for their own work.
    Downloads: 0 This Week
  • 11
    CodeContests

    Large dataset of coding contests designed for AI and ML model training

    CodeContests, developed by Google DeepMind, is a large-scale competitive programming dataset designed for training and evaluating machine learning models on code generation and problem solving. This dataset played a central role in the development of AlphaCode, DeepMind’s model for solving programming problems at a human-competitive level, as published in Science. CodeContests aggregates problems and human-written solutions from multiple programming competition platforms, including AtCoder, Codeforces, CodeChef, Aizu, and HackerEarth. Each problem includes structured metadata, problem descriptions, paired input/output test cases, and multiple correct and incorrect solutions in various programming languages. ...
    Downloads: 2 This Week
  • 12
    Mars Framework

    Mars is a tensor-based unified framework for large-scale data

    Mars is a distributed computing framework designed to scale scientific computing and data science workloads across large clusters while preserving the familiar programming interfaces of common Python libraries. The project provides a tensor-based execution model that extends the capabilities of tools such as NumPy, pandas, and scikit-learn so that large datasets can be processed in parallel without rewriting code for distributed environments. Its architecture automatically divides large computational tasks into smaller chunks that can be executed across multiple nodes in a cluster, allowing complex analytics, machine learning workflows, and data transformations to run efficiently at scale. ...
    Downloads: 1 This Week
  • 13
    2020 Machine Learning Roadmap

    A roadmap connecting many of the most important concepts

    ...The repository organizes machine learning knowledge into a structured roadmap that helps learners understand how different concepts connect within the field. It outlines the typical workflow of solving machine learning problems, starting from problem formulation and data preparation to model training and evaluation. The roadmap also highlights the major technologies and frameworks commonly used in machine learning development. In addition to describing technical tools, the project includes recommended learning resources that help users study the underlying mathematics and algorithms behind machine learning systems. The roadmap is often used as a high-level orientation tool for beginners who want to understand the broader landscape of machine learning.
    Downloads: 0 This Week
  • 14
    DialoGPT

    Large-scale pretraining for dialogue

    DialoGPT is an open-source conversational language model developed by Microsoft Research for generating natural dialogue responses using large-scale transformer architectures. The system is built on the GPT-2 architecture and is designed specifically for multi-turn conversation tasks, enabling machines to produce coherent responses during interactive dialogue. The model was trained on a massive dataset of approximately 147 million conversational exchanges extracted from Reddit discussion threads, allowing it to learn patterns of natural human conversation. ...
    Downloads: 0 This Week
  • 15
    learn-machine-learning-in-two-months

    Essential Knowledge for learning Machine Learning in two months

    ...The project compiles curated resources, tutorials, and practical notebooks that introduce fundamental topics such as mathematics for machine learning, Python programming, and essential libraries like NumPy and TensorFlow. It progressively moves from foundational theory to more advanced subjects including regression, classification, neural networks, and model deployment. The repository emphasizes understanding the underlying principles of machine learning while also providing practical exercises and examples that allow learners to build and experiment with real models. Many sections include notebooks and code examples that demonstrate how algorithms are implemented and trained using modern machine learning frameworks.
    Downloads: 0 This Week
  • 16
    LSTMs for Human Activity Recognition

    Human Activity Recognition example using TensorFlow on smartphone

    LSTM-Human-Activity-Recognition is a machine learning project that demonstrates how recurrent neural networks can be used to recognize human activities from sensor data. The repository implements a deep learning model based on Long Short-Term Memory (LSTM) networks to classify physical activities using time-series data collected from wearable sensors. The project uses the well-known Human Activity Recognition dataset derived from smartphone accelerometer and gyroscope signals. Through the use of sequential neural network architectures, the system learns patterns in motion data that correspond to activities such as walking, sitting, standing, or climbing stairs. ...
    Downloads: 0 This Week
  • 17
    GFPGAN

    GFPGAN aims at developing Practical Algorithms

    ...It leverages rich and diverse priors encapsulated in a pretrained face GAN (e.g., StyleGAN2) for blind face restoration. The V1.3 model produces more natural restoration results, and better results on very low-quality and high-quality inputs.
    Downloads: 106 This Week
  • 18
    Machine Learning Git Codebook

    For extensive instructor-led learning

    ...The project is designed as a self-paced learning resource that walks learners through the full data science workflow, including data preprocessing, exploratory analysis, feature engineering, and model development. It covers a wide range of machine learning techniques such as decision trees, clustering methods, nearest neighbor algorithms, anomaly detection, and probabilistic classifiers. The repository organizes these topics into sequential notebooks that explain theoretical concepts while allowing users to experiment directly with code. ...
    Downloads: 0 This Week
  • 19
    KoboldAI

    Your gateway to GPT writing

    ...Stories can be played like a novel or a text adventure game, or used as a chatbot, with easy toggles to switch between the gameplay styles. This makes KoboldAI a writing assistant, a game, and a platform for much more. How you play and how good the AI is depend on the model or service you decide to use. Whether you want to use the free, fast power of Google Colab, your own high-end graphics card, or an online service you have an API key for (like OpenAI or InferKit), or would rather run it more slowly on your CPU, you will be able to find a way to use KoboldAI that works for you.
    Downloads: 112 This Week
  • 20
    Elephas

    Distributed Deep learning with Keras & Spark

    ...Elephas implements a class of data-parallel algorithms on top of Keras, using Spark's RDDs and data frames. Keras models are initialized on the driver, then serialized and shipped to workers along with data and broadcast model parameters. Spark workers deserialize the model, train on their chunk of data, and send their gradients back to the driver. The "master" model on the driver is updated by an optimizer, which takes gradients either synchronously or asynchronously. Hyperparameter optimization with Elephas is based on hyperas, a convenience wrapper for hyperopt and Keras.
    Downloads: 0 This Week
  • 21
    cortex

    Production infrastructure for machine learning at scale

    Cortex is an open-source platform designed for building, deploying, and managing machine learning applications in production environments. The framework provides infrastructure tools that allow developers to transform trained machine learning models into scalable web services. Cortex handles many operational challenges associated with deploying AI systems, such as managing dependencies, orchestrating data pipelines, and scaling services under load. Developers can define machine learning...
    Downloads: 0 This Week
  • 22
    Coqui STT

    The deep learning toolkit for speech-to-text

    Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. Multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems...
    Downloads: 3 This Week
  • 23
    Apache TVM

    TVM Documentation in Chinese Simplified

    tvm-cn is a community-driven project that provides Chinese documentation for the Apache TVM deep learning compiler stack. Apache TVM is an open-source system designed to optimize and deploy machine learning models efficiently across different hardware platforms such as CPUs, GPUs, and ARM devices. The goal of the repository is to centralize translated learning materials and technical documentation so that Chinese-speaking developers can study the TVM ecosystem more easily. The project...
    Downloads: 0 This Week
  • 24
    Yellowbrick

    Visual analysis and diagnostic tools to facilitate ML selection

    Yellowbrick extends the Scikit-Learn API to make model selection and hyperparameter tuning easier. Under the hood, it’s using Matplotlib. Yellowbrick is a suite of visual diagnostic tools called "Visualizers" that extend the scikit-learn API to allow human steering of the model selection process. In a nutshell, Yellowbrick combines scikit-learn with matplotlib in the best tradition of the scikit-learn documentation, but to produce visualizations for your machine learning workflow.
    Downloads: 0 This Week
  • 25
    Python ML Jupyter Notebooks

    Practice and tutorial-style notebooks

    Python ML Jupyter Notebooks is an educational repository that demonstrates how to implement machine learning algorithms and data science workflows using Python. The project provides numerous examples and tutorials covering classical machine learning techniques such as regression, classification, clustering, and dimensionality reduction. It includes code implementations that show how to build models using popular libraries like scikit-learn, NumPy, pandas, and Matplotlib. The repository is...
    Downloads: 0 This Week