Showing 580 open source projects for "ml"

View related business solutions
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    X For You Feed Algorithm

    X For You Feed Algorithm

    Algorithm powering the For You feed on X

    X For You Feed Algorithm is the open-sourced core recommendation system that powers the For You feed on X (the social network formerly known as Twitter), and it represents one of the first times a major social platform has published production-level ranking code for public review and experimentation. The repository contains the full pipeline that ingests user engagement and content candidate data, processes it through retrieval, hydration, filtering, scoring, and selection layers, and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    RestorePhotos.io

    RestorePhotos.io

    Restoring old and blurry face photos with AI

    ...It wraps the GFPGAN model (served via Replicate) behind a friendly Next.js front end, so non-technical users can upload an image and receive an enhanced version without ever touching ML code. The workflow is straightforward: you upload a photo, the serverless API route sends it to Replicate, and the restored image is returned and displayed in the UI. The project is production-oriented, not just a toy: it uses Bytescale for storage and image processing, Vercel for hosting and serverless functions, Auth.js + Neon for authentication and database, and Upstash Redis for rate limiting. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    SuperDuperDB

    SuperDuperDB

    Integrate, train and manage any AI models and APIs with your database

    Build and manage AI applications easily without needing to move your data to complex pipelines and specialized vector databases. Integrate AI and vector search directly with your database including real-time inference and model training. Just using Python. A single scalable deployment of all your AI models and APIs which is automatically kept up-to-date as new data is processed immediately. No need to introduce an additional database and duplicate your data to use vector search and build on...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Deequ

    Deequ

    Deequ is a library built on top of Apache Spark

    ...It also includes a little domain-specific language called DQDL (Data Quality Definition Language) which allows declarative specification of quality rules. Users typically run Deequ before feeding data downstream (to ML pipelines, analytics, or production systems), enabling early detection and isolation of data errors. There is also a Python wrapper, PyDeequ, for users who prefer working from Python environments.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    The Hundred-Page Machine Learning Book

    The Hundred-Page Machine Learning Book

    The Python code to reproduce illustrations from Machine Learning Book

    The Hundred-Page Machine Learning Book is the official companion repository for The Hundred-Page Machine Learning Book written by machine learning researcher Andriy Burkov. The repository contains Python code used to generate the figures, visualizations, and illustrative examples presented in the book. Its purpose is to help readers better understand the concepts explained in the text by allowing them to run and experiment with the underlying code themselves. The book itself provides a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MediaPipe Solutions

    MediaPipe Solutions

    Cross-platform, customizable ML solutions

    MediaPipe is an open-source framework developed by Google for building cross-platform machine learning pipelines that process audio, video, and other streaming data in real time. The system provides developers with tools and reusable components that allow them to combine multiple machine learning models with preprocessing and postprocessing logic into efficient perception pipelines. These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Chinese-XLNet

    Chinese-XLNet

    Chinese XLNet pre-trained model

    Chinese-XLNet is a Chinese language pre-trained model based on the XLNet architecture, providing an advanced foundation for natural language processing tasks in Mandarin and other Chinese dialects. Unlike traditional masked language modeling, XLNet uses a permutation language modeling objective that captures bidirectional context more effectively by training over all possible token orderings, yielding richer contextual representations. This model is trained on large-scale Chinese text...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    PythonPark

    PythonPark

    Python open source project "The Road to Self-Study Programming"

    PythonPark is a large, curated “learning playground” for Python — essentially a comprehensive self-study meta-repository aimed at helping learners progress in Python programming, data science, machine learning, web scraping, and software engineering practices. It aggregates tutorials, learning guides, project examples, and resources across topics: from Python basics and data structures to machine learning, web scraping, and even interview preparation and “programmer life” guidance. Because...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    EKS Best Practices

    EKS Best Practices

    A best practices guide for day 2 operations

    The Amazon EKS Best Practices Guide is a public repository containing comprehensive documentation and guidance for operating production-grade Kubernetes clusters on AWS’s managed service, Amazon EKS. Rather than a code library, it serves as a reference catalogue of patterns, anti-patterns, checklists and architectures across domains such as security, reliability, scalability, networking, cost optimization and hybrid cloud deployments. The repository is maintained by AWS but open to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    PLT (Programming Languages Theory)

    PLT (Programming Languages Theory)

    Programming Language Theory

    Curated roadmap to Programming Language Theory, collecting seminal papers, books, and resources into a navigable structure for self-study. It spans foundational topics like lambda calculus, type systems, interpreters, compilers, and formal semantics, while also pointing to contemporary areas such as effect systems, dependent types, and verification. Each section clusters materials by theme so learners can build understanding step by step instead of grazing at random. The list emphasizes...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TensorFlow Probability

    TensorFlow Probability

    Probabilistic reasoning and statistical analysis in TensorFlow

    ...TensorFlow Probability (TFP) is a Python library built on TensorFlow that makes it easy to combine probabilistic models and deep learning on modern hardware (TPU, GPU). It's for data scientists, statisticians, ML researchers, and practitioners who want to encode domain knowledge to understand data and make predictions. Since TFP inherits the benefits of TensorFlow, you can build, fit, and deploy a model using a single language throughout the lifecycle of model exploration and production. TFP is open source and available on GitHub. Tools to build deep probabilistic models, including probabilistic layers and a `JointDistribution` abstraction. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    tvm

    tvm

    Open deep learning compiler stack for cpu, gpu, etc.

    Apache TVM is an open source machine learning compiler framework for CPUs, GPUs, and machine learning accelerators. It aims to enable machine learning engineers to optimize and run computations efficiently on any hardware backend. The vision of the Apache TVM Project is to host a diverse community of experts and practitioners in machine learning, compilers, and systems architecture to build an accessible, extensible, and automated open-source framework that optimizes current and emerging...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    AWS Neuron

    AWS Neuron

    Powering Amazon custom machine learning chips

    ...Using Neuron developers can easily train their machine learning models on any popular framework such as TensorFlow, PyTorch, and MXNet, and run it optimally on Amazon EC2 Inf1 instances. You can continue to use the same ML frameworks you use today and migrate your software onto Inf1 instances with minimal code changes and without tie-in to vendor-specific solutions. Neuron is pre-integrated into popular machine learning frameworks like TensorFlow, MXNet and Pytorch to provide a seamless training-to-inference workflow. It includes a compiler, runtime driver, as well as debug and profiling utilities with a TensorBoard plugin for visualization.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    autoresearch

    autoresearch

    AI agents autonomously run and improve ML experiments overnight

    autoresearch is an experimental framework that enables AI agents to autonomously conduct machine learning research by iteratively modifying and training models. Created by Andrej Karpathy, the project allows an agent to edit the model training code, run short experiments, evaluate results, and repeat the process without human intervention. Each experiment runs for a fixed five-minute training window, enabling rapid iteration and consistent comparison across architectural or hyperparameter...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Open LLMs

    Open LLMs

    A list of open LLMs available for commercial use

    Open LLMs, by the same author behind applied-ml — serves as a curated directory of open large language models (LLMs) that are available for commercial or open-source use. Rather than proprietary or closed-source LLMs, this repo focuses on freely available or permissively licensed models that practitioners can download, run, fine-tune or integrate without restrictive licensing. For teams or developers interested in experimenting with LLMs but wanting to avoid vendor lock-in or licensing constraints, open-llms offers a practical starting point. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Python-Spider

    Python-Spider

    Python3 web crawler practice

    Python-Spider is a repository intended to teach or provide examples for writing web spiders / crawlers in Python — part of a broader learning and resource collection by its author. The code and documentation are oriented toward beginners or intermediate learners who want to learn how to fetch, parse, and extract data from websites programmatically. As part of the author’s public learning-path repositories, python-spider likely includes examples of HTTP requests, HTML parsing, maybe...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Homemade Machine Learning

    Homemade Machine Learning

    Python examples of popular machine learning algorithms

    homemade-machine-learning is a repository by Oleksii Trekhleb containing Python implementations of classic machine-learning algorithms done “from scratch”, meaning you don’t rely heavily on high-level libraries but instead write the logic yourself to deepen understanding. Each algorithm is accompanied by mathematical explanations, visualizations (often via Jupyter notebooks), and interactive demos so you can tweak parameters, data, and observe outcomes in real time. The purpose is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    ODD Platform

    ODD Platform

    First open-source data discovery and observability platform

    Unlock the power of big data with OpenDataDiscovery Platform. Experience seamless end-to-end insights, powered by unprecedented observability and trust - from ingestion to production - while building your ideal tech stack! Democratize data and accelerate insights. Find data that fits your use case and discover hints left by your peers to leverage existing knowledge. Explore tags, ownership details, links to other sources and other information to shorten and simplify data discovery phase....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Perfect Roadmap To Learn Data Science

    Perfect Roadmap To Learn Data Science

    Basic To Intermediate Python data science guide

    Perfect Roadmap To Learn Data Science In 2025 is an extended, updated learning pathway curated for the modern data-science landscape — blending classical data-analysis, statistics, machine learning, deep learning, computer vision, NLP, as well as current deployment and MLOps practices to prepare learners for data-science careers in 2025. The roadmap is organized to guide learners systematically: starting with Python fundamentals and math/statistics, then progressing through classical...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Darts

    Darts

    A python library for easy manipulation and forecasting of time series

    ...The library also makes it easy to backtest models, combine the predictions of several models, and take external data into account. Darts supports both univariate and multivariate time series and models. The ML-based models can be trained on potentially large datasets containing multiple time series, and some of the models offer a rich support for probabilistic forecasting. We recommend to first setup a clean Python environment for your project with at least Python 3.7 using your favorite tool (conda, venv, virtualenv with or without virtualenvwrapper).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MLton

    MLton

    A whole-program optimizing compiler for Standard ML

    MLton is a whole-program optimizing compiler for Standard ML. MLton generates small executables with excellent runtime performance, utilizing untagged and unboxed native integers, reals, and words, unboxed native arrays, fast arbitrary-precision arithmetic based on GnuMP, and multiple code generation and garbage collection strategies. In addition, MLton provides a feature rich Standard ML programming environment, with full support for SML97 as given in The Definition of Standard ML (Revised), a number of useful language extensions, a complete implementation of the Standard ML Basis Library, various useful libraries, a simple and fast C foreign function interface, the ML Basis system for programming with source libraries, and tools such as a lexer generator, a parser generator, and a profiler.
    Leader badge
    Downloads: 18 This Week
    Last Update:
    See Project
  • 22
    PoseidonQ  - AI/ML Based QSAR Modeling

    PoseidonQ - AI/ML Based QSAR Modeling

    ML based QSAR Modelling And Translation of Model to Deployable WebApps

    ...Link : https://pubs.acs.org/doi/10.1021/acs.jcim.4c02372 - Simple to use and no compromise on essential features necessary to make reliable QSAR models. - From Generating Reliable ML Based QSAR Models to Developing Your Own QSAR WebApp. For any feedback or queries, contact kabeermuzammil614@gmail.com - Available on Windows and Linux - Software Authorship - Muzammil Kabier -If You are Facing Issues in Deployment to Streamlit, Try 'requirements.txt' in the Github repo or The Files Deposited Here.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 23
    MLPerf

    MLPerf

    Reference implementations of MLPerf™ training benchmarks

    ...These implementations are valid as starting points for benchmark implementations but are not fully optimized and are not intended to be used for "real" performance measurements of software frameworks or hardware. Benchmarking the performance of training ML models on a wide variety of use cases, software, and hardware drives AI performance across the tech industry. The MLPerf Training working group draws on expertise in AI and the technology that powers AI from across the industry to design and create industry-standard benchmarks. Together, we create the reference implementations, rules, policies, and procedures to benchmark a wide variety of AI workloads.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    ML YouTube Courses

    ML YouTube Courses

    Discover the latest machine learning / AI courses on YouTube

    ...Rather than hosting course content itself, it acts as a discovery hub pointing learners to the most valuable freely available video curricula. The list is periodically updated to include new courses and maintain relevance. Overall, ML-YouTube-Courses functions as a centralized directory for self-paced AI education through video learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    vidurOS

    vidurOS

    An ultra-light Linux distro built for cybersecurity, AI/ML work

    🚀 Introducing VidurOS — A Custom Linux Distro for Cybersecurity, Programming & AI/ML After weeks of deep customization, testing, and design, I’m excited to announce the launch of VidurOS 🎉 🔥 What is VidurOS? A lightweight, blazing-fast Linux distribution built from Ubuntu Server 22.04 — optimized for: • 🛡️ Cybersecurity Tools (Nmap, Wireshark, Metasploit & more) • 💻 Programmers & Devs (Python, Node.js, C/C++, Git, Vim) • 🧠 AI/ML Enthusiasts (Jupyter, pip, SciPy preloaded — with TensorFlow/PyTorch optional) • 💾 Old PCs & VMs (XFCE-based, ISO size ~1.7 GB) 🌟 Built using Cubic, with custom theming, a responsive UI/UX, and designed to revive low-spec hardware...
    Downloads: 6 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB