train free download - SourceForge

Showing 667 open source projects for "train"

View related business solutions

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

Train LLM From Scratch

A straightforward method for training your LLM

Train LLM From Scratch is an educational PyTorch project that shows how to build and train a transformer-based language model from the ground up. It is based on the architecture described in Attention Is All You Need and is designed to make the training pipeline understandable rather than hidden behind a large framework. The repository walks through the process from downloading data to generating text with a trained model.

Downloads: 2 This Week

Last Update: 7 days ago
See Project
2

How to Train Your GPT

Build a modern LLM from scratch. Every line commented

How to Train Your GPT is an interactive textbook that teaches users how to build, train, and run a modern language model from scratch. It is written for learners with minimal machine-learning background, using simple explanations, commented code, and practical examples. The project covers the same broad family of architecture behind systems such as GPT-style models, LLaMA-style models, Claude-style systems, and Mistral-style models.

Downloads: 2 This Week

Last Update: 6 days ago
See Project
3

OpenVINO Training Extensions

Trainable models and NN optimization tools

OpenVINO™ Training Extensions provide a convenient environment to train Deep Learning models and convert them using the OpenVINO™ toolkit for optimized inference. When ote_cli is installed in the virtual environment, you can use the ote command line interface to perform various actions for templates related to the chosen task type, such as running, training, evaluating, exporting, etc. ote train trains a model (a particular model template) on a dataset and saves results in two files. ote optimize optimizes a pre-trained model using NNCF or POT depending on the model format. ...

Downloads: 1 This Week

Last Update: 6 days ago
See Project
4

SageMaker Training Toolkit

Train machine learning models within Docker containers

Train machine learning models within a Docker container using Amazon SageMaker. Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code.

Downloads: 0 This Week

Last Update: 2025-09-22
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

Ludwig

A codeless platform to train and test deep learning models

Ludwig is a toolbox built on top of TensorFlow that allows to train and test deep learning models without the need to write code. All you need to provide is a CSV file containing your data, a list of columns to use as inputs, and a list of columns to use as outputs, Ludwig will do the rest. Simple commands can be used to train models both locally and in a distributed way, and to use them to predict on new data.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
6

Phenaki - Pytorch

Implementation of Phenaki Video, which uses Mask GIT

...It will also combine another technique involving a token critic for potentially even better generations. A new paper suggests that instead of relying on the predicted probabilities of each token as a measure of confidence, one can train an extra critic to decide what to iteratively mask during sampling. This repository will also endeavor to allow the researcher to train on text-to-image and then text-to-video. Similarly, for unconditional training, the researcher should be able to first train on images and then fine tune on video.

Downloads: 0 This Week

Last Update: 2024-07-29
See Project
7

Porcupine

On-device wake word detection powered by deep learning

...Linux (x86_64), macOS (x86_64, arm64), and Windows (x86_64). Scalable. It can detect multiple always-listening voice commands with no added runtime footprint. Self-service. Developers can train custom wake word models using Picovoice Console. Porcupine is the right product if you need to detect one or a few static (always-listening) voice commands. If you want to create voice experiences similar to Alexa or Google, see the Picovoice platform.

Downloads: 7 This Week

Last Update: 2025-12-11
See Project
8

GluonTS

Probabilistic time series modeling in Python

GluonTS is a Python package for probabilistic time series modeling, focusing on deep learning based models. GluonTS requires Python 3.6 or newer, and the easiest way to install it is via pip. We train a DeepAR-model and make predictions using the simple "airpassengers" dataset. The dataset consists of a single time-series, containing monthly international passengers between the years 1949 and 1960, a total of 144 values (12 years * 12 months). We split the dataset into train and test parts, by removing the last three years (36 months) from the train data. ...

Downloads: 0 This Week

Last Update: 15 hours ago
See Project
9

LLM From Scratch

Build and train a GPT-style language model

LLM From Scratch is a hands-on educational workshop project that teaches developers how to build and train a GPT-style language model entirely from scratch using PyTorch. Instead of relying on high-level abstractions or prebuilt frameworks, the project walks users through implementing every core component manually, including tokenization, transformer architecture, training loops, and autoregressive text generation. The repository is intentionally simplified to focus on conceptual clarity, using a compact model of roughly 10 million parameters that can train on consumer hardware such as laptops within a relatively short time. ...

Downloads: 0 This Week

Last Update: 2026-05-07
See Project
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
10

ShoppingAgent

Custom Chinese chatbot with Seq2Seq, GPT, and agent features

ShoppingAgent is an open source Chinese conversational AI system that allows users to build and train their own chatbot using custom datasets. It provides multiple implementations of chatbot architectures, including traditional Seq2Seq models as well as newer GPT-style approaches, reflecting the evolution of conversational AI techniques. ShoppingAgent is structured to support experimentation across different deep learning frameworks such as TensorFlow, PyTorch, and MindSpore, giving developers flexibility in how they train and deploy models. ...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
11

SageMaker Python SDK

Training and deploying machine learning models on Amazon SageMaker

SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker. With the SDK, you can train and deploy models using popular deep learning frameworks Apache MXNet and TensorFlow. You can also train and deploy models with Amazon algorithms, which are scalable implementations of core machine learning algorithms that are optimized for SageMaker and GPU training. If you have your own algorithms built into SageMaker-compatible Docker containers, you can train and host models using these as well.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
12

Determined

Determined, deep learning training platform

...Determined’s cluster scheduling offers first-class support for deep learning and seamless spot instance support. Check out examples of how you can use Determined to train popular deep learning models at scale.

Downloads: 0 This Week

Last Update: 2025-03-19
See Project
13

OpenClaw-RL

Train any agents simply by 'talking'

OpenClaw-RL is an open-source reinforcement learning framework designed to train and personalize AI agents built on the OpenClaw ecosystem. The project focuses on enabling agents to improve their behavior through interactive learning rather than relying solely on static prompts or predefined skills. One of its key ideas is allowing users to train an AI agent simply by interacting with it conversationally, using natural language feedback to guide the learning process.

Downloads: 0 This Week

Last Update: 2026-05-23
See Project
14

Hivemind

Decentralized deep learning in PyTorch. Built to train models

...Fault-tolerant backpropagation: forward and backward passes succeed even if some nodes are unresponsive or take too long to respond. Decentralized parameter averaging: iteratively aggregate updates from multiple workers without the need to synchronize across the entire network. Train neural networks of arbitrary size: parts of their layers are distributed across the participants with the Decentralized Mixture-of-Experts. If you have succesfully trained a model or created a downstream repository with the help of our library, feel free to submit a pull request that adds your project to the list.

Downloads: 0 This Week

Last Update: 2026-01-03
See Project
15

WeChatMsg

Project aimed at extracting, exporting, and analyzing chat records

...Beyond simple export, the project includes mechanisms for analyzing chat histories and generating annual reports or visual summaries about messaging trends, interaction patterns, and more. The original README communicates a guiding philosophy about owning personal data and using it responsibly to train personalized AI agents or preserve memories. Although the repository has seen periods of inactivity and may not receive frequent updates, its widespread use indicates community interest in preserving chat logs and understanding conversation data outside of the WeChat interface.

Downloads: 233 This Week

Last Update: 2026-02-06
See Project
16

MiniMind

Train a 26M-parameter GPT from scratch in just 2h

minimind is a framework that enables users to train a 26-million-parameter GPT (Generative Pre-trained Transformer) model from scratch in approximately two hours. It provides a streamlined process for data preparation, model training, and evaluation, making it accessible for individuals and organizations to develop their own language models without extensive computational resources.

Downloads: 0 This Week

Last Update: 2025-10-21
See Project
17

LeetCode

Solutions to LeetCode by Go, 100% test coverage

Aimed towards programming enthusiasts who want to improve algorithm capabilities through LeetCode, containing many algorithm questions. Most of them are real interview questions of Google, Facebook, LinkedIn, Apple, etc. and it always help to sharp our algorithm Skills. Level up your coding skills and quickly land a job. This is the best place to expand your knowledge and get prepared for your next interview. This repo shows the solutions in Go with the code style strictly following the...

Downloads: 263 This Week

Last Update: 1 day ago
See Project
18

LLM Datasets

Curated list of datasets and tools for post-training

...Quality is a recurring theme: examples and utilities help filter low-value samples, enforce length limits, and split train/validation consistently so results are comparable. Licensing and provenance are surfaced to encourage compliant usage and to guide dataset selection in commercial settings. For practitioners, the repo is a practical “starting pantry” that accelerates experimentation and helps keep data wrangling from dominating the project timeline.

Downloads: 0 This Week

Last Update: 2026-04-29
See Project
19

Habitat-Lab

A modular high-level library to train embodied AI agents

Habitat-Lab is a modular high-level library for end-to-end development in embodied AI. It is designed to train agents to perform a wide variety of embodied AI tasks in indoor environments, as well as develop agents that can interact with humans in performing these tasks. Allowing users to train agents in a wide variety of single and multi-agent tasks (e.g. navigation, rearrangement, instruction following, question answering, human following), as well as define novel tasks. ...

Downloads: 0 This Week

Last Update: 2026-05-07
See Project
20

YOLOv9

Learning What You Want to Learn Using Programmable Gradient Info

...It is a modern object detection repository focused on improving how deep networks preserve useful information during training. The project introduces Programmable Gradient Information and the GELAN architecture to improve gradient flow, parameter efficiency, and train-from-scratch performance. It provides scripts and model assets for training, testing, and running inference on detection tasks. YOLOv9 is designed for real-time detection scenarios where both accuracy and efficiency matter. It is especially relevant for researchers and engineers comparing next-generation YOLO architectures or building production computer vision systems.

Downloads: 1 This Week

Last Update: 2026-06-02
See Project
21

nanoGPT

The simplest, fastest repository for training/finetuning models

...It distills the GPT architecture into a few hundred lines of Python code, making it far easier to understand than large, production-scale implementations. The repo is organized with a training pipeline (dataset preprocessing, model definition, optimizer, training loop) and inference script so you can train a small GPT on text datasets like Shakespeare or custom corpora. It emphasizes readability and clarity: the training loop is cleanly written, and the code avoids heavy abstractions, letting students follow the architecture step by step. While simple, it can still train non-trivial models on modern GPUs and generate coherent text. ...

Downloads: 4 This Week

Last Update: 2025-11-12
See Project
22

lightning AI

The most intuitive, flexible, way for researchers to build models

...Download the code and type 'lightning run app'. Feel free to ssh into any machine and run from there as well. In research, we often have multiple separate scripts to train models, finetune them, collect results and more.

Downloads: 6 This Week

Last Update: 2026-05-27
See Project
23

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle

PaddleOCR offers exceptional, multilingual, and practical Optical Character Recognition (OCR) tools that can help users train better models and apply them into practice. Inspired by PaddlePaddle, PaddleOCR is an ultra lightweight OCR system, with multilingual recognition, digit recognition, vertical text recognition, as well as long text recognition. It features a PPOCR series of high-quality pre-trained models, which includes: ultra lightweight ppocr_mobile series models, general ppocr_server series models, and ultra lightweight compression ppocr_mobile_slim series models. ...

Downloads: 60 This Week

Last Update: 2026-06-11
See Project
24

GPT-SoVITS

1 min voice data can also be used to train a good TTS model

GPT‑SoVITS is a state-of-the-art voice conversion and TTS system that enables zero‑shot and few‑shot synthesis based on a short vocal sample (e.g., 5 seconds). It supports cross‑lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It's powered by VITS architecture enhanced for few‑sample adaptation and real‑time usability.

Downloads: 44 This Week

Last Update: 2025-07-29
See Project
25

Train Traffic Simulator

A game about station masters job , managing trains which are arriving

This program is a simulator for managing arrival and departure for different train traffic for different station , here the player acts just like a station master . Trains are created at different intervals of time with some metadata , the player is supposed to modify the signals and the switches to stop trains at the station or let them pass without any interruption. The goal will be to prevent delays , achieve maximum throughput and most importantly not cause collisions .

Downloads: 2 This Week

Last Update: 2025-05-24
See Project