checkpoint free download

Showing 18 open source projects for "checkpoint"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
1

HRM-Text

1B text generation model based on the HRM architecture

...The system combines hierarchical recurrent design, task-completion strengthening, and latent-space reasoning. Its training stack includes PrefixLM sequence packing, FlashAttention 3 kernels, PyTorch FSDP2, evaluation scripts, and checkpoint conversion tools. The repository supports reference pretraining runs for smaller and larger configurations, with Hopper-class GPUs expected for the attention path. It is useful for researchers and engineers exploring efficient language model pretraining, reasoning-focused architectures, and reproducible foundation model experiments.

Downloads: 0 This Week

Last Update: 2026-06-17
See Project
2

AUTOMATIC1111 Stable Diffusion web UI

Stable Diffusion web UI

AUTOMATIC1111's stable-diffusion-webui is a powerful, user-friendly web interface built on the Gradio library that allows users to easily interact with Stable Diffusion models for AI-powered image generation. Supporting both text-to-image (txt2img) and image-to-image (img2img) generation, this open-source UI offers a rich feature set including inpainting, outpainting, attention control, and multiple advanced upscaling options. With a flexible installation process across Windows, Linux, and...

1 Review

Downloads: 199 This Week

Last Update: 2025-06-02
See Project
3

Llama Recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method

The 'llama-recipes' repository is a companion to the Meta Llama models. We support the latest version, Llama 3.1, in this repository. The goal is to provide a scalable library for fine-tuning Meta Llama models, along with some example scripts and notebooks to quickly get started with using the models in a variety of use-cases, including fine-tuning for domain adaptation and building LLM-based applications with Llama and other tools in the LLM ecosystem. The examples here showcase how to run...

Downloads: 0 This Week

Last Update: 2025-01-22
See Project
4

Gemma in PyTorch

The official PyTorch implementation of Google's Gemma models

gemma_pytorch provides the official PyTorch reference for running and fine-tuning Google’s Gemma family of open models. It includes model definitions, configuration files, and loading utilities for multiple parameter scales, enabling quick evaluation and downstream adaptation. The repository demonstrates text generation pipelines, tokenizer setup, quantization paths, and adapters for low-rank or parameter-efficient fine-tuning. Example notebooks walk through instruction tuning and evaluation...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
Compliant and Reliable File Transfers Backed by Top Security Certifications
Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.

Start Free Trial
5

Harness-1

Ultra Recipe for Training Long-Horizon Search Agents

...This lets the policy focus on higher-level decisions instead of trying to keep every detail inside the model context. The repository includes inference utilities, training scripts, evaluation runners, dataset tools, and documentation for running the released checkpoint. Its main value is showing how a smaller open model can approach advanced search-agent behavior through structured retrieval state and reinforcement learning.

Downloads: 0 This Week

Last Update: 2026-06-15
See Project
6

opcode

A powerful GUI app and Toolkit for Claude Code

opcode is an open source desktop application and toolkit designed to enhance the developer experience when working with Claude Code by providing a graphical interface and advanced workflow management tools. The project acts as a command center for AI-assisted programming, bridging the gap between command-line workflows and modern visual development environments. Built using the Tauri framework, Opcode enables developers to manage multiple Claude sessions, create custom agents, and track...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
7

Coconut

Training Large Language Model to Reason in a Continuous Latent Space

Coconut is the official PyTorch implementation of the research paper “Training Large Language Models to Reason in a Continuous Latent Space.” The framework introduces a novel method for enhancing large language models (LLMs) with continuous latent reasoning steps, enabling them to generate and refine reasoning chains within a learned latent space rather than relying solely on discrete symbolic reasoning. It supports training across multiple reasoning paradigms—including standard...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
8

Zypher Agent

A minimal yet powerful framework for creating AI agents

Zypher Agent is an open-source framework for building full-featured AI agents that can be embedded directly into applications, enabling reactive decision loops where the agent dynamically chooses its next actions. Unlike workflow-style orchestrators, it uses a reactive agent loop that interprets the task, reasons about next steps via LLMs, and integrates directly with extensible tools and external services. Zypher prioritizes native support for multiple model providers such as OpenAI and...

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
9

Extended Dreambooth How-To Guides

Implementation of Dreambooth

Extended Dreambooth How-To Guides is an implementation and extended toolkit for fine-tuning Stable Diffusion models using the DreamBooth technique, enabling users to train AI image generators to reproduce specific subjects, styles, or identities from a small set of reference images. The project adapts and expands upon earlier DreamBooth research by providing practical scripts, notebooks, and workflows that allow users to train personalized models on local machines, cloud environments, or...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

OpenNMT-tf

Neural machine translation and sequence learning using TensorFlow

OpenNMT is an open-source ecosystem for neural machine translation and neural sequence learning. OpenNMT-tf is a general-purpose sequence learning toolkit using TensorFlow 2. While neural machine translation is the main target task, it has been designed to more generally support sequence-to-sequence mapping, sequence tagging, sequence classification, language modeling. Models are described with code to allow training custom architectures and overriding default behavior. For example, the...

Downloads: 0 This Week

Last Update: 2023-08-04
See Project
11

Alpaca.cpp

Locally run an Instruction-Tuned Chat-Style LLM

...Download the zip file corresponding to your operating system from the latest release. The weights are based on the published fine-tunes from alpaca-lora, converted back into a PyTorch checkpoint with a modified script and then quantized with llama.cpp the regular way.

1 Review

Downloads: 7 This Week

Last Update: 2023-03-24
See Project
12

Keepsake

Version control for machine learning

Keepsake is a Python library that uploads files and metadata (like hyperparameters) to Amazon S3 or Google Cloud Storage. You can get the data back out using the command-line interface or a notebook.

Downloads: 0 This Week

Last Update: 2024-08-15
See Project
13

GPT-2 FR

GPT-2 French demo | Démo française de GPT-2

...Create an environment then install the two packages pip install tensorflow==1.14 gpt-2-simple. A script and a notebook are available in the src folder to fine-tune GPT-2 on your own datasets. The output of each workout, i.e. the folder checkpoint/run1, is to be put ingpt2-model/model1 model2 model3 etc. You can run the script deploy_cloudrun.shto deploy all your different models (into gpt2-model) at once. However, you must have already initialized the gcloud CLI tool (Cloud SDK).

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
14

Open BEAGLE

Evolutionary Computation Framework in C++

Open BEAGLE is a C++ Evolutionary Computation (EC) framework. It provides an high-level software environment to do any kind of EC, with support for tree-based genetic programming; bit string, integer-valued vector, and real-valued vector genetic algorithms; and evolution strategy. The Open BEAGLE architecture follows strong principles of object oriented programming, where abstractions are represented by loosely coupled objects and where it is common and easy to reuse code. Open BEAGLE is...

3 Reviews

Downloads: 3 This Week

Last Update: 2013-04-25
See Project
15

mms-300m-1130-forced-aligner

CTC-based forced aligner for audio-text in 158 languages

mms-300m-1130-forced-aligner is a multilingual forced alignment model based on Meta’s MMS-300M wav2vec2 checkpoint, adapted for Hugging Face’s Transformers library. It supports forced alignment between audio and corresponding text across 158 languages, offering broad multilingual coverage. The model enables accurate word- or phoneme-level timestamping using Connectionist Temporal Classification (CTC) emissions. Unlike other tools, it provides significant memory efficiency compared to the TorchAudio forced alignment API. ...

Downloads: 0 This Week

Last Update: 2025-07-02
See Project
16

fashion-clip

CLIP model fine-tuned for zero-shot fashion product classification

...The model learns to align product images and descriptive text using contrastive learning, enabling it to perform well across various fashion-related tasks without additional supervision. FashionCLIP 2.0, the latest version, uses the laion/CLIP-ViT-B-32-laion2B-s34B-b79K checkpoint for improved accuracy, achieving better F1 scores across multiple benchmarks compared to earlier versions. It supports multilingual fashion queries and works best with clean, product-style images against white backgrounds. The model can be used for product search, recommendation systems, or visual tagging in e-commerce platforms.

Downloads: 0 This Week

Last Update: 2025-07-02
See Project
17

Mistral Large 3 675B Instruct 2512 NVFP4

Quantized 675B multimodal instruct model optimized for NVFP4

Mistral Large 3 675B Instruct 2512 NVFP4 is a frontier-scale multimodal Mixture-of-Experts model featuring 675B total parameters and 41B active parameters, trained from scratch on 3,000 H200 GPUs. This NVFP4 checkpoint is a post-training-activation quantized version of the original instruct model, created through a collaboration between Mistral AI, vLLM, and Red Hat using llm-compressor. It retains the same instruction-tuned behavior as the FP8 model, making it ideal for production assistants, agentic workflows, scientific tasks, and long-context enterprise systems. ...

Downloads: 0 This Week

Last Update: 2025-12-03
See Project
18

VaultGemma

VaultGemma: 1B DP-trained Gemma variant for private NLP tasks

...Training ran on TPU v6e using JAX and Pathways with privacy-preserving algorithms (DP-SGD, truncated Poisson subsampling) and DP scaling laws to balance compute and privacy budgets. Benchmarks on the 1B pre-trained checkpoint show expected utility trade-offs (e.g., HellaSwag 10-shot 39.09, BoolQ 0-shot 62.04, PIQA 0-shot 68.00), reflecting its privacy-first design.

Downloads: 0 This Week

Last Update: 2025-09-17
See Project