A JAX-native LLM Post-Training Library
Flax is a neural network library for JAX
Z80-μLM is a 2-bit quantized language model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Multi-modal large language model designed for audio understanding
Gem Dash aka Boulder or Dyna Blaster like 8-bit style game in Python
Optical-packet node transceiver frequency allocation
Open Multilingual Multimodal Chat LMs
UtilityHub is a lightweight, all-in-one desktop utility.
Evolutionary Algorithm using Python
A repo for distributed training of language models with Reinforcement
Reinforcement learning (RL) tutorial series
The most simple, flexible, and comprehensive OpenAI Gym trading
Quantitative analysis, strategies and backtests
Trading backtesting environment for training reinforcement learning
High-quality single-file implementations of SOTA Offline
Code for "Learning to summarize from human feedback"
A PyTorch Library for Meta-learning Research
Implementations of basic RL algorithms with minimal lines of codes
TradeMaster is an open-source platform for quantitative trading
Massively Parallel Deep Reinforcement Learning
A high-performance distributed training framework
Implementation of Reinforcement Learning Algorithms. Python, OpenAI
Implementations and code to accompany DeepMind publications