A PyTorch-based Speech Toolkit
Automatically find issues in image datasets
Sample applications for Google Kubernetes Engine (GKE)
Large Language Model Principles and Practice Tutorial from Scratch
Minimal examples of data structures and algorithms in Python
Implement CPU from scratch and play with large model deployments
Bidirectional token-classification model for identifiable info
Open-source large language model family from Tencent Hunyuan
Claude + Obsidian knowledge companion
Genome modeling and design across all domains of life
Cloud-native open source data warehouse for analytics and AI queries
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Experimental, AI/ML-powered and open sourced Marketing Mix Modeling
Benchmarking synthetic data generation methods
Data and tools for generating and inspecting OLMo pre-training data
A collection of learning resources for curious software engineers
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
A frontier, first-principles handbook
Multimodal embedding and reranking models built on Qwen3-VL
"Big Model" trains a visual multimodal VLM with 26M parameters
Pythonic tool for running machine-learning/high performance workflows
Conditional GAN for generating synthetic tabular data
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Open source clone of the Age of Empires II engine