GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Image generation model with single-stream diffusion transformer
Structure-from-Motion and Multi-View Stereo
Python inference and LoRA trainer package for the LTX-2 audio–video
Official code repo for the O'Reilly Book
Kimi K2 is the large language model series developed by Moonshot AI
Official inference repo for FLUX.2 models
NVR with realtime local object detection for IP cameras
AI video generator optimized for low VRAM and older GPUs use
Code for running inference and finetuning with SAM 3 model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
The media player for language learning, with dual subtitles
OBLITERATE THE CHAINS THAT BIND YOU
Open-source AI agent framework
A theoretical reconstruction of the Claude Mythos architecture
Singing Voice Synthesis via Shallow Diffusion Mechanism
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Open-source multi-speaker long-form text-to-speech model
157 models, 30 providers, one command to find what runs on hardware
Code for the paper Language Models are Unsupervised Multitask Learners
Make videos programmatically with React
High-Resolution Image Synthesis with Latent Diffusion Models
Official inference repo for FLUX.1 models