Advanced language and coding AI model
Official code repo for the O'Reilly Book
Official inference repo for FLUX.2 models
NVR with realtime local object detection for IP cameras
Synchronized Translation for Videos
AI video generator optimized for low VRAM and older GPUs use
Image generation model with single-stream diffusion transformer
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Singing Voice Synthesis via Shallow Diffusion Mechanism
Kimi K2 is the large language model series developed by Moonshot AI
Code for the paper Language Models are Unsupervised Multitask Learners
A theoretical reconstruction of the Claude Mythos architecture
Code for running inference and finetuning with SAM 3 model
Qwen3-TTS is an open-source series of TTS models
The media player for language learning, with dual subtitles
Official inference repo for FLUX.1 models
High-Resolution Image Synthesis with Latent Diffusion Models
Universal LLM Deployment Engine with ML Compilation
Python inference and LoRA trainer package for the LTX-2 audio–video
VGGFace2 Dataset for Face Recognition
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A graphical frontend to tesseract-ocr
On-device AI agent Chrome extension powered by Transformers.js