One minute of voice data is enough to train a good TTS model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Python tool for converting files and office documents to Markdown
Improve your Baduk skills by training with KataGo
OCR software, free and offline
A modular, primitive-first, Python-first PyTorch library
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A high-throughput and memory-efficient inference and serving engine
Open-source, high-performance AI model with advanced reasoning
Agent harness for AI coding agents
Agentic, Reasoning, and Coding (ARC) foundation models
A Lightweight Face Recognition and Facial Attribute Analysis
Code for running inference and finetuning with SAM 3 model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
The highest-scoring AI memory system ever benchmarked
A lightweight audio-to-MIDI converter with pitch bend detection
Fast Stable Diffusion on CPU and AI PC
NVR with realtime local object detection for IP cameras
Official inference repo for FLUX.2 models
Generate short videos with one click using an LLM
GPT-4o for Windows, macOS, and Linux
An open-source implementation of NotebookLM with more flexibility
Open-source AI agent framework
AI video generator optimized for low-VRAM and older GPUs
High-Resolution 3D Assets Generation with Large Scale Diffusion Models