Wan2.2: Open and Advanced Large-Scale Video Generative Model
Fast and memory-efficient exact attention
Open-source, high-performance AI model with advanced reasoning
Wan2.1: Open and Advanced Large-Scale Video Generative Model
1 min voice data can also be used to train a good TTS model
Agentic, Reasoning, and Coding (ARC) foundation models
A Lightweight Face Recognition and Facial Attribute Analysis
AI video generator optimized for low VRAM and older GPUs use
Python tool for converting files and office documents to Markdown
Fast stable diffusion on CPU and AI PC
OCR software, free and offline
Advanced language and coding AI model
Improve your Baduk skills by training with KataGo
Universal LLM Deployment Engine with ML Compilation
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
NVR with realtime local object detection for IP cameras
AI Fully Automated Short Video Engine
AI agent harness for AI coding agents
A lightweight audio-to-MIDI converter with pitch bend detection
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A sound cloning tool with a web interface, using your voice
Open-source AI agent framework
Automatic Speech Recognition with Word-level Timestamps
Faster Whisper transcription with CTranslate2
Python inference and LoRA trainer package for the LTX-2 audio–video