State-of-the-art 2D and 3D Face Analysis Project
Advanced language and coding AI model
The most powerful and modular diffusion model GUI, api and backend
3D reconstruction software
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Run Local LLMs on Any Device. Open-source
From Images to High-Fidelity 3D Assets
Agentic, Reasoning, and Coding (ARC) foundation models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A simple, high-quality voice conversion tool focused on ease of use
InvokeAI is a leading creative engine for Stable Diffusion models
Awesome multilingual OCR toolkits based on PaddlePaddle
Web interface for generating images using Stable Diffusion models
DeepMind's software stack for physics-based simulation
Open-source, high-performance AI model with advanced reasoning
1 min voice data can also be used to train a good TTS model
Python-based neural networks API
Offline inference engine for art, real-time voice conversations
Datasets, transforms and models specific to Computer Vision
Generating Immersive, Explorable, and Interactive 3D Worlds
An open-source RAG-based tool for chatting with your documents
NVR with realtime local object detection for IP cameras
AI-powered video clipping and highlight generation
TextWorld is a sandbox learning environment for the training
Open Source Document Management System for Digital Archives