Focus on prompting and generating
Port of Facebook's LLaMA model in C/C++
Wan2.2: Open and Advanced Large-Scale Video Generative Model
OCRmyPDF adds an OCR text layer to scanned PDF files
The most powerful and modular diffusion model GUI, API and backend
Contexts Optical Compression
RGBD video generation model conditioned on camera input
Qwen3 is the large language model series developed by the Qwen team
Robust Speech Recognition via Large-Scale Weak Supervision
Open source machine learning framework
Visualizer for neural network, deep learning, and machine learning models
An experimental version of the DeepSeek model
Open-source, high-performance AI model with advanced reasoning
Stable Diffusion web UI
Powerful AI language model (MoE) optimized for efficiency/performance
Label Studio is a multi-type data labeling and annotation tool
NVR with realtime local object detection for IP cameras
Google Testing and Mocking Framework
InvokeAI is a leading creative engine for Stable Diffusion models
Qwen3-Coder is the code version of Qwen3
Awesome multilingual OCR toolkits based on PaddlePaddle
Open-Sora: Democratizing Efficient Video Production for All
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Download media files from a Telegram conversation/chat/channel
High-Resolution 3D Assets Generation with Large Scale Diffusion Models