Efficient Triton Kernels for LLM Training
FlashInfer: Kernel Library for LLM Serving
The leading agent orchestration platform for Claude
An experimental version of DeepSeek model
A Powerful Native Multimodal Model for Image Generation
GUI for a Vocal Remover that uses Deep Neural Networks
Automate native Android apps with AI using accessibility APIs
Real time face swap and one-click video deepfake
Geometric deep learning extension library for PyTorch
Deep and Machine Learning for Microscopy
State-of-the-art 2D and 3D Face Analysis Project
The most powerful and modular diffusion model GUI, api and backend
Stable Diffusion web UI
Focus on prompting and generating
Advanced language and coding AI model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
3D reconstruction software
Agentic, Reasoning, and Coding (ARC) foundation models
OCRmyPDF adds an OCR text layer to scanned PDF files
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Run Local LLMs on Any Device. Open-source
Code for running inference and finetuning with SAM 3 model
Robust Speech Recognition via Large-Scale Weak Supervision
Awesome multilingual OCR toolkits based on PaddlePaddle
Powerful AI language model (MoE) optimized for efficiency/performance