Port of Facebook's LLaMA model in C/C++
Python-based neural networks API
Master the essential skills needed to recognize and solve problems
Personal AI, On Personal Devices
Official inference repo for FLUX.1 models
Python inference and LoRA trainer package for the LTX-2 audio–video
Stable Diffusion web UI
Offline Text To Speech synthesis for python
OCRmyPDF adds an OCR text layer to scanned PDF files
Video-based AI memory library. Store millions of text chunks in MP4
Robust Speech Recognition via Large-Scale Weak Supervision
A high-throughput and memory-efficient inference and serving engine
A minimal, secure Python interpreter written in Rust for use by AI
Image polygonal annotation with Python
Public repository for Agent Skills
1 min voice data can also be used to train a good TTS model
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Source code of PyGAD, Python 3 library for building genetic algorithms
Reverse-engineered Python API for Google Gemini web app
Awesome multilingual OCR toolkits based on PaddlePaddle
The most powerful and modular diffusion model GUI, api and backend
Low-code app builder for RAG and multi-agent AI applications
A simple, high-quality voice conversion tool focused on ease of use
Code for running inference and finetuning with SAM 3 model
Deepfakes Software For All