OCRmyPDF adds an OCR text layer to scanned PDF files
Deepfakes Software For All
A lightweight audio-to-MIDI converter with pitch bend detection
Advanced language and coding AI model
A Python library powered by large language models (LLMs)
A high-throughput and memory-efficient inference and serving engine
Create UIs for your machine learning model in Python in 3 minutes
Low-level Python library used to interact with a Substra network
A unified interface for distributed computing
Image polygonal annotation with Python
Extensible, parallel implementations of t-SNE
A Python Automated Machine Learning tool that optimizes ML
One minute of voice data is enough to train a good TTS model
Awesome multilingual OCR toolkits based on PaddlePaddle
DSPy: The framework for programming—not prompting—language models
Source code of PyGAD, a Python 3 library for building genetic algorithms
From Images to High-Fidelity 3D Assets
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Web interface for generating images using Stable Diffusion models
Agentic, Reasoning, and Coding (ARC) foundation models
AI-data warehouse to enrich, transform and analyze unstructured data
Low-code app builder for RAG and multi-agent AI applications
Comprehensive Gradio WebUI for audio processing
The machine learning toolkit for time series analysis in Python
Official inference repo for FLUX.2 models