Web interface for generating images using Stable Diffusion models
Awesome multilingual OCR toolkits based on PaddlePaddle
Advanced language and coding AI model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
1 min voice data can also be used to train a good TTS model
Offline Text To Speech synthesis for python
A lightweight audio-to-MIDI converter with pitch bend detection
Speech-to-text, text-to-speech, and speaker recognition
State-of-the-art TTS model under 25MB
Agentic, Reasoning, and Coding (ARC) foundation models
Official inference repo for FLUX.2 models
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
OCR software, free and offline
Multi-class confusion matrix library in Python
A reactive notebook for Python
Asynchronous multi-platform robot framework written in Python
Official inference repo for FLUX.1 models
NVR with realtime local object detection for IP cameras
Petastorm library enables single machine or distributed training
A Python library for learning and evaluating knowledge graph embedding
A modular, primitive-first, python-first PyTorch library
Free, open source crypto trading bot
A python tool that uses GPT-4, FFmpeg, and OpenCV
Generate short videos with one click using AI LLM
A robust, efficient, low-latency speech-to-text library