OCRmyPDF adds an OCR text layer to scanned PDF files
Parse files for optimal RAG
A Powerful Native Multimodal Model for Image Generation
Create UIs for your machine learning model in Python in 3 minutes
Open-source multi-speaker long-form text-to-speech model
Offline Text To Speech synthesis for python
The Triton Inference Server provides an optimized cloud
SOTA Open Source TTS
A Model Context Protocol server for searching and analyzing arXiv
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Shinkai allows you to create advanced AI (local) agents effortlessly
A computer vision framework to create and deploy apps in minutes
A fast embedded library for approximate nearest neighbor search
Face Mask Detection system based on computer vision and deep learning
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2
Open source embedded speech-to-text engine
Technologies for automating food production on various scales