OCRmyPDF adds an OCR text layer to scanned PDF files
3D reconstruction software
The AI-powered coding wizard
Lightweight framework for building Agents with memory, knowledge, etc.
Offline Text To Speech synthesis for python
Build multi-modal Agents with memory, knowledge, tools and reasoning
Empower Your Dev Ecosystem with AI Agents
Plug-and-play library to enable agents to call MCP and UTCP tools
Official python implementation of UTCP. UTCP is an open standard
Enlightened library to convert HTML and CSS to SVG
Data Lake for Deep Learning. Build, manage, and query datasets
Turn your existing data infrastructure into a feature store
A deep learning toolkit for Text-to-Speech, battle-tested in research
Improved JPEG encoder
The deep learning toolkit for speech-to-text
A list of accessible speech corpora for ASR, TTS
Repository for gathering information on study materials
Open source embedded speech-to-text engine
Deep learning for text to speech
Open source speech models for Julius in English and other languages.
A Weka Plugin that uses a Genetic Algorithm for Data Oversampling
Computerized guideline editor for clinical decision support
10x faster matrix and vector operations