A library for converting HTML into PDFs using ReportLab
Compute distance between sequences
Snippet solution for Vim
Instagram OSINT tool for gathering profile data and public posts
Framework for building real-time voice and multimodal AI agents
Easily compute clip embeddings and build a clip retrieval system
Faster Whisper transcription with CTranslate2
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Management of Yandex Station and other smart home devices
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
TextWorld is a sandbox learning environment for the training
Tools for manipulating datasets
Agent harness to make your slop code well-engineered and beautiful
A Unified Framework for Text-to-3D and Image-to-3D Generation
Official Python inference and LoRA trainer package
Open source healthcare AI
A full spaCy pipeline and models for scientific/biomedical documents
A Repo For Document AI
A community-supported supercharged version of paperless
Controllable & emotion-expressive zero-shot TTS
A fast TTS architecture with conditional flow matching
Accurate × Fast × Comprehensive
Speakr is a personal, self-hosted web application
Underthesea - Vietnamese NLP Toolkit