OCRmyPDF adds an OCR text layer to scanned PDF files
Speech recognition module for Python
Machine learning, conversational dialog engine for creating chat bots
Transforming Multimodal Content into Captivating Multilingual Audio
Parse files for optimal RAG
High-quality multi-lingual text-to-speech library by MyShell.ai
Web UI for your scripts with execution management
A Data Entry Tool for Windows and Linux
Python & command-line tool to gather text on the Web
A Python library for audio data augmentation
Python Terminal Toolkit - a Spiced Up TUI Library
Audio generation using diffusion models, in PyTorch
Rich is a Python library for rich text and beautiful formatting
Seamlessly integrate LLMs into scikit-learn
Central interface to connect your LLM's with external data
A Python package for segmenting geospatial data with the SAM
An open-source toolkit for monitoring Language Learning Models (LLMs)
A speech-text foundation model for real time dialogue
Tool for visualizing and tracking your machine learning experiments
User toolkit for analyzing and interfacing with Large Language Models
A minimal implementation of diffusion models for text generation
CPT: A Pre-Trained Unbalanced Transformer
Turn words into chords
An interpretable and efficient predictor using pre-trained models
A Modern Fully-Fledged Mouse and Keyboard AutoClicker