Document (PDF, Word, PPTX ...) extraction and parse API
High-performance inference server for text embeddings models API layer
Robust Speech Recognition via Large-Scale Weak Supervision
OCR model for complex documents with layout-aware structured outputs
Contexts Optical Compression
Open source semantic search and text analytics for large document sets
A high-quality PDF to Markdown tool based on large language model
AI-powered tool for generating, optimizing, and translating subtitles
A simple tool for reading in poorly redacted documents
Easily compute clip embeddings and build a clip retrieval system
Agent harness to make your slop code well-engineered and beautiful
Advanced NLP with spaCy: A free online course
Self-hosted collection of powerful web-based tools for everyday tasks
Audiocraft is a library for audio processing and generation
End-to-end speech processing toolkit
Lightning-fast, on-device TTS, running natively via ONNX
Use Microsoft Edge's online text-to-speech service from Python
Python library for scraping and analyzing online news articles easily
Chinese XLNet pre-trained model
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
TextWorld is a sandbox learning environment for the training
Framework for building realtime multimodal voice AI agents apps
PDFCraft is a free, privacy-focused PDF toolkit
NLTK Source
Bidirectional token-classification model for identifiable info