All-in-one text de-duplication
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Speech recognition module for Python
The simplest, fastest repository for training/finetuning models
Ready-to-use OCR with 80+ supported languages
Python module for parsing semi-structured text into Python tables
Wan2.2: Open and Advanced Large-Scale Video Generative Model
The behavior guidance framework for customer-facing LLM agents
A robust, efficient, low-latency speech-to-text library
A Python utility / library to sort imports
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Mozc - a Japanese Input Method Editor designed for multi-platform
A Python parametric CAD scripting framework based on OCCT
Python & command-line tool to gather text on the Web
A generative speech model for daily dialogue
A Powerful Native Multimodal Model for Image Generation
Comprehensive Markdown plugin built for Django
State-of-the-art TTS model under 25MB
PDF to Markdown with vision models
Python tool for converting files and office documents to Markdown
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Label Studio is a multi-type data labeling and annotation tool
A pure-Python PDF library capable of splitting, merging, cropping
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Library for OCR-related tasks powered by Deep Learning