Open Source Document Management System for Digital Archives
Multimodal embedding and reranking models built on Qwen3-VL
Supercharge Your LLM with the Fastest KV Cache Layer
GenAI Processors is a lightweight Python library
AutoGluon: AutoML for Image, Text, and Tabular Data
Build Vision Agents quickly with any model or video provider
Generate audiobooks from e-books
Qwen2.5-VL is the multimodal large language model series
OCR model for complex documents with layout-aware structured outputs
Foundation model for image generation
A speech-text foundation model for real-time dialogue
Document content and metadata extraction microservice
The Python code to reproduce illustrations from Machine Learning Book
Cloud-native open source data warehouse for analytics and AI queries
Python library for scraping and analyzing online news articles easily
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
Controllable and fast Text-to-Speech for over 7000 languages
Lightweight package to simplify LLM API calls
Fast Stable Diffusion on CPU and AI PC
Adding guardrails to large language models
Qwen3-ASR is an open-source series of ASR models
Instant voice cloning by MIT and MyShell. Audio foundation model
Check code for common misspellings
Open-source multi-speaker long-form text-to-speech model
Scalable data preprocessing and curation toolkit for LLMs