State-of-the-art TTS model under 25MB
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
InvokeAI is a leading creative engine for Stable Diffusion models
Models for the spaCy Natural Language Processing (NLP) library
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Open Source Document Management System for Digital Archives
Python tool for converting files and office documents to Markdown
Open source personal AI Assistant for Linux, Windows and Mac
A framework to enable multimodal models to operate a computer
Library for OCR-related tasks powered by Deep Learning
A robust, efficient, low-latency speech-to-text library
Industrial-strength Natural Language Processing (NLP)
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Audiocraft is a library for audio processing and generation
High-Resolution Image Synthesis with Latent Diffusion Models
Implementation of Make-A-Video, new SOTA text to video generator
Pushing the Limits of Mathematical Reasoning in Open Language Models
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Qwen3-omni is a natively end-to-end, omni-modal LLM
SoTA open-source TTS
Scalable data pre processing and curation toolkit for LLMs
An open-source toolkit for monitoring Language Learning Models (LLMs)
Machine learning, conversational dialog engine for creating chat bots
Tool for visualizing and tracking your machine learning experiments
A generative speech model for daily dialogue