A community-supported supercharged version of paperless
Web interface for generating images using Stable Diffusion models
State-of-the-art TTS model under 25MB
InvokeAI is a leading creative engine for Stable Diffusion models
Open source personal AI Assistant for Linux, Windows and Mac
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Open Source Document Management System for Digital Archives
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Machine Learning Systems: Design and Implementation
Pushing the Limits of Mathematical Reasoning in Open Language Models
Python tool for converting files and office documents to Markdown
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A robust, efficient, low-latency speech-to-text library
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Models for the spaCy Natural Language Processing (NLP) library
⚡ Building applications with LLMs through composability ⚡
Library for OCR-related tasks powered by Deep Learning
A framework to enable multimodal models to operate a computer
Easy-to-use Speech Toolkit including Self-Supervised Learning models
MTEB: Massive Text Embedding Benchmark
Audiocraft is a library for audio processing and generation
Open source machine learning framework to automate text conversations
An open-source toolkit for monitoring Large Language Models (LLMs)
Machine learning, conversational dialog engine for creating chat bots
Implementation of Make-A-Video, a new SOTA text-to-video generator