Scalable data pre processing and curation toolkit for LLMs
Qwen3-omni is a natively end-to-end, omni-modal LLM
A very simple framework for state-of-the-art NLP
Generate blog articles from video or audio
Audiocraft is a library for audio processing and generation
Implementation of Imagen, Google's Text-to-Image Neural Network
Stanford NLP Python library for many human languages
Easily compute clip embeddings and build a clip retrieval system
Collection of Gemma 3 variants that are trained for performance
Chat with it via text and voice
An Open Source text-to-speech system built by inverting Whisper
Diffusion Bee is the easiest way to run Stable Diffusion locally
Multilingual sentence & image embeddings with BERT
Synchronized Translation for Videos
User toolkit for analyzing and interfacing with Large Language Models
Free, high-quality text-to-speech API endpoint to replace OpenAI
StreamSpeech is a seamless model for offline speech recognition
Easy-to-use and powerful NLP library with Awesome model zoo
Han Language Processing
Open source personal AI Assistant for Linux, Windows and Mac
Qwen-Image is a powerful image generation foundation model
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A full spaCy pipeline and models for scientific/biomedical documents
A sound cloning tool with a web interface, using your voice
A modular graph-based Retrieval-Augmented Generation (RAG) system