Persian NLP Toolkit
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
A Unified Framework for Text-to-3D and Image-to-3D Generation
OCR model for complex documents with layout-aware structured outputs
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Official MiniMax Model Context Protocol (MCP) server
A lightweight text-to-speech model with zero-shot voice cloning
Open-Sora: Democratizing Efficient Video Production for All
Industrial-level controllable zero-shot text-to-speech system
Framework for building realtime multimodal voice AI agents apps
A full spaCy pipeline and models for scientific/biomedical documents
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Using AI models to automatically provide commentary and edit videos
Stanford NLP Python library for many human languages
Powerful Android AI agent with tools, automation, and Linux shell
LLM abstractions that aren't obstructions
Extension of Google Research’s PaperBanana
Qwen3-ASR is an open-source series of ASR models
A lightweight framework for building LLM-based agents
Qwen-Image is a powerful image generation foundation model
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Official Python inference and LoRA trainer package
Towards Real-World Vision-Language Understanding
Free, high-quality text-to-speech API endpoint to replace OpenAI
Qwen3 is the large language model series developed by Qwen team