A very simple framework for state-of-the-art NLP
End-to-end speech processing toolkit
A modular graph-based Retrieval-Augmented Generation (RAG) system
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
HY-Motion model for 3D character animation generation
Stable Diffusion built into Blender
LLM-based Reinforcement Learning audio editing model
Interface for OuteTTS models
A chatbot built on a large model
A full spaCy pipeline and models for scientific/biomedical documents
Create videos with Stable Diffusion
Open-source multi-speaker long-form text-to-speech model
Multi-lingual large voice generation model, providing inference
Supercharge Your LLM with the Fastest KV Cache Layer
A Model Context Protocol (MCP) server
TextWorld is a sandbox learning environment for the training
LLM
Guiding Instruction-based Image Editing via Multimodal Large Language
ImageBind: One Embedding Space to Bind Them All
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Diffusion Transformer with Fine-Grained Chinese Understanding
Lightweight package to simplify LLM API calls
Open-Sora: Democratizing Efficient Video Production for All
Build AI-powered semantic search applications
The official repo of Qwen chat & pretrained large language model