Scalable data preprocessing and curation toolkit for LLMs
Build AI-powered semantic search applications
Build Vision Agents quickly with any model or video provider
Visual Causal Flow
Fast multimodal LLM for real-time voice interaction and AI apps
Diffusion Transformer with Fine-Grained Chinese Understanding
Large language model and vision-language model based on linear attention
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
AutoGluon: AutoML for Image, Text, and Tabular Data
OCR expert VLM powered by Hunyuan's native multimodal architecture
Using AI models to automatically provide commentary and edit videos
Running large language models on a single GPU
Edit videos with Claude Code
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Open source AI VTuber platform with voice chat and Live2D avatars
The official repo of Qwen chat & pretrained large language model
Models for the spaCy Natural Language Processing (NLP) library
Ultra-Efficient LLMs on End Devices
A Python library for AMR parsing, generation and visualization
Audio foundation model excelling in audio understanding
A framework to enable multimodal models to operate a computer
Multi-modal large language model designed for audio understanding
Large Multimodal Models for Video Understanding and Editing
Build a large language model from scratch with only basic Python knowledge
A Personalized LLM-powered Agent Framework