In-App assistant SDK to build a multimodal conversational UX websites
Self-hosted AI audio transcription
Multilingual Document Layout Parsing in a Single Vision-Language Model
Based on the LangChain/LangGraph framework
Qwen3-ASR is an open-source series of ASR models
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
End-to-end speech processing toolkit
Code release for Cut and Learn for Unsupervised Object Detection
Statistical machine intelligence and learning engine
Jittor is a high-performance deep learning framework
Cross-platform, customizable ML solutions
Production ready toolkit to run AI locally
AI assistant based on large models that can actively think and plan
Chinese XLNet pre-trained model
Pre-trained Deep Learning models and demos
Framework for building neural networks
Refer and Ground Anything Anywhere at Any Granularity
Convert AI papers to GUI
Framework for building AI-powered interactive digital humans and agent
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space
The standard data-centric AI package for data quality and ML
NLP Cloud serves high performance pre-trained or custom models for NER
Bailing is a voice dialogue robot similar to GPT-4o
Stanford NLP Python library for many human languages