VMZ: Model Zoo for Video Modeling
Integrating LLMs into structured NLP pipelines
Chinese XLNet pre-trained model
Convert AI papers to GUI
The leading agent orchestration platform for Claude
A very simple framework for state-of-the-art NLP
End-to-end speech processing toolkit
Qwen3-Coder is the code version of Qwen3
Jittor is a high-performance deep learning framework
The standard data-centric AI package for data quality and ML
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
PyTorch code and models for VJEPA2 self-supervised learning from video
Framework for building neural networks
Refer and Ground Anything Anywhere at Any Granularity
GLM-4-Voice | End-to-End Chinese-English Conversational Model
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Language modeling in a sentence representation space
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Bailing is a voice dialogue robot similar to GPT-4o
Obsei is a low-code, AI-powered automation tool
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Multi-modal large language model designed for audio understanding
Leading free and open-source liveness check & face recognition system