Research-oriented chatbot framework
text and image to video generation: CogVideoX (2024) and CogVideo
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
Fast inference engine for Transformer models
An alignment auditing agent capable of exploring alignment hypothesis
A modular high-level library to train embodied AI agents
A library for scientific machine learning & physics-informed learning
Free, ultrafast Copilot alternative for Vim and Neovim
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Qwen2.5-VL is the multimodal large language model series
C++ DataFrame for statistical, Financial, and ML analysis
Pushing the Limits of Mathematical Reasoning in Open Language Models
Official implementation of Watermark Anything with Localized Messages
FlashMLA: Efficient Multi-head Latent Attention Kernels
Open platform for training, serving, and evaluating language models
PaddlePaddle End-to-End Development Toolkit
A research prototype of a human-centered web agent
SAPIEN Manipulation Skill Framework
The fastest way to bring multi-agent workflows to production
Implementation of Vision Transformer, a simple way to achieve SOTA
Private chat with local GPT with document, images, video, etc.
A library for deep learning end-to-end dialog systems and chatbots
Qwen3-omni is a natively end-to-end, omni-modal LLM
Phi-3.5 for Mac: Locally-run Vision and Language Models
SGLang is a fast serving framework for large language models