Qwen2.5-VL is the multimodal large language model series
Programmatic access to the AlphaGenome model
Ultra-Efficient LLMs on End Devices
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Industrial-level controllable zero-shot text-to-speech system
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Open-source deep-learning framework
Models for object and human mesh reconstruction
Contexts Optical Compression
Provides convenient access to the Anthropic REST API from any Python 3
Generating Immersive, Explorable, and Interactive 3D Worlds
Achieving 3x+ generation speedup on reasoning tasks
Open-Source Financial Large Language Models
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Foundation Models for Time Series
Advancing Open-source World Models
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Sharp Monocular Metric Depth in Less Than a Second
Video Object and Interaction Deletion
Easy Docker setup for Stable Diffusion with user-friendly UI
FAIR Sequence Modeling Toolkit 2
Phi-3.5 for Mac: Locally-run Vision and Language Models
Qwen-Image is a powerful image generation foundation model
Code for running inference with the SAM 3D Body Model 3DB
PyTorch code and models for the DINOv2 self-supervised learning