GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Repo for SeedVR2 & SeedVR
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
LLM-based reinforcement learning audio editing model
Open-weight, large-scale hybrid-attention reasoning model
Phi-3.5 for Mac: Locally-run Vision and Language Models
Tooling for the Common Objects In 3D dataset
FAIR Sequence Modeling Toolkit 2
Open-source large language model family from Tencent Hunyuan
Revolutionizing Database Interactions with Private LLM Technology
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Language modeling in a sentence representation space
Designed for text embedding and ranking tasks
Multimodal embedding and reranking models built on Qwen3-VL
Z80-μLM is a 2-bit quantized language model
Official implementation of Watermark Anything with Localized Messages
General-purpose image editing model that delivers high-fidelity
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
Multi-modal large language model designed for audio understanding
Release for Improved Denoising Diffusion Probabilistic Models
Official DeiT repository
Open Multilingual Multimodal Chat LMs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Implementation of model parallel autoregressive transformers on GPUs