Controllable & emotion-expressive zero-shot TTS
Global weather forecasting model using graph neural networks and JAX
Diffusion Bee is the easiest way to run Stable Diffusion locally
Repo for SeedVR2 & SeedVR
General-purpose image editing model that delivers high-fidelity
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Open-source large language model family from Tencent Hunyuan
Tooling for the Common Objects In 3D dataset
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
LLM-based Reinforcement Learning audio edit model
Open-weight, large-scale hybrid-attention reasoning model
Multimodal embedding and reranking models built on Qwen3-VL
Official implementation of Watermark Anything with Localized Messages
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Designed for text embedding and ranking tasks
Bidirectional token-classification model for identifiable info
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
FAIR Sequence Modeling Toolkit 2
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Language modeling in a sentence representation space
Multi-modal large language model designed for audio understanding
Python example app from the OpenAI API quickstart tutorial
Chat & pretrained large vision language model
Open Multilingual Multimodal Chat LMs
Release for Improved Denoising Diffusion Probabilistic Models