gpt-oss-120b and gpt-oss-20b are two open-weight language models
An experimental version of DeepSeek model
Provides convenient access to the Anthropic REST API from any Python 3
DeepSeek Coder: Let the Code Write Itself
Tool for exploring and debugging transformer model behaviors
Hackable and optimized Transformers building blocks
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official implementation of DreamCraft3D
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Video understanding codebase from FAIR for reproducing video models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Towards Real-World Vision-Language Understanding
Industrial-level controllable zero-shot text-to-speech system
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Pushing the Limits of Mathematical Reasoning in Open Language Models
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Language modeling in a sentence representation space
General-purpose image editing model that delivers high-fidelity
ICLR2024 Spotlight: curation/training code, metadata, distribution
A SOTA open-source image editing model
OCR expert VLM powered by Hunyuan's native multimodal architecture
The ChatGPT Retrieval Plugin lets you easily find personal documents
High-Resolution Image Synthesis with Latent Diffusion Models