Image generation model with single-stream diffusion transformer
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
An experimental version of DeepSeek model
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Strong, Economical, and Efficient Mixture-of-Experts Language Model
DeepSeek LLM: Let there be answers
A Unified Framework for Text-to-3D and Image-to-3D Generation
Safety reasoning models built-upon gpt-oss
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Resolution Image Synthesis with Latent Diffusion Models
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Code release for ConvNeXt V2 model
Facebook AI Research Sequence-to-Sequence Toolkit
Code for reproducing key results in the paper
Frontier-scale 675B multimodal base model for custom AI training
Tiny pre-trained IBM model for multivariate time series forecasting
Custom BLEURT model for evaluating text similarity using PyTorch
Multimodal Transformer for document image understanding and layout
Compact English sentence embedding model for semantic search tasks
Efficient English embedding model for semantic search and retrieval
Dia-1.6B generates lifelike English dialogue and vocal expressions
High-compute ultra-reasoning model surpassing model surpassing GPT-5
CLIP model fine-tuned for zero-shot fashion product classification
Quantized 675B multimodal instruct model optimized for NVFP4
Efficient 8B multimodal model tuned for advanced reasoning tasks.