super expressive prompting model based on ltx2.3
Qwen3-Coder is the code version of Qwen3
Video Object and Interaction Deletion
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Large-language-model & vision-language-model based on Linear Attention
Advancing Open-source World Models
MiniMax M2.1, a SOTA model for real-world dev & agents.
State of the art LLM and coding model
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Example Discord bot written in Python that uses the completions API
Dataset of GPT-2 outputs for research in detection, biases, and more
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Repo for external large-scale work
Open-source pre-training implementation of Google's LaMDA in PyTorch
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201
QwQ-32B is a reasoning-focused language model for complex tasks
High-efficiency reasoning and agentic intelligence model
Vision-language-action model for robot control via images and text
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B