The most powerful local music generation model
Programmatic access to the AlphaGenome model
Qwen3-Coder is the code version of Qwen3
Tool for exploring and debugging transformer model behaviors
A GPT-4o-Level MLLM for Vision, Speech and Multimodal Live Streaming
CLIP: predict the most relevant text snippet given an image
Official DeiT repository
Open-source pre-training implementation of Google's LaMDA in PyTorch