Easy Docker setup for Stable Diffusion with user-friendly UI
A Pragmatic VLA Foundation Model
DeepMind model for tracking arbitrary points across videos & robotics
Global weather forecasting model using graph neural networks and JAX
An AI-powered security review GitHub Action using Claude
Provides convenient access to the Anthropic REST API from any Python 3
GPT4V-level open-source multi-modal model based on Llama3-8B
Qwen3-ASR is an open-source series of ASR models
Tiny vision language model
Pushing the Limits of Mathematical Reasoning in Open Language Models
GLM-4 series: Open Multilingual Multimodal Chat LMs
Large-language-model & vision-language-model based on Linear Attention
Hunyuan Translation Model Version 1.5
Python SDK for Claude Agent
The Clay Foundation Model - An open source AI model and interface
Chinese and English multimodal conversational language model
Fast and Universal 3D reconstruction model for versatile tasks
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Controllable & emotion-expressive zero-shot TTS
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
OCR expert VLM powered by Hunyuan's native multimodal architecture
Inference framework for 1-bit LLMs
Chat & pretrained large vision language model
Open-source industrial-grade ASR models