Python bindings for llama.cpp
An Efficient Agentic Model for Computer Use
Long-form streaming TTS system for multi-speaker dialogue generation
Ultra-Efficient LLMs on End Device
tiktoken is a fast BPE tokeniser for use with OpenAI's models
GPT4V-level open-source multi-modal model based on Llama3-8B
A series of math-specific large language models of our Qwen2 series
Generating Immersive, Explorable, and Interactive 3D Worlds
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
Qwen3-omni is a natively end-to-end, omni-modal LLM
Fast-stable-diffusion + DreamBooth
Collection of Gemma 3 variants that are trained for performance
High-resolution models for human tasks
CLIP, Predict the most relevant text snippet given an image
Genome modeling and design across all domains of life
Achieving 3+ generation speedup on reasoning tasks
Pretrained time-series foundation model developed by Google Research
Generate Any 3D Scene in Seconds
FAIR Sequence Modeling Toolkit 2
A PyTorch library for implementing flow matching algorithms
Diffusion Transformer with Fine-Grained Chinese Understanding
Open-source large language model family from Tencent Hunyuan
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Controllable & emotion-expressive zero-shot TTS