Robust Speech Recognition Across Languages, Dialects
Hunyuan Translation Model Version 1.5
Official implementation of Watermark Anything with Localized Messages
GPT4V-level open-source multi-modal model based on Llama3-8B
Math-specific large language models from our Qwen2 series
Fast-stable-diffusion + DreamBooth
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Advancing Open-source World Models
Code for running inference with the SAM 3D Body Model 3DB
DeepSeek Coder: Let the Code Write Itself
Open-Source Financial Large Language Models
ICLR2024 Spotlight: curation/training code, metadata, distribution
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Tiny vision language model
The official PyTorch implementation of Google's Gemma models
Inference code for scalable emulation of protein equilibrium ensembles
High-Resolution Image Synthesis with Latent Diffusion Models
An experimental version of DeepSeek model
RGBD video generation model conditioned on camera input
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Qwen2.5-VL is the multimodal large language model series
Open-source deep-learning framework