Repo of Qwen2-Audio chat & pretrained large audio language model
A series of math-specific large language models of our Qwen2 series
Open-source large language model family from Tencent Hunyuan
Multimodal Diffusion with Representation Alignment
Capable of understanding text, audio, vision, video
Research code artifacts for Code World Model (CWM)
Inference code for scalable emulation of protein equilibrium ensembles
The Clay Foundation Model - An open source AI model and interface
Revolutionizing Database Interactions with Private LLM Technology
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
A Customizable Image-to-Video Model based on HunyuanVideo
Multimodal-Driven Architecture for Customized Video Generation
Chat & pretrained large vision language model
Implementation of the Surya Foundation Model for Heliophysics
Inference framework for 1-bit LLMs
Programmatic access to the AlphaGenome model
Phi-3.5 for Mac: Locally-run Vision and Language Models
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
High-Resolution Image Synthesis with Latent Diffusion Models
Ling is a MoE LLM provided and open-sourced by InclusionAI
Diffusion Transformer with Fine-Grained Chinese Understanding
A Unified Framework for Text-to-3D and Image-to-3D Generation
Personalize Any Characters with a Scalable Diffusion Transformer