Ling is a MoE LLM provided and open-sourced by InclusionAI
Repo of Qwen2-Audio chat & pretrained large audio language model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
4M: Massively Multimodal Masked Modeling
PyTorch code and models for the DINOv2 self-supervised learning
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Models for object and human mesh reconstruction
Designed for text embedding and ranking tasks
The official PyTorch implementation of Google's Gemma models
Diffusion Transformer with Fine-Grained Chinese Understanding
A Customizable Image-to-Video Model based on HunyuanVideo
Implementation of "MobileCLIP" (CVPR 2024)
Multimodal Diffusion with Representation Alignment
Official code for Style Aligned Image Generation via Shared Attention
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Official implementation of DreamCraft3D
The ChatGPT Retrieval Plugin lets you easily find personal documents
Implementation of the Surya Foundation Model for Heliophysics
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Powerful open source image generation model
An Open Bilingual Chat LLM
Open Multilingual Multimodal Chat LMs
Fine-tuning ChatGLM-6B with PEFT