Large Multimodal Models for Video Understanding and Editing
Inference script for Oasis 500M
4M: Massively Multimodal Masked Modeling
FAIR Sequence Modeling Toolkit 2
ICLR2024 Spotlight: curation/training code, metadata, distribution
Official DeiT repository
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
A Customizable Image-to-Video Model based on HunyuanVideo
Open-source large language model family from Tencent Hunyuan
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Pokee Deep Research Model Open Source Repo
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Language modeling in a sentence representation space
An AI-powered security review GitHub Action using Claude
Designed for text embedding and ranking tasks
A series of math-specific large language models of our Qwen2 series
Implementation of the Surya Foundation Model for Heliophysics
Repo of Qwen2-Audio chat & pretrained large audio language model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Chat & pretrained large audio language model proposed by Alibaba Cloud
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention