ChatGPT interface with better UI
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
LTX-Video Support for ComfyUI
tiktoken is a fast BPE tokeniser for use with OpenAI's models
One-click local MCP server installation in desktop apps
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
An experimental version of the DeepSeek model
A Powerful Native Multimodal Model for Image Generation
Diversity-driven optimization and large-model reasoning ability
Collection of Gemma 3 variants that are trained for performance
Diffusion Transformer with Fine-Grained Chinese Understanding
PyTorch code and models for the DINOv2 self-supervised learning
CLIP: predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
4M: Massively Multimodal Masked Modeling
Large Multimodal Models for Video Understanding and Editing
Designed for text embedding and ranking tasks
Official implementation of DreamCraft3D
Implementation of "MobileCLIP" CVPR 2024
Repo of Qwen2-Audio chat & pretrained large audio language model
The official PyTorch implementation of Google's Gemma models
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
The ChatGPT Retrieval Plugin lets you easily find personal documents
Inference script for Oasis 500M
ICLR 2024 Spotlight: curation/training code, metadata, distribution