Open-weight, large-scale hybrid-attention reasoning model
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
ChatGPT interface with better UI
This repository contains the official implementation of FastVLM
High-resolution models for human tasks
Large Multimodal Models for Video Understanding and Editing
Unified Multimodal Understanding and Generation Models
Language modeling in a sentence representation space
Dataset of GPT-2 outputs for research in detection, biases, and more
FAIR Sequence Modeling Toolkit 2
ICLR2024 Spotlight: curation/training code, metadata, distribution
Diffusion Transformer with Fine-Grained Chinese Understanding
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
The ChatGPT Retrieval Plugin lets you easily find personal documents
MiniMax-M2, a model built for Max coding & agentic workflows
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-Source Financial Large Language Models!
Open-source, high-performance Mixture-of-Experts large language model
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Open Multilingual Multimodal Chat LMs
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Repo for external large-scale work