MiniMax-M2, a model built for Max coding & agentic workflows
Generate Any 3D Scene in Seconds
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Foundation Models for Time Series
Hackable and optimized Transformers building blocks
PyTorch code and models for the DINOv2 self-supervised learning
Official implementation of DreamCraft3D
LLM-based Reinforcement Learning audio edit model
A Family of Open Foundation Models for Code Intelligence
Python example app from the OpenAI API quickstart tutorial
Provides convenient access to the Anthropic REST API from any Python 3
OCR expert VLM powered by Hunyuan's native multimodal architecture
This repository contains the official implementation of FastVLM
Diversity-driven optimization and large-model reasoning ability
Capable of understanding text, audio, vision, video
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
A series of math-specific large language models of our Qwen2 series
Inference code for scalable emulation of protein equilibrium ensembles
Memory-efficient and performant finetuning of Mistral's models
Pushing the Limits of Mathematical Reasoning in Open Language Models
Towards Real-World Vision-Language Understanding
A SOTA open-source image editing model