New set of lightweight state-of-the-art, open foundation models
Inference script for Oasis 500M
OCR expert VLM powered by Hunyuan's native multimodal architecture
Pretrained time-series foundation model developed by Google Research
Official implementation of DreamCraft3D
LLM-based Reinforcement Learning audio edit model
code for Mesh R-CNN, ICCV 2019
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Open-weight, large-scale hybrid-attention reasoning model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
The ChatGPT Retrieval Plugin lets you easily find personal documents
Open-source, high-performance Mixture-of-Experts large language model
Open source large language model by Alibaba
Release for Improved Denoising Diffusion Probabilistic Models
Powerful open source image generation model
Open Multilingual Multimodal Chat LMs
DeepSeek LLM: Let there be answers
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Official code for Style Aligned Image Generation via Shared Attention
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Software that can generate photos from paintings
Fine-tuning ChatGLM-6B with PEFT
Official PyTorch Implementation of "Scalable Diffusion Models"
llama.go is like llama.cpp in pure Golang