Repo for SeedVR2 & SeedVR
DeepSeek Coder: Let the Code Write Itself
Official repository for LTX-Video
VMZ: Model Zoo for Video Modeling
CodeGeeX2: A More Powerful Multilingual Code Generation Model
A Unified Framework for Text-to-3D and Image-to-3D Generation
AlphaFold 3 inference pipeline
Release for Improved Denoising Diffusion Probabilistic Models
Example Discord bot written in Python that uses the completions API
Open-source multi-speaker long-form text-to-speech model
High-Resolution Image Synthesis with Latent Diffusion Models
Hackable and optimized Transformers building blocks
Tongyi Deep Research, the Leading Open-source Deep Research Agent
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Tool for exploring and debugging transformer model behaviors
State-of-the-art (SoTA) text-to-video pre-trained model
Programmatic access to the AlphaGenome model
OCR expert VLM powered by Hunyuan's native multimodal architecture
ChatGPT interface with better UI
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Pushing the Limits of Mathematical Reasoning in Open Language Models
Tencent Hunyuan multimodal diffusion transformer (MM-DiT) model
Multimodal Diffusion with Representation Alignment
Phi-3.5 for Mac: Locally-run Vision and Language Models
Unified Multimodal Understanding and Generation Models