Repo for SeedVR2 & SeedVR
Fast-stable-diffusion + DreamBooth
CLIP, Predict the most relevant text snippet given an image
Bidirectional token-classification model for identifiable info
Foundation Models for Time Series
Hackable and optimized Transformers building blocks
A Customizable Image-to-Video Model based on HunyuanVideo
Open-source large language model family from Tencent Hunyuan
A Multi-Modal World Model for Reconstructing, Generating, Simulation
FAIR Sequence Modeling Toolkit 2
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Renderer for the harmony response format to be used with gpt-oss
A Powerful Native Multimodal Model for Image Generation
Chinese and English multimodal conversational language model
Inference code for scalable emulation of protein equilibrium ensembles
Revolutionizing Database Interactions with Private LLM Technology
Video Object and Interaction Deletion
Z80-μLM is a 2-bit quantized language model
Tool for exploring and debugging transformer model behaviors
A Unified Framework for Text-to-3D and Image-to-3D Generation
Project Lyra: Open Generative 3D World Models
Ultra-Efficient LLMs on End Device
Pretrained time-series foundation model developed by Google Research
PyTorch code and models for the DINOv2 self-supervised learning