Native and Compact Structured Latents for 3D Generation
Official inference repo for FLUX.2 models
Python inference and LoRA trainer package for the LTX-2 audio–video
Project Lyra: Open Generative 3D World Models
The repository provides code for running inference with SAM 2
Long-form streaming TTS system for multi-speaker dialogue generation
Genome modeling and design across all domains of life
PyTorch code and models for VJEPA2 self-supervised learning from video
Ajenti Core and stock plugins
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Visual Causal Flow
Official Python inference and LoRA trainer package
High-Resolution Image Synthesis with Latent Diffusion Models
Multi-modal large language model designed for audio understanding
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
From Images to High-Fidelity 3D Assets
An experimental version of DeepSeek model
Qwen2.5-VL is the multimodal large language model series
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Complete Two-Factor Authentication for Django
LaTeX source and supporting code for Think Python, 2nd edition
An Open-source Framework for Data-centric Language Agents
Chat with your SQL database
Towards Human-Level Text-to-Speech through Style Diffusion
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System