From Images to High-Fidelity 3D Assets
A TTS model capable of generating ultra-realistic dialogue
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A Modular Simulation Framework and Benchmark for Robot Learning
Synthesizing and manipulating 2048x1024 images with conditional GANs
Advancing Open-source World Models
An alignment auditing agent capable of exploring alignment hypothesis
Empowering Code Generation with OSS-Instruct
Generate Any 3D Scene in Seconds
A high-quality rapid TTS voice cloning model
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
State-of-the-art TTS model under 25MB
Machine learning image inpainting task that removes watermarks
SAPIEN Manipulation Skill Framework
Open-source multi-speaker long-form text-to-speech model
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Pre-trained Deep Learning models and demos
Anthropic's Interactive Prompt Engineering Tutorial
Foundation model for image generation
Multimodal Diffusion with Representation Alignment
Real-World Centric Foundation GUI Agents
A.S.E (AICGSecEval) is a repository-level AI-generated code security
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Anthropic's educational courses
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning