Video Object and Interaction Deletion
Python SDK for Claude Agent
Accurate × Fast × Comprehensive
Open-Source Financial Large Language Models
Code for running inference with the SAM 3D Body Model 3DB
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Renderer for the harmony response format to be used with gpt-oss
Revolutionizing Database Interactions with Private LLM Technology
Programmatic access to the AlphaGenome model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
MOSS‑TTS Family open‑source speech and sound generation model
Achieving 3+ generation speedup on reasoning tasks
Easy Docker setup for Stable Diffusion with user-friendly UI
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Industrial-level controllable zero-shot text-to-speech system
Tooling for the Common Objects In 3D dataset
High-Resolution Image Synthesis with Latent Diffusion Models
A Powerful Native Multimodal Model for Image Generation
Generating Immersive, Explorable, and Interactive 3D Worlds
Repo for SeedVR2 & SeedVR
Fast-stable-diffusion + DreamBooth
VMZ: Model Zoo for Video Modeling
CLIP, Predict the most relevant text snippet given an image
Bidirectional token-classification model for identifiable info
Inference script for Oasis 500M