Provides convenient access to the Anthropic REST API from any Python 3
Easy Docker setup for Stable Diffusion with user-friendly UI
Inference script for Oasis 500M
A Systematic Framework for Interactive World Modeling
A Unified Framework for Text-to-3D and Image-to-3D Generation
Foundational Models for State-of-the-Art Speech and Text Translation
RGBD video generation model conditioned on camera input
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Safety reasoning models built upon gpt-oss
The ChatGPT Retrieval Plugin lets you easily find personal documents
Open-source large language model by Alibaba
Detect faces in an image
Open Multilingual Multimodal Chat LMs
Let us control diffusion models
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
Dia-1.6B generates lifelike English dialogue and vocal expressions
Multimodal 7B model for image, video, and text understanding tasks
685B model with improved agents and consistency