Provides convenient access to the Anthropic REST API from any Python 3
Generating Immersive, Explorable, and Interactive 3D Worlds
Ling is a MoE LLM provided and open-sourced by InclusionAI
Chinese and English multimodal conversational language model
Research code artifacts for Code World Model (CWM)
A Unified Framework for Text-to-3D and Image-to-3D Generation
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Revolutionizing Database Interactions with Private LLM Technology
Recovering the Visual Space from Any Views
Diversity-driven optimization and large-model reasoning ability
Project Lyra: Open Generative 3D World Models
Ultra-Efficient LLMs on End Device
A PyTorch library for implementing flow matching algorithms
Official implementation of DreamCraft3D
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Large Multimodal Models for Video Understanding and Editing
An AI-powered security review GitHub Action using Claude
Generate Any 3D Scene in Seconds
The Clay Foundation Model - An open source AI model and interface
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
An Efficient Agentic Model for Computer Use
This repository contains the official implementation of FastVLM
Open Source Speech Language Model
Open-source industrial-grade ASR models
Fast-stable-diffusion + DreamBooth