Python SDK for Claude Agent
Ling is a MoE LLM provided and open-sourced by InclusionAI
Wan2.2: Open and Advanced Large-Scale Video Generative Model
OCR expert VLM powered by Hunyuan's native multimodal architecture
A Powerful Native Multimodal Model for Image Generation
From Images to High-Fidelity 3D Assets
Visual Causal Flow
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
code for Mesh R-CNN, ICCV 2019
Inference script for Oasis 500M
Long-form streaming TTS system for multi-speaker dialogue generation
Large Multimodal Models for Video Understanding and Editing
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
ChatGPT interface with better UI
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Code for the paper Hybrid Spectrogram and Waveform Source Separation
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)