Python inference and LoRA trainer package for the LTX-2 audio–video
From Images to High-Fidelity 3D Assets
Awesome multilingual OCR toolkits based on PaddlePaddle
Fast stable diffusion on CPU and AI PC
Instructions on how to use the Realtime API on Microcontrollers
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Claude Code image, a one-stop open source transit service
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Open-Source Financial Large Language Models
A Systematic Framework for Interactive World Modeling
The Clay Foundation Model - An open source AI model and interface
Open Source Speech Language Model
Extension index for stable-diffusion-webui
Ling is a MoE LLM provided and open-sourced by InclusionAI
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
An implementation of model parallel GPT-2 and GPT-3-style models
Facebook AI Research Sequence-to-Sequence Toolkit