Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Stable Diffusion built-in to Blender
Kimi K2 is the large language model series developed by Moonshot AI
Image generation model with single-stream diffusion transformer
Automated translation solution for visual novels
Generating Immersive, Explorable, and Interactive 3D Worlds
Audio foundation model excelling in audio understanding
950 line, minimal, extensible LLM inference engine built from scratch
The AI toolkit for the AI developer
High-performance library for gradient boosting on decision trees
Finding the Scaling Law of Agents. A multi-agent framework
Photorealistic Synthetic Dataset for Holistic Indoor Scene
C++ library for high performance inference on NVIDIA GPUs
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Qwen-Image is a powerful image generation foundation model
MoBA: Mixture of Block Attention for Long-Context LLMs
Empowering Code Generation with OSS-Instruct
ICLR2024 Spotlight: curation/training code, metadata, distribution
New family of code large language models (LLMs)
A modular high-level library to train embodied AI agents
Capable of understanding text, audio, vision, video
This repo contains the code for 1D tokenizer and generator
A Universal Customization Method for Single and Multi Conditioning
Deep learning library
Autonomous novel writing CLI AI Agent — agents write, audit, and revise novels with human review gates