Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Stable Diffusion built-in to Blender
Finding the Scaling Law of Agents. A multi-agent framework
Audio foundation model excelling in audio understanding
Automated translation solution for visual novels
The AI toolkit for the AI developer
Generating Immersive, Explorable, and Interactive 3D Worlds
A modular high-level library to train embodied AI agents
Qwen-Image is a powerful image generation foundation model
MoBA: Mixture of Block Attention for Long-Context LLMs
Training Large Language Model to Reason in a Continuous Latent Space
950 line, minimal, extensible LLM inference engine built from scratch
Photorealistic Synthetic Dataset for Holistic Indoor Scene
Tooling for the Common Objects In 3D dataset
Empowering Code Generation with OSS-Instruct
ICLR2024 Spotlight: curation/training code, metadata, distribution
New family of code large language models (LLMs)
This repo contains the code for 1D tokenizer and generator
A Universal Customization Method for Single and Multi Conditioning
Deep learning library
Capable of understanding text, audio, vision, video
Official Code for DragGAN (SIGGRAPH 2023)
Official Implementation of "Graph of Thoughts
Code release for "Detecting Twenty-thousand Classes
Lightweight anchor-free object detection model