Qwen-Image is a powerful image generation foundation model
MoBA: Mixture of Block Attention for Long-Context LLMs
Training Large Language Models to Reason in a Continuous Latent Space
A 950-line, minimal, extensible LLM inference engine built from scratch
Empowering Code Generation with OSS-Instruct
Capable of understanding text, audio, vision, and video
Official Implementation of "Graph of Thoughts"
Implementation of model-parallel autoregressive transformers on GPUs
An interpretable and efficient predictor using pre-trained models