Implementation of a U-net complete with efficient attention
A simple but complete full-attention transformer
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Capable of understanding text, audio, vision, video
Math-specific large language models of our Qwen2 series
Qwen2.5-VL is the multimodal large language model series
A Systematic Framework for Interactive World Modeling
Easily turn large sets of image URLs into an image dataset
The most powerful and modular diffusion model GUI, API and backend
State-of-the-art 2D and 3D Face Analysis Project
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Book about interpretable machine learning
Claude Code skill that researches any topic across Reddit + X
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
One minute of voice data is enough to train a good TTS model
A framework to enable multimodal models to operate a computer
Python library for defining and optimizing mathematical expressions
Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX
Lightweight Python library for adding real-time multi-object tracking
CodeGeeX2: A More Powerful Multilingual Code Generation Model
State-of-the-art TTS model under 25MB
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Code for the paper "Language models can explain neurons in language models"
Virtual AI anchor that combines state-of-the-art technology