Implementation of a U-net complete with efficient attention
A simple but complete full-attention transformer
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Capable of understanding text, audio, vision, video
A series of math-specific large language models of our Qwen2 series
Qwen2.5-VL is the multimodal large language model series
A Systematic Framework for Interactive World Modeling
Easily turn large sets of image urls to an image dataset
The most powerful and modular diffusion model GUI, api and backend
Gravia is a desktop AI virtual assistant
State-of-the-art 2D and 3D Face Analysis Project
Open source large language model by Alibaba
2^x Image Super-Resolution
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Book about interpretable machine learning
Claude Code skill that researches any topic across Reddit + X
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Client-side indecent content checking powered by TensorFlow.js
1 min voice data can also be used to train a good TTS model
A framework to enable multimodal models to operate a computer
C++ library for high performance inference on NVIDIA GPUs
A no-frills ChatGPT client for Emacs
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Python library for defining and optimizing mathematical expressions