Implementation of a U-net complete with efficient attention
A simple but complete full-attention transformer
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Capable of understanding text, audio, vision, video
A series of math-specific large language models of our Qwen2 series
Qwen2.5-VL is the multimodal large language model series
A Systematic Framework for Interactive World Modeling
Gravia is a desktop AI virtual assistant
Easily turn large sets of image urls to an image dataset
The most powerful and modular diffusion model GUI, api and backend
Open source large language model by Alibaba
State-of-the-art 2D and 3D Face Analysis Project
2^x Image Super-Resolution
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Book about interpretable machine learning
Claude Code skill that researches any topic across Reddit + X
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Client-side indecent content checking powered by TensorFlow.js
1 min voice data can also be used to train a good TTS model
A framework to enable multimodal models to operate a computer
C++ library for high performance inference on NVIDIA GPUs
A no-frills ChatGPT client for Emacs
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Python library for defining and optimizing mathematical expressions