Chinese and English multimodal conversational language model
Tensor search for humans
Implementation of a U-net complete with efficient attention
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Multilingual sentence & image embeddings with BERT
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Fast-stable-diffusion + DreamBooth
Personalize Any Characters with a Scalable Diffusion Transformer
Dealing with all unstructured data, such as reverse image search
Expressive Portrait Image Animation for Live Streaming
ComfyUI wrapper nodes for WanVideo and related models
Run a full local LLM stack with one command using Docker
Video,audio&Files Downloader&Convert with built-in browser with AI.
CLI tool to extract (meta)data from PDF and manipulate PDF files
Open-Sora: Democratizing Efficient Video Production for All
Arch Linux installer - guided, templates etc.
Code for running inference and finetuning with SAM 3 model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Sharp Monocular View Synthesis in Less Than a Second
Simplifies the local serving of AI models from any source
Sharp Monocular Metric Depth in Less Than a Second
Convert various image, audio and video formats from your context menu.
A powerful Import and Export tool between Excel and Database
The electronic structure package for quantum computers
Make drawing and labeling bounding boxes easy as cake