Removes backgrounds from pictures. Extension for webui
Minimal scripts to run the emulator in a container for various systems
Machine Learning Pipelines for Kubeflow
tensorboard for pytorch (and chainer, mxnet, numpy, etc.)
Qwen3-omni is a natively end-to-end, omni-modal LLM
Gemma open-weight LLM library, from Google DeepMind
Music player and music library manager for Linux, Windows, and macOS
High-quality implementations of standard and SOTA methods
A fast, powerful, and simple hierarchical vision transformer
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Integrate ChatGPT into your own discord bot
CLIP + FFT/DWT/RGB = text to image/video
Tooling for the Common Objects In 3D dataset
The Markdown Editor for Linux
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
MII makes low-latency and high-throughput inference possible
An open source library for GPU-accelerated robot learning
Refer and Ground Anything Anywhere at Any Granularity
DomainBed is a suite to test domain generalization algorithms
PyTorch code and models for V-JEPA self-supervised learning from video
PyTorch code and models for the DINOv2 self-supervised learning
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Language modeling in a sentence representation space
The repository provides code for running inference with SAM 2
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning