State-of-the-art (SoTA) text-to-video pre-trained model
The machine learning toolkit for time series analysis in Python
The data structure for multimodal data
Benchmarking Multimodal Agents for Open-Ended Tasks
State-of-the-art diffusion models for image and audio generation
Graph Neural Network Library for PyTorch
Geometric deep learning extension library for PyTorch
Simple and easily configurable grid world environments
Deep learning optimization library making distributed training easy
Open Source Differentiable Computer Vision Library
Deep learning optimization library: makes distributed training easy
Implementation of a U-Net complete with efficient attention
Implementation of Make-A-Video, a new SoTA text-to-video generator
Implementation of Video Diffusion Models
Generate 3D objects conditioned on text or images
CLIP + FFT/DWT/RGB = text to image/video
Visual localization made easy with hloc
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
High-Resolution 3D Human Digitization from A Single Image
Point cloud diffusion for 3D model synthesis
Toolkit for developing and comparing reinforcement learning algorithms
pyntcloud is a Python library for working with 3D point clouds
Notebooks, models and techniques for the generation of AI Art
3D-aware GANs based on NeRF (arXiv)
A version of the AI art creation software based on Disco Diffusion