Seamlessly extend your preferred base images to be Lambda-compatible
Code for running inference with the SAM 3D Body Model 3DB
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
GPT4V-level open-source multi-modal model based on Llama3-8B
Generating Immersive, Explorable, and Interactive 3D Worlds
Official Python inference and LoRA trainer package
An open-source photo thumbnail service by globo.com
Diffusion Transformer with Fine-Grained Chinese Understanding
Chinese and English multimodal conversational language model
An unsupervised and free tool for image and video dataset analysis
Train machine learning models within Docker containers
Let's make video diffusion practical
Automatically find issues in image datasets
Implementation of Imagen, Google's Text-to-Image Neural Network
Fast image augmentation library and an easy-to-use wrapper
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
Fast-stable-diffusion + DreamBooth
An open source implementation of CLIP
21 Lessons, Get Started Building with Generative AI
A Pioneering Open-Source Alternative to GPT-4o
Towards Real-World Vision-Language Understanding
Offline inference engine for art, real-time voice conversations
Implementation of a U-net complete with efficient attention
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Dealing with all unstructured data, such as reverse image search