Build multimodal AI applications with cloud-native stack
Specify a github or local repo, github pull request
Modular AI runtime for robots
Using AI models to automatically provide commentary and edit videos
Official code for StoryMem: Multi-shot Long Video Storytelling
Mobile manipulation research tools for roboticists
High-resolution models for human tasks
Multimodal-Driven Architecture for Customized Video Generation
No-code AI workflow
Turn your website into a GIF
Implementation of Phenaki Video, which uses Mask GIT
Lightweight demo to build a conversational AI search engine quickly
Habit Tracker for the AI Coding Workshop
A Python package for interactive geospaital analysis and visualization
Any model. Any hardware. Zero compromise
Connect any Open Data to any LLM with Model Context Protocol
Dealing with all unstructured data, such as reverse image search
Open source clone of the Age of Empires II engine
AI video generator optimized for low VRAM and older GPUs use
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Democratizing Reinforcement Learning for LLMs
Game Boy emulator written in Python
Visual Causal Flow
Outcome driven agent development framework that evolves
Implementation of Video Diffusion Models