An open-source implementation of NotebookLM with more flexibility
State-of-the-art (SoTA) text-to-video pre-trained model
Public opinion analysis system
Let's make video diffusion practical
Python SDK for Claude Agent
A robust, efficient, low-latency speech-to-text library
MCP server that integrates Confluence and Jira
A framework to enable multimodal models to operate a computer
Virtual AI anchor that combines state-of-the-art technology
Letta (formerly MemGPT) is a framework for creating LLM services
Implementation of the AudioLM audio generation model in PyTorch
A curated collection of skills for AI coding agents
Accurate × Fast × Comprehensive
HY-Motion model for 3D character animation generation
Implementation of Vision Transformer, a simple way to achieve SOTA
Real-time behaviour synthesis with MuJoCo, using Predictive Control
21 Lessons, Get Started Building with Generative AI
The open-source tool for building high-quality datasets
Conversational voice AI agents
Fast backend for long-term AI user memory via structured profiles
A text-to-speech, speech-to-text and speech-to-speech library
The repository provides code for running inference with SAM 2
Audiocraft is a library for audio processing and generation
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A Powerful Native Multimodal Model for Image Generation