State-of-the-art (SoTA) text-to-video pre-trained model
OpenDAN is an open source Personal AI OS
Taming Stable Diffusion for Lip Sync
Large Audio Language Model built for natural interactions
Python chatbot framework with Natural Language Understanding
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
RGBD video generation model conditioned on camera input
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Python package built to ease deep learning on graph
Bailing is a voice dialogue robot similar to GPT-4o
A fast library for AutoML and tuning
Real-time voice interactive digital human
AI Suite for upscaling, interpolating & restoring images/videos
Free, local, open-source AI app builder
High quality, fast, modular reference implementation of SSD in PyTorch
Embed images and sentences into fixed-length vectors
CLIP + FFT/DWT/RGB = text to image/video
A fast embedded library for approximate nearest neighbor search
Real-time music generation using stable diffusion techniques AI
Fast & easy transfer learning for NLP
Efficient 3D human pose estimation in video using 2D keypoint