Towards Human-Sounding Speech
A lightweight, powerful framework for multi-agent workflows
Low-code app builder for RAG and multi-agent AI applications
Message Passing Neural Networks for Molecule Property Prediction
Create videos with Stable Diffusion
Controllable & emotion-expressive zero-shot TTS
The official Python SDK for the ElevenLabs API
A text-to-speech, speech-to-text and speech-to-speech library
Use any LLMs (Large Language Models) for Deep Research
Benchmarking Multimodal Agents for Open-Ended Tasks
An open-source AI avatar generator web app
Create custom engineering agents for your codebase
TypeScript AI platform with AI chat, Autonomous agents
ESP32 Camera motion capture application to record JPEGs to SD card
Implementation of Video Diffusion Models
CUDA Templates for Linear Algebra Subroutines
The most intuitive, flexible, way for researchers to build models
An open-source AI agent that brings the power of Grok
Release for Improved Denoising Diffusion Probabilistic Models
A Unified Framework for Text-to-3D and Image-to-3D Generation
Model Context Protocol tool support for LangChain
VMAS is a vectorized differentiable simulator
Agents write python code to call tools and orchestrate other agents
AI-data warehouse to enrich, transform and analyze unstructured data
Intelligent companion for seamless AI engineering and research