A Universal Customization Method for Single and Multi Conditioning
A Unified Framework for Image Customization
Multi-Agent daTa geneRation Infra and eXperimentation framework
Bailing is a voice dialogue robot similar to GPT-4o
Build Vision Agents quickly with any model or video provider
An Open Source text-to-speech system built by inverting Whisper
Lightning-fast, on-device TTS, running natively via ONNX
MARS5 speech model (TTS) from CAMB.AI
A command-line utility for taking automated screenshots of websites
This repository provides an advanced RAG
Learn AI and LLMs from scratch using free resources
MetricFlow allows you to define, build, and maintain metrics in code
An MCP server that autonomously evaluates web applications
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Repo of Qwen2-Audio chat & pretrained large audio language model
Helping you get the most out of AWS, wherever you use MCP
A distributed and persistent archive replay system using IPFS
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
The data structure for multimodal data
Command-line tool to delete merged Git branches
Django friendly finite state machine support
Misago is fully featured modern forum application
Enabling PyTorch on Google TPU