Embed images and sentences into fixed-length vectors
textgen, Text Generation models
Framework that is dedicated to making neural data processing
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Voice dialogue, role-playing, multi-topic discussion, picture creation
Generate 3D objects conditioned on text or images
Unofficial Parallel WaveGAN
AI-based tool for removing hardsubs and text-like watermarks
Latent Diffusion and Stable Diffusion Implementation
CLIP + FFT/DWT/RGB = text to image/video
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
CSAw is an NLP framework for low-resource languages
The first Chinese LLaMA2 model in the open source community
Text-to-Image generation. The repo for NeurIPS 2021 paper
Clarity in the current fast-paced mess of Open Source innovation
Label, clean and enrich text datasets with LLMs
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting
SoftVC VITS Singing Voice Conversion
Let us control diffusion models
Application that simplifies the installation of AI-related projects
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Microsoft speech synthesis tool, built with Electron
Implementation of MusicLM music generation model in Pytorch
Basaran, an open-source alternative to the OpenAI text completion API