High-Resolution Image Synthesis with Latent Diffusion Models
Implementation of MusicLM music generation model in PyTorch
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Let us control diffusion models
Application that simplifies the installation of AI-related projects
Basaran, an open-source alternative to the OpenAI text completion API
Overcoming Data Limitations for High-Quality Video Diffusion Models
mice stt tts
Inference code for Llama models
Unified embedding model
A tool to create the analytical index of a manuscript
Convert an image to text to spot intelligible words
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Open source annotation tool for machine learning practitioners
Resources, corpora, and tools for Chinese natural language processing
A graphical manager for Ollama that can manage your LLMs
AI-powered tool to quickly and flawlessly remove watermarks from videos
An open-source framework for training large multimodal models
Ainee - AI Notetaking and Learning Companion
Implementation of Nougat Neural Optical Understanding
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Python package for easily interfacing with chat apps