Chinese and English multimodal conversational language model
Data loaders and abstractions for text and NLP
Official python implementation of UTCP. UTCP is an open standard
Solve end to end problems using Llama model family
Conversational voice AI agents
ChatGPT extension for scientific research work
Official code for Style Aligned Image Generation via Shared Attention
Embed images and sentences into fixed-length vectors
PPTAgent: Generating and Evaluating Presentations
The standard data-centric AI package for data quality and ML
Proofs, cases, concept supplements, and reference explanations
Build cross-modal and multimodal applications on the cloud
Run the Stable Diffusion releases in a Docker container
Application that simplifies the installation of AI-related projects
Text-to-Image generation. The repo for NeurIPS 2021 paper
Resources, corpora, and tools for Chinese natural language processing
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Implementation of MusicLM music generation model in Pytorch
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Inference code for Llama models
Basaran, an open-source alternative to the OpenAI text completion API
Implementation of Nougat Neural Optical Understanding
Unified embedding model
An open-source framework for training large multimodal models
Open source annotation tool for machine learning practitioners