Diffusion Transformer with Fine-Grained Chinese Understanding
Miso TTS is an 8-billion-parameter, highly emotive text-to-speech model
Qwen-Image is a powerful image generation foundation model
RGBD video generation model conditioned on camera input
FAIR Sequence Modeling Toolkit 2
Capable of understanding text, audio, vision, and video
A state-of-the-art open visual language model
AI Suite for upscaling, interpolating & restoring images/videos
Open-source, high-performance Mixture-of-Experts large language model
StudioOllamaUI is a local, portable interface for Ollama
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
LLaMA: Open and Efficient Foundation Language Models
A latent text-to-image diffusion model
An implementation of model parallel GPT-2 and GPT-3-style models