Long-form streaming TTS system for multi-speaker dialogue generation
Image generation model with single-stream diffusion transformer
VS Code extension for LLM-assisted code/text completion
Multi-lingual large voice generation model, providing inference
Code and models for ICML 2024 paper, NExT-GPT
AI suite powered by state-of-the-art models and providing advanced AI
"Big Model" trains a visual multimodal VLM with 26M parameters
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A system for agentic LLM-powered data processing and ETL
Integrate the opencode AI assistant with Neovim
Export and Share your ChatGPT conversation history
Curated AI engineering notes on LLMs, generative models, and tools
The easiest way to use Ollama in .NET
Visual Causal Flow
Faster and easier training and deployments
Diffusion Transformer with Fine-Grained Chinese Understanding
Hands-on .NET course for building real-world generative AI apps
Shared repository for open-sourced projects from the Google AI Lang
A TTS model capable of generating ultra-realistic dialogue
Autonomous LLM agent for end-to-end data science workflows
Multilingual Document Layout Parsing in a Single Vision-Language Model
LongBench v2 and LongBench (ACL 25'&24')
Flexible Photo Recrafting While Preserving Your Identity
Large Multimodal Models for Video Understanding and Editing
Concatenate a directory full of files into a single prompt