tiktoken is a fast BPE tokeniser for use with OpenAI's models
Qwen2.5-VL is the multimodal large language model series
Large-language-model & vision-language-model based on Linear Attention
Unified Multimodal Understanding and Generation Models
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Robust BERT-based model for English with improved MLM training