tiktoken is a fast BPE tokeniser for use with OpenAI's models
This repository contains the official implementation of FastVLM
Official repository for LTX-Video
Qwen2.5-VL is the multimodal large language model series
Unified Multimodal Understanding and Generation Models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Hackable and optimized Transformers building blocks
State-of-the-art (SoTA) text-to-video pre-trained model
Multi-modal large language model designed for audio understanding
Large-language-model & vision-language-model based on Linear Attention
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
A minimal PyTorch re-implementation of the OpenAI GPT