Designed for text embedding and ranking tasks
Repo of Qwen2-Audio chat & pretrained large audio language model
CogView4, CogView3-Plus and CogView3(ECCV 2024)
A guidance language for controlling large language models
GLM-4 series: Open Multilingual Multimodal Chat LMs
Large-language-model & vision-language-model based on Linear Attention
TigerBot: A multi-language multi-task LLM
A high-quality PDF to Markdown tool based on large language model
Open source libraries and APIs to build custom preprocessing pipelines
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Build a modern LLM from scratch. Every line commented
Semi-Structured Agentic Framework. Workflows build themselves
The official implementation of RAPTOR
Synthetic data curation for post-training and data extraction
How to optimize some algorithm in cuda
NeurIPS2025 Spotlight] Quantized Attention
Weaving the Digital Agent Galaxy
Unified framework for building enterprise RAG pipelines
A New Axis of Sparsity for Large Language Models
Ling is a MoE LLM provided and open-sourced by InclusionAI
Scalable data pre processing and curation toolkit for LLMs
Autoregressive Model Beats Diffusion
StarVector is a foundation model for SVG generation
Open Source Deep Research Alternative to Reason and Search
Accessible large language models via k-bit quantization for PyTorch