A series of math-specific large language models of our Qwen2 series
Multi-Agents LLM Financial Trading Framework
TokenSpeed is a speed-of-light LLM inference engine
Open-source evaluation toolkit of large multi-modality models (LMMs)
A frontier, first-principles handbook
Multilingual sentence & image embeddings with BERT
A Telegram bot for Large Language Models
State-of-the-art Parameter-Efficient Fine-Tuning
Your Personal Research Multi-Tool
NBA sports betting using machine learning
Free ChatGPT&DeepSeek API Key
Language-model investigation agent with a terminal UI
Concatenate a directory full of files into a single prompt
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Low-latency REST API for serving text-embeddings
Bridging LLM and Recommender System
Minimal reproduction of OneRec
A powerful tool for automated LLM fuzzing
NeurIPS2025 Spotlight] Quantized Attention
General technology for enabling AI capabilities w/ LLMs and MLLMs
Large-language-model & vision-language-model based on Linear Attention