A course of learning LLM inference serving on Apple Silicon
A series of math-specific large language models of our Qwen2 series
Data Lake for Deep Learning. Build, manage, and query datasets
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding
Committed to building an open, public welfare
Request recommended movies, TV shows and anime to Jellyseer/Overseer
GitLab automatic code review tool based on large models
LLM powered fuzzing via OSS-Fuzz
Repo of Qwen2-Audio chat & pretrained large audio language model
The official implementation of RAPTOR
AI-driven multi-agent research assistant automating hypothesis
Mastering Applied AI, One Concept at a Time
Llama Chinese community, real-time aggregation
A New Axis of Sparsity for Large Language Models
LLM training in simple, raw C/CUDA
Scalable data pre processing and curation toolkit for LLMs
User toolkit for analyzing and interfacing with Large Language Models
Qwen3-omni is a natively end-to-end, omni-modal LLM
LongBench v2 and LongBench (ACL 25'&24')
Driving with Graph Visual Question Answering
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
An Efficient Web-enhanced Question Answering System
Official Repo for ICML 2024 paper
Implementation for MatMul-free LM
CV, NLP, LLM project applications, and advanced engineering deployment