Qwen3 is the large language model series developed by Qwen team
Agentic, Reasoning, and Coding (ARC) foundation models
Interact with your documents using the power of GPT
Structured outputs for llms
Powerful AI language model (MoE) optimized for efficiency/performance
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
⚡ Building applications with LLMs through composability ⚡
An AI personal assistant for your digital brain
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Qwen3-Coder is the code version of Qwen3
lightweight package to simplify LLM API calls
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Framework and no-code GUI for fine-tuning LLMs
Ongoing research training transformer models at scale
Tongyi Deep Research, the Leading Open-source Deep Research Agent
SimpleMem: Efficient Lifelong Memory for LLM Agents
A New Axis of Sparsity for Large Language Models
GLM-4 series: Open Multilingual Multimodal Chat LMs
Inference Llama 2 in one file of pure C
A modular graph-based Retrieval-Augmented Generation (RAG) system
Replace OpenAI GPT with another LLM in your app
PandasAI is a Python library that integrates generative AI
This repository provides an advanced RAG
State-of-the-art Parameter-Efficient Fine-Tuning
Open-weight, large-scale hybrid-attention reasoning model