Deploy your agentic workflows to production
[NeurIPS2025 Spotlight] Quantized Attention
Open-source evaluation toolkit of large multi-modality models (LMMs)
Unified framework for building enterprise RAG pipelines
Streamlines and simplifies prompt design for both developers
Learning to Reason with Search for LLMs via Reinforcement Learning
Scalable RL solution for advanced reasoning of language models
Unleashing 10,000+ Word Generation from Long Context LLMs
SDG is a specialized framework
Document (PDF, Word, PPTX ...) extraction and parsing API
Performance-optimized AI inference on your GPUs
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Structured data extraction and instruction calling with ML, LLM
Open Source Deep Research Alternative to Reason and Search
95% token savings. 155x faster queries. 16 languages
A security scanner for custom LLM applications
Tools for merging pretrained large language models
This repository provides an advanced RAG
A Pioneering Open-Source Alternative to GPT-4o
Automatic question answering for local knowledge bases based on LLMs
Tool-integrated Reasoning LLM Agents
AIlice is a fully autonomous, general-purpose AI agent
Repo for YaYi Chinese LLMs based on LLaMA2 & BLOOM
Serving multiple LoRA-finetuned LLMs as one
Run Mixtral-8x7B models in Colab or consumer desktops