A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
State-of-the-art Parameter-Efficient Fine-Tuning
A high-performance ML model serving framework that offers dynamic batching
Repo of Qwen2-Audio chat & pretrained large audio language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
The open source post-building layer for agents
AI-Driven Exploration in the Space of Code
Benchmark LLMs by fighting in Street Fighter 3
Recipes to train reward models for RLHF
A simple, performant and scalable JAX LLM
Leaderboard Comparing LLM Performance at Producing Hallucinations
slime is an LLM post-training framework for RL Scaling
Central interface to connect your LLMs with external data
The official repository for ERNIE 4.5 and ERNIEKit
Open source libraries and APIs to build custom preprocessing pipelines
Tools like a web browser, computer access and a code runner for LLMs
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Large Model Application Development Practice 1
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
NBA sports betting using machine learning
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs
A series of math-specific large language models of our Qwen2 series
Performance-optimized AI inference on your GPUs
A lightweight vLLM implementation built from scratch
MobileLLM: Optimizing Sub-billion Parameter Language Models