Enhances Tesseract OCR output using LLMs (local or API)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Document (PDF, Word, PPTX ...) extraction and parse API
Implementation for MatMul-free LM
Leaderboard Comparing LLM Performance at Producing Hallucinations
Skywork-R1V is an advanced multimodal AI model series
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
DepGraph: Towards Any Structural Pruning
High-performance inference framework for large language models
A dataset consists of 15,140 ChatGPT prompts from Reddit
Code and models for ICML 2024 paper, NExT-GPT
Run PyTorch LLMs locally on servers, desktop and mobile
High-performance Inference and Deployment Toolkit for LLMs and VLMs
A system for agentic LLM-powered data processing and ETL
Power CLI and Workflow manager for LLMs (core package)
Examples and tutorials to help developers build AI systems
LightLLM is a Python-based LLM (Large Language Model) inference
CV, NLP, LLM project applications, and advanced engineering deployment
Instruction-tuning LLM with Chinese Medical Knowledge
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Structured data extraction and instruction calling with ML, LLM
Robust recipes to align language models with human and AI preferences
An Open-source Framework for Data-centric Language Agents
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Open Source Deep Research Alternative to Reason and Search